
Size of statically allocated shared memory per block with Compute Prof (CUDA/OpenCL)

Developer https://www.devze.com 2023-01-19 18:08 Source: web

In Nvidia's compute prof there is a column called "static private mem per work group" and the tooltip of it says "Size of statically allocated shared memory per block". My application shows that I am getting 64 (bytes I assume) per block. Does that mean I am using somewhere between 1-64 of those bytes or is the profiler just telling me that this amount of shared memory was allocated and who knows if it was used at all?


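If it's allocated, it's probably because you used it: the figure is what the compiler reserved for the kernel, not a measurement of how many of those bytes were touched at run time. AFAIK, on devices of compute capability 1.x CUDA passes kernel parameters via shared memory (newer devices pass them via constant memory), so if you never declared any `__shared__` variables yourself, it must be that.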
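One way to cross-check the profiler's number is to ask the runtime what the compiler reserved for the kernel. The following is a minimal sketch, not the asker's code: `scaleKernel` and its 16-float `tile` array are hypothetical, chosen only so that the kernel declares 64 bytes of static shared memory. `cudaFuncGetAttributes` and the `sharedSizeBytes` field are part of the CUDA runtime API; the value reported may differ by architecture, for example if kernel parameters are also placed in shared memory.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel for illustration: 16 floats = 64 bytes of
// statically allocated shared memory declared by the programmer.
__global__ void scaleKernel(float *data, float factor)
{
    __shared__ float tile[16];
    int i = threadIdx.x;            // assumes a launch with 16 threads per block
    tile[i] = data[i] * factor;
    __syncthreads();
    data[i] = tile[15 - i];         // read back in reversed order
}

int main()
{
    // Query compile-time attributes of the kernel; no launch is needed.
    cudaFuncAttributes attr;
    cudaError_t err = cudaFuncGetAttributes(&attr, scaleKernel);
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaFuncGetAttributes failed: %s\n",
                cudaGetErrorString(err));
        return 1;
    }

    // sharedSizeBytes covers statically allocated shared memory only;
    // dynamic shared memory requested at launch time is not included.
    printf("static shared memory per block: %zu bytes\n", attr.sharedSizeBytes);
    return 0;
}
```

Compiling with `nvcc -Xptxas -v` prints a similar per-kernel summary (registers, `smem`, `cmem`) at build time, which can help tell apart shared memory you declared from shared memory the toolchain added.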
