CUDA
Busy spin in CUDA
How can I implement a busy spin mechanism of the form while(variable == 0); where variable is updated to 1 by some other CUDA thread after some event has occured.[详细]
2023-04-13 07:45 分类:问答Improving kernel performance by increasing occupancy?
Here is an output of Compute Visual Profiler for my kernel on GT 440: Kernel details: Grid size: [100 1 1], Block size: [256 1 1][详细]
2023-04-12 16:19 分类:问答getting wrong values in cuda c programm [closed]
It's difficult to tell what is being asked here. This question is ambiguous, vague, incomplete, overly broad, or rhetorical andcannot be reasonably answered in its current form. For help clari[详细]
2023-04-12 08:47 分类:问答CUDA OPENGL Interoperability: cudaGLSetGLDevice
Following the Programming Giude of CUDA 4.0, I call cudaGLSetGLDevice before any other runtime calls. But the next cuda call, cudaMalloc, return \"all CUDA-capable devices are busy or unavailable.\"[详细]
2023-04-12 03:25 分类:问答High Performance Computing Terminology: What's a GF/s? [closed]
This question is unlikely to help any future visitors; it is only rele开发者_如何学Cvant to a small geographic area, a specific moment in time,or an extraordinarily narrow situation that is not ge[详细]
2023-04-12 03:15 分类:问答How to create 64-bit CUDA applications? (Win7 x64, CUDA 4, VS 2010 Express)
I\'m mostly set up for CUDA development. I\'ve installed the developer drivers, CUDA 4.0 toolkit, and the 4.0 SDK, as well as the bugfix. I\'m running Windows 7 x64, and am using Visual C++ 2010 Expre[详细]
2023-04-12 02:18 分类:问答Finding the maximum element value AND its position using CUDA Thrust
How do I get not only the v开发者_开发问答alue but also the position of the maximum (minimum) element (res.val and res.pos)?[详细]
2023-04-12 01:52 分类:问答How much memory can I actually allocated on a cuda card
I\'m writing a server process that performs calculations on a GPU using cuda. I want to queue up in-coming requests until enough memory is available on the device to run the job, but I\'m having a har[详细]
2023-04-11 23:43 分类:问答How can I copy the members of nested structs to a CUDA device's memory space?
I\'m trying to copy some nested structs to device memory for kernel use in a CUDA-accelerated neural network simulator. This code links and runs, but it throws some exceptions and CUDA errors:[详细]
2023-04-11 17:16 分类:问答Atomic operations on Shared Memory in CUDA
I use a GTX 280, which has compute capability 1.3 and supports atomic operations on shared memory. I am using cuda SDK 2.2 and VS 2005. In my program I have to extensively use atomic operations becaus[详细]
2023-04-11 17:13 分类:问答