CUDA_开发者

Busy spin in CUDA

How can I implement a busy spin mechanism of the form while(variable == 0); where variable is updated to 1 by some other CUDA thread after some event has occured.[详细]

2023-04-13 07:45 分类：问答

Improving kernel performance by increasing occupancy?

Here is an output of Compute Visual Profiler for my kernel on GT 440: Kernel details: Grid size: [100 1 1], Block size: [256 1 1][详细]

2023-04-12 16:19 分类：问答

getting wrong values in cuda c programm [closed]

It's difficult to tell what is being asked here. This question is ambiguous, vague, incomplete, overly broad, or rhetorical andcannot be reasonably answered in its current form. For help clari[详细]

2023-04-12 08:47 分类：问答

CUDA OPENGL Interoperability: cudaGLSetGLDevice

Following the Programming Giude of CUDA 4.0, I call cudaGLSetGLDevice before any other runtime calls. But the next cuda call, cudaMalloc, return \"all CUDA-capable devices are busy or unavailable.\"[详细]

2023-04-12 03:25 分类：问答

High Performance Computing Terminology: What's a GF/s? [closed]

This question is unlikely to help any future visitors; it is only rele开发者_如何学Cvant to a small geographic area, a specific moment in time,or an extraordinarily narrow situation that is not ge[详细]

2023-04-12 03:15 分类：问答

How to create 64-bit CUDA applications? (Win7 x64, CUDA 4, VS 2010 Express)

I\'m mostly set up for CUDA development. I\'ve installed the developer drivers, CUDA 4.0 toolkit, and the 4.0 SDK, as well as the bugfix. I\'m running Windows 7 x64, and am using Visual C++ 2010 Expre[详细]

2023-04-12 02:18 分类：问答

Finding the maximum element value AND its position using CUDA Thrust

How do I get not only the v开发者_开发问答alue but also the position of the maximum (minimum) element (res.val and res.pos)?[详细]

2023-04-12 01:52 分类：问答

How much memory can I actually allocated on a cuda card

I\'m writing a server process that performs calculations on a GPU using cuda. I want to queue up in-coming requests until enough memory is available on the device to run the job, but I\'m having a har[详细]

2023-04-11 23:43 分类：问答

How can I copy the members of nested structs to a CUDA device's memory space?

I\'m trying to copy some nested structs to device memory for kernel use in a CUDA-accelerated neural network simulator. This code links and runs, but it throws some exceptions and CUDA errors:[详细]

2023-04-11 17:16 分类：问答

Atomic operations on Shared Memory in CUDA

I use a GTX 280, which has compute capability 1.3 and supports atomic operations on shared memory. I am using cuda SDK 2.2 and VS 2005. In my program I have to extensively use atomic operations becaus[详细]

2023-04-11 17:13 分类：问答