gpgpu
Recommendation for OpenCL GPGPU [closed]
As it currently stands, this question is not a good fit for our Q&A format. We expect answers to be supported by facts, references,or expertise, but this question will likely solicit debate, a[详细]
2023-03-24 13:09 分类:问答A quick hack to sorting: am I doing this right?
I was looking into different sorting algorithms, and trying 开发者_Python百科to think how to port them to GPUs when I got this idea of sorting without actually sorting. This is how my kernel looks:[详细]
2023-03-23 17:44 分类:问答How to measure the gflops of a matrix multiplication kernel?
In the book Programming Massively Parallel Processors the number of gflops is used to compare the effi开发者_如何转开发ciency of different matrix multiplication kernels. How would I compute this for m[详细]
2023-03-23 15:41 分类:问答GPU-friendly 2D line segment intersection algorithm
I\'m looking for an algorithm that tests whether 2 line segments are intersecting in a GPU-friendly way.The line segments are in 2D.While there are many algorithms discussed on the web for doing this,[详细]
2023-03-22 05:40 分类:问答Fermi L2 cache hit latency?
Does anyone know related information about L2 cache in Fermi? I have heard that it开发者_如何学Go is as slow as global memory, and the use of L2 is just to enlarge the memory bandwidth. But I can\'t f[详细]
2023-03-21 01:23 分类:问答Disassemble an OpenCL kernel?
I\'m not sure if it\'s possible. I want to study OpenCL in-depth, so I was wondering if there is a tool to disas开发者_如何转开发semble an compiled OpenCL kernel.[详细]
2023-03-20 09:16 分类:问答Is cudamalloc slower than cudamemcpy?
i am working on a code which needs to be time efficient and thus using Cufftfor this purpose but when i try to compute fft of a very large data in parallel it is slower than cpufftw and the reason i f[详细]
2023-03-20 07:14 分类:问答Synchronizations in GPUs
I have some question about how GPUs perform synchronizations. As I know, when a warp encounters a barrier (assuming it is in OpenCL), and it knows that the other warps of the same group haven\'t been[详细]
2023-03-20 03:39 分类:问答High precision output from a GLES2 shader
I\'m doing some GPGPU stuff on a GLES2 platform that supports maximum RGBA8 render targets (iOS). I need to o开发者_开发知识库utput a vec2 in the range +/- 2.0 with as much precision as I can get, so[详细]
2023-03-19 23:41 分类:问答Number of active warps in GPU (Fermi)
I have a quick question about the active warp开发者_如何学运维s in GPU (I would prefer to know it in Fermi).[详细]
2023-03-19 17:42 分类:问答
加载中,请稍侯......