As it currently stands, this question is not a good fit for our Q&A format. We expect answers to be supported by facts, references,or expertise, but this question will likely solicit debate, a
I am programming an OpenGL3 2D Engine. Currently, I am trying to solve a bottleneck. Please hence the following output of the AMD Profiler:
Following the post , if I have header file,which has some functions implementations in it and should be included in several kernels(I mean these function are auxilary in all kernels and I don`t want t
Consider the following code: typedef float image_buffer[1024][1024]; void f(image_buffer *b) { for (int i = 0; i < 1024; i++)
I\'m looking for an algorithm that tests whether 2 line segments are intersecting in a GPU-friendly way.The line segments are in 2D.While there are many algorithms discussed on the web for doing this,
It's difficult to tell what is being asked here. This question is ambiguous, vague, incomplete, overly broad, or rhetorical andcannot be reasonably answered in its current form. For help clari
What is TEX in the object code of my OpenCL program? I put my kernel in the kernel analyzer and I saw the label TEX in the object code. I\'m no开发者_如何转开发t sure what it means. I didn\'t find the
Does anyone know related information about L2 cache in Fermi? I have heard that it开发者_如何学Go is as slow as global memory, and the use of L2 is just to enlarge the memory bandwidth. But I can\'t f
I am new to OpenCL. I have worked with OpenCL kernel with 1-D data. But when I tried to pass a 3-D pointer, it fails to b开发者_运维问答uild the kernel. To be specific I\'m getting CL_BUILD_PROGRAM_FA
Given a collection of thousands of points in 3D, I need to get the list of neighbours for each particl开发者_StackOverflowe that fall inside some cutoff value (in terms of euclidean distance), and if