OpenCL Events and Command Queues

2023-01-12 14:04 问答作者：

I'm working on translating a CUDA application (this if you must know) to OpenCL. The original application uses the C-style CUDA API, with a single stream just to avoid the automatic busy-wait when reading the results.

Now I notice that OpenCL command queues look a lot like CUDA streams. But in the device read command, and likewise in the write and kernel execute commands, I notice parameters for events too. So I'm wondering, what does it take to execute a device write, a number of kernels (e.g. one call to one kernel then 100 call开发者_如何转开发s to another kernel), and a device read, all sequentially?

If I just enqueue them sequentially into the same queue, will they execute sequentially like they do in CUDA?
If that doesn't work, can/should I daisy-chain events, making each call's wait list the previous call's event?
Or should I add all previous events to each call's wait list, like if there's an N^2 search for dependencies or something?
Or do I just have to event.wait() for each call individually, like it says to in AMD's tutorial?

Thanks!

That depends on how you create the command queue. in clCreateCommandQueue there's a properties parameter that can contain CL_QUEUE_OUT_OF_ORDER_EXEC_MODE_ENABLE, which enables non-sequential execution in the command queue.

If that property is set, commands might execute out of order or in parallel, and the only way of synchronize them is using events.

When that property is not set, commands execute sequentially in the queue.

继续阅读：opencl parallel-processing

OpenCL Events and Command Queues

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？