Passing cuda context to worker pthreads

2023-03-24 11:23 问答作者：

I have some CUDA kernels I want to run in individual pthreads.

I basically have to have each pthread execute, say, 3 cuda kernels, and they must be executed sequentially.

I thought I would try to pass each pthread a reference to a stream, and so each of those 3 cud开发者_开发问答a kernels would all execute sequentially, in the same stream.

I could get this working with a different context for pthread, which would then execute the kernels as normal, but that seems to take a lot of overhead.

So how do I make each pthread work in the same context, concurrently with the other pthreads?

Thanks

Before CUDA 4.0, the way to access a given context from different CPU threads was to use cuCtxPopCurrent()/cuCtxPushCurrent(). A context could only be current to one CPU thread at a time.

In CUDA 4.0, you can call cudaSetDevice() in each pthread and it can be current to more than one thread at a time.

The kernel invocations will be serialized by the context in the order received, but you may have to perform CPU thread synchronization to make sure the work is submitted in the order desired.

继续阅读：cuda-context multithreading

Passing cuda context to worker pthreads

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

Best solution for private video database [closed]

王昌瑞《潜梦追凶》剧组庆生新锐演员未来可期？

Is it allowed to ask users to enter credit card details for own payment method?

Escaping "<" in Perl-generated XML

imessage会显示已读吗？