multithreading hack

2023-01-12 19:56 问答作者：

I am writing a numerical program that should be somewhat fast, and is also multithreaded. I have a class that represents a number, and I want to use in it a random number generator. Now, I don't really require my RNG to be a true RNG, I just need it to generate integers with even distribution between 0 and NMAX.

So I have this in my class:

// use just an int here and forget about multithreading.
static uint32 rand = 开发者_运维百科NMAX/4; 
// this will be called multithreadedly
static uint32 GetRand() { return rand = ( rand + 1 ) % NMAX; }

Now, in a single threaded world, this is totally fine for my purposes.

Since this is multithreaded, I'm assuming that the only possible bad thing that could happen is that occasionally (like <1% of the time) the update gets dropped. Meaning two threads read rand, update it in a register, return the updated value, and then write it, twice, with the same value. This is totally fine.

My question is could anything worse than that happen? I'm totally fine with each thread using its own rand variable but that's just a huge pain to make happen. What I definitely can't do is make it so that each instance of the class uses its own rand variable, as that would use way too much memory.

UPDATE:

So, why do I want to do this? Full story is its a floating point class that uses 1 or 2 bytes. So it has to be fast and such, and this just seems the best way. In fact I think I'll update it from ( rand + 1 ) % NMAX to something like ( rand + [some prime] ) % NMAX since it seems to work better. This is an example of one of those cases where a more robust solution would require more code, make things less generic and more dependency ridden, make code less clear and easier to break, and all for the idea that "proper synchronization should be used".

Mostly I'm worried of some weird optimization that the compiler might do, so that an update to rand is not just dropped, but rand becomes total garbage. Now that I think about it though, even that would be okay (the way this number is used), since the next use of GetRand would %NMAX it anyway, the error would only cause at most one use of GetRand to be outside the given range of [0,NMAX). Thanks for any answers.

I'm totally fine with each thread using its own rand variable but that's just a huge pain to make happen.

It's not necessarily so difficult to do it that way. Some compilers (e.g. GCC) support thread-local storage, which allows each thread to have its own copy of a given variable.

Having said that, there's only one problem that I can think of — other than the one you mentioned — with your current way of doing it. If each thread is run on a different core, and the random value is stored in each core's non-shared cache, the updates may fail to be propagated among the cores for an indefinite amount of time. You can avoid this by using a memory barrier (which can be created via the use of locks), but this will probably be bad for performance.

For the purpose of discussion let's assume the following implementation:

The Mersenne Twister (mt19937), which generates batches of 624 random numbers per call, is used.
Each instance of your class (which is used solely inside a single thread) reads off a number from the batch and increments the global index counter so the next call (from any instance) will fetch the next number in the batch. When the global index reaches the end of array, the RNG will be locked and a new batch of 624 random numbers will be generated, after which the global index is reset.

My suggestion for improvement is for each instance to fetch, say, 16 numbers at a time. The 16 numbers do not have to be stored (copied) inside the instance: you simply increment the global index by 16 (making them unavailable to other instances), so that the instance can consume them one by one.

You may use TLS so that each thread will have "its own" variable.

__declspec(thread) static uint32 rand = NMAX/4; 
// this will be called multithreadedly
static uint32 GetRand() { return rand = ( rand + 1 ) % NMAX; }

In your particular case it's very easy to fix the code to make it thread-safe.

static long rand = NMAX/4; 
// this will be called multithreadedly
static uint32 GetRand() { return InterlockedIncrement(&rand) % NMAX; }

If two threads call GetRand at the same time, the classic unsynchronized error may happen. For example, rand = 10. After two threads call GetRand, rand is expected to be 12, but actually it may be 11. If this OK for you, you can leave this code without changes. But I think it is better to use synchronization, because without it both the code and its result look a bit strange. Another programming can think that is is bug.

Edit.

rand = ( rand + 1 ) % NMAX;

Worst case: two or more threads read the same rand variable rand from the memory. Each thread locally makes calculation ( rand + 1 ) % NMAX. Then all threads put the same result back to the memory. This doesn't corrupt the rand variable value, this doesn't cause this variable to be out of scope and number generator will continue to compute valid numbers.

继续阅读：synchronization

multithreading hack

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？