CUDA GPU optimization
I have read that certain problems see a 100x speedup when run on an NVIDIA GPU instead of a CPU.
What are the best speedups achieved with CUDA on different problems? Please state the problem and the acceleration factor, along with links to papers if possible.
These are a few striking examples from the natural sciences:
Ab initio quantum chemistry calculation (TeraChem): up to 50x
Molecular dynamics simulations (HOOMD): up to 32x
Molecular orbitals visualization with VMD: 20x-100x
More can be found here:
http://www.nvidia.com/object/tesla_bio_workbench.html
The papers are linked from that page. Unfortunately, I cannot post
more direct links, since my status (new account) does not permit more than
one hyperlink.
Thanks.
Check out the CUDA community showcase: http://www.nvidia.com/object/cuda_showcase_html.html
Gumerov was able to speed up the fast multipole method (FMM) for the Laplace potential by up to ~70x. You can read his excellent paper here (pdf).
However, such results are usually rather meaningless. For example, the Intel Core i7 980 XE is rated at 109 GFLOPS, whereas the NVIDIA GTX 480 reaches 672 GFLOPS. If both architectures are fully utilized, the maximum achievable speedup would be about 6x. Of course, for certain problems it is easy to reach high utilization on the GPU but hard on the CPU.
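To make the arithmetic behind the "about 6 times" figure explicit, here is a minimal Python sketch. The function names are my own (hypothetical), and the model is deliberately crude: it compares peak FLOPS ratings only and ignores memory bandwidth, precision, and transfer overhead. The second function shows what a reported speedup would imply about CPU utilization if the GPU ran at full peak; treat it as an illustration, since the published speedups were not necessarily measured on these two chips.

```python
def peak_speedup_bound(gpu_gflops: float, cpu_gflops: float) -> float:
    """Upper bound on GPU-over-CPU speedup when both chips run at peak FLOPS."""
    if gpu_gflops <= 0 or cpu_gflops <= 0:
        raise ValueError("peak ratings must be positive")
    return gpu_gflops / cpu_gflops


def implied_cpu_utilization(observed_speedup: float, bound: float,
                            gpu_utilization: float = 1.0) -> float:
    """Fraction of CPU peak the baseline reached, under the crude model
    speedup = (gpu_utilization * gpu_peak) / (cpu_utilization * cpu_peak).
    gpu_utilization defaults to 1.0, which is an assumption, not a measurement.
    """
    if observed_speedup <= 0:
        raise ValueError("observed speedup must be positive")
    return gpu_utilization * bound / observed_speedup


# Peak ratings quoted above: GTX 480 at 672 GFLOPS, Core i7 980 XE at 109 GFLOPS.
bound = peak_speedup_bound(672.0, 109.0)
print(f"peak-to-peak bound: {bound:.2f}x")  # prints "peak-to-peak bound: 6.17x"

# Illustration only: a 70x claim measured against these ratings would mean
# the CPU baseline ran at under a tenth of its peak.
cpu_fraction = implied_cpu_utilization(70.0, bound)
print(f"implied CPU utilization: {cpu_fraction:.1%}")
```

In other words, under this model any speedup far above the peak ratio says more about how poorly the CPU baseline used its hardware than about the GPU itself.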