Does Blitz++ use BLAS routines when it is possible and appropriate to
I know that Blitz++ gets its performance plus by extensive usage of ex开发者_运维知识库pression templates and template metaprogramms. But at some point you can't get more out of your code by using these techniques - you have to multiply and sum some floats up. At this point you can get a final performance kick by using the highly optimized (especially for special architectures) BLAS routines. Does the current implementation of Blitz++ use BLAS routines whenever it is possible?
Only for benchmarks you must specify it when you configure blitz++:
./configure -with-blas=...
Blitz does not use Blas routines.
精彩评论