Intel® oneAPI Math Kernel Library
Ask questions and share information with other developers who use Intel® Math Kernel Library.

Small complex matrix performance issue

AndrewC
新分销商 III
1,051 次查看
There seems to be some kind of significant setup overhead with MKL routines. When doing operations on "small" (6x6) double precision complex matrices (zgemm), MKL performs somewhat worse than "regular" LAPACK/BLAS. MKL pulls ahead with larger matrices.
Unfortunately, though it is possible to "hand-code" an approach for these 6x6 matrices, a "naive" matrix multiply loop is much slower than using zgemm, MKL or not.

Andrew
0 项奖励
2 回复数
TimP
名誉分销商 III
1,051 次查看
MKL ZGEMM performance for complex data types would not be outstanding, unless you have a recent release, with an SSE3 specific version. Even then, for such a small matrix, it may be possible to write an in-line version which is faster. If you are able to use the MKL 8.0, with the version optimized for your processor, and you still find this slowdown, please submit a problem report on premier.intel.com.
0 项奖励
AndrewC
新分销商 III
1,051 次查看
Sorry,
I should have said I am using MKL 8.0 on a Pentium D
0 项奖励
回复