Intel® oneAPI Math Kernel Library
Ask questions and share information with other developers who use Intel® Math Kernel Library.
Announcements
FPGA community forums and blogs on community.intel.com are migrating to the new Altera Community and are read-only. For urgent support needs during this transition, please visit the FPGA Design Resources page or contact an Altera Authorized Distributor.

Small complex matrix performance issue

AndrewC
New Contributor III
538 Views
There seems to be some kind of significant setup overhead with MKL routines. When doing operations on "small" (6x6) double precision complex matrices (zgemm), MKL performs somewhat worse than "regular" LAPACK/BLAS. MKL pulls ahead with larger matrices.
Unfortunately, though it is possible to "hand-code" an approach for these 6x6 matrices, a "naive" matrix multiply loop is much slower than using zgemm, MKL or not.

Andrew
0 Kudos
2 Replies
TimP
Honored Contributor III
538 Views
MKL ZGEMM performance for complex data types would not be outstanding, unless you have a recent release, with an SSE3 specific version. Even then, for such a small matrix, it may be possible to write an in-line version which is faster. If you are able to use the MKL 8.0, with the version optimized for your processor, and you still find this slowdown, please submit a problem report on premier.intel.com.
0 Kudos
AndrewC
New Contributor III
538 Views
Sorry,
I should have said I am using MKL 8.0 on a Pentium D
0 Kudos
Reply