- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
There seems to be some kind of significant setup overhead with MKL routines. When doing operations on "small" (6x6) double precision complex matrices (zgemm), MKL performs somewhat worse than "regular" LAPACK/BLAS. MKL pulls ahead with larger matrices.
Unfortunately, though it is possible to "hand-code" an approach for these 6x6 matrices, a "naive" matrix multiply loop is much slower than using zgemm, MKL or not.
Andrew
Unfortunately, though it is possible to "hand-code" an approach for these 6x6 matrices, a "naive" matrix multiply loop is much slower than using zgemm, MKL or not.
Andrew
Link Copied
2 Replies
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
MKL ZGEMM performance for complex data types would not be outstanding, unless you have a recent release, with an SSE3 specific version. Even then, for such a small matrix, it may be possible to write an in-line version which is faster. If you are able to use the MKL 8.0, with the version optimized for your processor, and you still find this slowdown, please submit a problem report on premier.intel.com.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Sorry,
I should have said I am using MKL 8.0 on a Pentium D
I should have said I am using MKL 8.0 on a Pentium D

Reply
Topic Options
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page