Intel® oneAPI Math Kernel Library
Ask questions and share information with other developers who use Intel® Math Kernel Library.

Speed of DGGHRD and DGGHD3

thijssteel

I noticed something about the routines DGGHRD and DGGHD3. Both reduce a matrix pencil to Hessenberg-triangular form for use in the QZ algorithm; DGGHD3 is the blocked version.

When linked with the reference LAPACK or with OpenBLAS, DGGHD3 is faster than DGGHRD. However, when the same test program is linked with MKL, DGGHD3 loses its speed advantage. I also don't see any multicore usage in that routine when linked with MKL. Is it possible that the blocked version is not (fully) implemented in MKL?

thijssteel

Update: it looks like the block size returned by ilaenv is always 1 for DGGHD3. Changing that seems to fix the performance issue.
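To illustrate why a block size of 1 would erase the advantage, here is a small toy model. It is only a sketch, not MKL's or LAPACK's actual code. It assumes the pattern common to blocked LAPACK-style routines, where the blocked path is taken only if the reported block size is at least some minimum and smaller than the active problem size. The function name `choose_path`, the parameter names, and the default minimum of 2 are all made up for illustration.

```python
def choose_path(nb: int, nh: int, nbmin: int = 2) -> str:
    """Toy model of the blocked/unblocked dispatch (hypothetical, for illustration).

    nb    -- block size reported by the tuning query (assumed input)
    nh    -- number of active columns in the reduction
    nbmin -- assumed smallest block size for which blocking pays off
    """
    # With nb below the minimum (e.g. nb == 1), or nb not smaller than
    # the problem itself, the model falls back to the unblocked algorithm.
    if nb < nbmin or nb >= nh:
        return "unblocked"
    return "blocked"


if __name__ == "__main__":
    for nb in (1, 32):
        print(f"nb={nb}: {choose_path(nb, nh=1000)}")
```

Under these assumptions, a block size of 1 always selects the unblocked path, so the blocked routine would do essentially the same work as DGGHRD. That would be consistent with both the missing speedup and the lack of multicore usage, since the unblocked path does not go through the large matrix-matrix products that a threaded BLAS can parallelize.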
