Intel® oneAPI Math Kernel Library

Performance of MKL BLAS routines vs self compiled BLAS

Morag_A_Intel
Employee

Hi

I am using BLAS in my software, mainly various GEMM and GEMV routines.

I profiled my software with Intel VTune and found that my own BLAS library (compiled with the Intel Fortran Compiler) gives 5-10% better run-time performance than Intel MKL.

Does that make sense? Is it possible that taking the BLAS sources from www.netlib.org/blas/ and compiling them myself yields a better-optimized library than Intel MKL?

 

Regards,

Morag Agmon (Intel)

Gennady_F_Intel
Moderator

Morag,

That's not expected from our side. Where do you see the 5-10% performance gap relative to MKL? Is it in a ?gemm routine? What is the problem size?

Why are you using VTune (did you use hotspot analysis?) instead of directly measuring the execution time of these routines? What CPU type are you running on?
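
For reference, direct timing of a single routine could look roughly like the sketch below. It is only an illustration, not an MKL utility: `best_time`, `gemm_gflops` and `naive_gemm` are hypothetical names, and the pure-Python triple loop is just a stand-in for the call you actually want to measure (for example a wrapper around your ?gemm call, linked once against MKL and once against the self-compiled BLAS). The idea is to run a warm-up call first, so one-time initialization is excluded, and then keep the best of several repetitions.

```python
import time


def best_time(fn, repeats=5, warmup=1):
    """Return the smallest wall-clock time in seconds over `repeats` calls of fn()."""
    for _ in range(warmup):
        fn()  # warm-up: the first call may include one-time setup cost
    best = float("inf")
    for _ in range(repeats):
        t0 = time.perf_counter()
        fn()
        best = min(best, time.perf_counter() - t0)
    return best


def gemm_gflops(m, n, k, seconds):
    """GFLOP/s for C (m x n) = A (m x k) * B (k x n), counting 2*m*n*k flops."""
    return 2.0 * m * n * k / seconds / 1e9


def naive_gemm(a, b):
    """Stand-in routine (assumption): plain triple loop on lists of lists."""
    m, k, n = len(a), len(b), len(b[0])
    c = [[0.0] * n for _ in range(m)]
    for i in range(m):
        for p in range(k):
            aip = a[i][p]
            for j in range(n):
                c[i][j] += aip * b[p][j]
    return c


if __name__ == "__main__":
    size = 64  # problem size; report it together with the timing
    a = [[1.0] * size for _ in range(size)]
    b = [[2.0] * size for _ in range(size)]
    t = best_time(lambda: naive_gemm(a, b))
    print(f"n={size}: {t:.6f} s, {gemm_gflops(size, size, size, t):.4f} GFLOP/s")
```

Running the same harness for each library over a range of problem sizes, on the same CPU and with the same thread count, would show where the gap appears.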

 
