Intel® oneAPI Math Kernel Library
Ask questions and share information with other developers who use Intel® Math Kernel Library.
7222 Discussões

cblas_dgemm slows down a lot for Linux on Pentium 4 machine

xearthl
Principiante
1.099 Visualizações
Hi,
I've found in my benchmark that compared with MKL 7.2, in the release 10.1.1.019 for Linux, cblas_dgemm slows down a lot when matrix size is amall and beta is set to 0 on Pentium 4 machine.
Is this a known issue? Has it been fixed?
Thanks a lot!
0 Kudos
5 Respostas
TimP
Colaborador honorário III
1.099 Visualizações
Quoting - xearthl

I've found in my benchmark that compared with MKL 7.2, in the release 10.1.1.019 for Linux, cblas_dgemm slows down a lot when matrix size is amall and beta is set to 0 on Pentium 4 machine.
Is this a known issue? Has it been fixed?

Did MKL 7.2 have a library specifically optimized for P4? It might not be entirely surprising that a recent MKL was not optimized specifically for an out of production CPU. Of course, P4 covered a fairly wide range from the original 32-bit one to the later 64-bit version. "small" might be in the eyes of the beholder; I doubt there was ever an effort to optimize MKL for cases such as 6x6 or less, where MKL never could compete with Fortran MATMUL.
Artem_V_Intel
Funcionário
1.099 Visualizações
Quoting - xearthl
Hi,
I've found in my benchmark that compared with MKL 7.2, in the release 10.1.1.019 for Linux, cblas_dgemm slows down a lot when matrix size is amall and beta is set to 0 on Pentium 4 machine.
Is this a known issue? Has it been fixed?
Thanks a lot!

Hello,

First of all the newest version of MKL 10.2 Update 2 is available now at the Registration Center.

Could you tell please what is the size of your matrix?

Thanks,
Art
xearthl
Principiante
1.099 Visualizações

Hello,

First of all the newest version of MKL 10.2 Update 2 is available now at the Registration Center.

Could you tell please what is the size of your matrix?

Thanks,
Art

Hi,
The execution time doubles when the matrix is 32*32 in my benchmark.
Thanks,
xearthl
Artem_V_Intel
Funcionário
1.099 Visualizações
Hi xearthl,

I compared MKL 7.2 and MKL 10.2 Update 2 and obtained about 15% increase in preformance with MKL 10.2 Update 2.

I performed multiplication of 10000 random matrices of size 32*32 using the code like this:

start_time = dsecnd();

for(i = 0; i < 10000; i++)
{
cblas_dgemm(...);
}

end_time = dsecnd();

printf("Execution time in seconds: %fn", tend - tstart);

Could you please provide an example of code that causes the performance slow down?

Thanks,
Art

xearthl
Principiante
1.099 Visualizações

Hi Art,

Thanks a lot for your great effort!

I haven't try the latest version. But it used to only slow down when beta is set to 0.

Thanks,

xearthl

Responder