Intel® oneAPI Math Kernel Library
Ask questions and share information with other developers who use Intel® Math Kernel Library.
6938 Discussions

LinX after MKL 11.2.2.010 have error!

User__Core
Beginner
1,319 Views

Hi)

LinX after MKL 11.2.2.010 have error!

w_lpk_p_11.2.2.010.zip
 
c:\test>linpack_xeon64.exe
Input data or print help ? Type [data]/help :
 
Number of equations to solve (problem size): 8135
Leading dimension of array: 8136
Number of trials to run: 10
Data alignment value (in Kbytes): 4
Current date/time: Sun Aug 05 19:59:41 2018
 
CPU frequency:    3.399 GHz
Number of CPUs: 1
Number of cores: 6
Number of threads: 12
 
Parameters are set to:
 
Number of tests: 1
Number of equations to solve (problem size) : 8135
Leading dimension of array                  : 8136
Number of trials to run                     : 10
Data alignment value (in Kbytes)            : 4
 
Maximum memory requested that can be used=1066524512, at the size=8135
 
=================== Timing linear equation system solver ===================
 
Size   LDA    Align. Time(s)    GFlops   Residual     Residual(norm) Check
8135   8136   4    2.411      148.9028 4.503162e-011 2.387924e-002   pass
8135   8136   4    2.378      150.9828 4.503162e-011 2.387924e-002   pass
8135   8136   4    2.262      158.7518 4.503162e-011 2.387924e-002   pass
8135   8136   4    2.329      154.1468 4.503162e-011 2.387924e-002   pass
8135   8136   4    2.496      143.8229 4.503162e-011 2.387924e-002   pass
8135   8136   4    2.258      159.0080 4.503162e-011 2.387924e-002   pass
8135   8136   4    2.358      152.2624 4.503162e-011 2.387924e-002   pass
8135   8136   4    2.323      154.5865 4.503162e-011 2.387924e-002   pass
8135   8136   4    2.427      147.9225 4.503162e-011 2.387924e-002   pass
8135   8136   4    2.346      153.0549 4.503162e-011 2.387924e-002   pass
 
Performance Summary (GFlops)
 
Size   LDA    Align.  Average  Maximal
8135   8136   4     152.3442 159.0080
 
Residual checks PASSED
 
End of tests
  
c:\test>
w_mklb_p_11.1.3.005.zip
 
c:\test>linpack_xeon64.exe
Input data or print help ? Type [data]/help :
 
Number of equations to solve (problem size): 8135
Leading dimension of array: 8136
Number of trials to run: 10
Data alignment value (in Kbytes): 4
Current date/time: Sun Aug 05 19:56:41 2018
 
CPU frequency:    3.399 GHz
Number of CPUs: 1
Number of cores: 6
Number of threads: 6
 
Parameters are set to:
 
Number of tests: 1
Number of equations to solve (problem size) : 8135
Leading dimension of array                  : 8136
Number of trials to run                     : 10
Data alignment value (in Kbytes)            : 4
 
Maximum memory requested that can be used=529719136, at the size=8135
 
=================== Timing linear equation system solver ===================
 
Size   LDA    Align. Time(s)    GFlops   Residual     Residual(norm) Check
8135   8136   4     2.225      161.3400 7.189260e-011 3.812300e-002   pass
8135   8136   4     2.094      171.4791 7.189260e-011 3.812300e-002   pass
8135   8136   4     1.935      185.5586 7.189260e-011 3.812300e-002   pass
8135   8136   4     2.024      177.3962 7.189260e-011 3.812300e-002   pass
8135   8136   4     2.230      160.9874 7.189260e-011 3.812300e-002   pass
8135   8136   4     2.053      174.8977 7.189260e-011 3.812300e-002   pass
8135   8136   4     2.233      160.7589 7.189260e-011 3.812300e-002   pass
8135   8136   4     1.717      209.0689 7.189260e-011 3.812300e-002   pass
8135   8136   4     2.127      168.7899 7.189260e-011 3.812300e-002   pass
8135   8136   4     2.128      168.7352 7.189260e-011 3.812300e-002   pass
 
Performance Summary (GFlops)
 
Size   LDA    Align.  Average  Maximal
8135   8136   4      173.9012 209.0689
 
Residual checks PASSED
 
End of tests
 
c:\test>
w_mklb_p_2018.2.010.zip
 
c:\test>linpack_xeon64.exe
Input data or print help ? Type [data]/help :
 
Number of equations to solve (problem size): 8135
Leading dimension of array: 8136
Number of trials to run: 10
Data alignment value (in Kbytes): 4
Current date/time: Sun Aug 05 20:09:57 2018
 
CPU frequency:    3.339 GHz
Number of CPUs: 1
Number of cores: 6
Number of threads: 6
 
Parameters are set to:
 
Number of tests: 1
Number of equations to solve (problem size) : 8135
Leading dimension of array                  : 8136
Number of trials to run                     : 10
Data alignment value (in Kbytes)            : 4
 
Maximum memory requested that can be used=529719136, at the size=8135
 
=================== Timing linear equation system solver ===================
 
Size   LDA    Align. Time(s)    GFlops   Residual     Residual(norm) Check
8135   8136   4     2.231      160.9364 6.790949e-011 3.601085e-002   pass
8135   8136   4     2.084      172.3152 6.790949e-011 3.601085e-002   pass
8135   8136   4     2.117      169.6332 6.790949e-011 3.601085e-002   pass
8135   8136   4     2.132      168.3802 6.790949e-011 3.601085e-002   pass
8135   8136   4     2.069      173.5236 6.790949e-011 3.601085e-002   pass
8135   8136   4     2.000      179.5242 6.790949e-011 3.601085e-002   pass
8135   8136   4     2.109      170.2193 6.790949e-011 3.601085e-002   pass
8135   8136   4     2.018      177.8865 6.790949e-011 3.601085e-002   pass
8135   8136   4     1.964      182.8301 6.790949e-011 3.601085e-002   pass
8135   8136   4     2.050      175.0993 6.790949e-011 3.601085e-002   pass
 
Performance Summary (GFlops)
 
Size   LDA    Align.  Average  Maximal
8135   8136   4      173.0348 182.8301
 
Residual checks PASSED
 
End of tests
 

If run w_mklb_p_2018.2.010.zip as 12 Thread or as 5, 9,10, 11,12 Threads, in any shell  will be this:

Intel(R) LINPACK 64-bit data - LinX 0.6.5

Current date/time: Sat Aug 18 00:36:31 2018

CPU frequency:    3.398 GHz
Number of CPUs: 1
Number of cores: 6
Number of threads: 12

Parameters are set to:

Number of tests: 1

Number of equations to solve (problem size) : 8135
Leading dimension of array                  : 8136
Number of trials to run                     : 10  
Data alignment value (in Kbytes)            : 4   
Maximum memory requested that can be used=529657696, at the size=8135

=================== Timing linear equation system solver ===================

Size   LDA    Align. Time(s)    GFlops   Residual     Residual(norm) Check
8135   8136   4      1.956      183.5389 6.526518e-011 3.460863e-002   pass
8135   8136   4      2.187      164.2005 6.619277e-011 3.510051e-002   pass
8135   8136   4      2.872      125.0139 6.526518e-011 3.460863e-002   pass
8135   8136   4      2.100      170.9365 5.353940e-011 2.839072e-002   pass
8135   8136   4      2.231      160.9549 7.749093e-011 4.109167e-002   pass
8135   8136   4      2.702      132.8717 6.938300e-011 3.679222e-002   pass
8135   8136   4      2.145      167.3753 6.537959e-011 3.466930e-002   pass
8135   8136   4      2.661      134.9388 6.526518e-011 3.460863e-002   pass
8135   8136   4      2.043      175.7050 6.526518e-011 3.460863e-002   pass
8135   8136   4      2.353      152.5804 6.553075e-011 3.474946e-002   pass

Performance Summary (GFlops)

Size   LDA    Align.  Average  Maximal
8135   8136   4       156.8116 183.5389

Residual checks PASSED

End of tests

Please, fix it!

In first I random , pointed out 524288 as data aligment

0 Kudos
6 Replies
Ying_H_Intel
Employee
1,319 Views

Dear users, 

​do you mean the performance  is dropped  

Or the thread number is not right one?
Number of cores: 6
Number of threads: 12  (MKL may only use the physical cpu threads by default)

​Could you please tell how do you  run w_mklb_p_2018.2.010.zip as 12 Thread or as 5, 9,10, 11,12 Threads?

yhu5@dell-r640:~/mkl2018_issue/lapackf/linpack$ ./xlinpack_xeon64
Input data or print help ? Type [data]/help :
Number of equations to solve (problem size): 8135
Leading dimension of array: 8136
Number of trials to run: 10
Data alignment value (in Kbytes): 4
Current date/time: Thu Aug 30 00:30:11 2018
CPU frequency:    2.992 GHz
Number of CPUs: 2
Number of cores: 16
Number of threads: 16
Parameters are set to:
Number of tests: 1
Number of equations to solve (problem size) : 8135
Leading dimension of array                  : 8136
Number of trials to run                     : 10
Data alignment value (in Kbytes)            : 4
Maximum memory requested that can be used=529657696, at the size=8135
=================== Timing linear equation system solver ===================
Size   LDA    Align. Time(s)    GFlops   Residual     Residual(norm) Check
8135   8136   4      1.806      198.8366 6.680470e-11 3.542500e-02   pass
8135   8136   4      1.599      224.5952 6.680470e-11 3.542500e-02   pass
8135   8136   4      1.649      217.7127 6.680470e-11 3.542500e-02   pass
8135   8136   4      1.660      216.2920 6.680470e-11 3.542500e-02   pass
8135   8136   4      1.569      228.8110 6.680470e-11 3.542500e-02   pass
8135   8136   4      1.562      229.8294 6.680470e-11 3.542500e-02   pass
8135   8136   4      1.641      218.7681 6.680470e-11 3.542500e-02   pass
8135   8136   4      1.564      229.5655 6.680470e-11 3.542500e-02   pass
8135   8136   4      1.548      231.8641 6.680470e-11 3.542500e-02   pass
8135   8136   4      1.574      228.1587 6.680470e-11 3.542500e-02   pass
Performance Summary (GFlops)
Size   LDA    Align.  Average  Maximal
8135   8136   4       222.4433 231.8641
Residual checks PASSED
End of tests


Thanks
​Ying

0 Kudos
User__Core
Beginner
1,319 Views

Oh yes , I'm can tell to you this )
Easy I'm use Shell from LinX 0.6.5
https://yadi.sk/d/6U010N0r3akPaZ
If rewrite files in folder x64 from newest versions version run w_mklb_p_2018.2.010.zip . and run Linx.exe then press settings and set value in Number of threads as 5 , 7 , 9 , 10 ,11 , 12 ,13 , 14 o any other value will be error.

Include new MKLB

0 Kudos
titov__vladimir
Beginner
1,319 Views

It’s a case of non-standard command line arguments usage.

0 Kudos
Ying_H_Intel
Employee
1,319 Views

Hi  Core, 

Do you mean the ./xlinpack_xeon64 don't use the thread you set in LinX environment,  the read color part in the image? 

I run under command line, it runs ok.   and about the threading, the main reason may be that : 

The Intel Optimized LINPACK Benchmark determines the optimal number of OpenMP threads to use. To run a different number, you can set the OMP_NUM_THREADS or MKL_NUM_THREADS environment variable inside a sample script. If you run the Intel Optimized LINPACK Benchmark without setting the number of threads, it defaults to the number of physical cores.

Best Regards,
Ying 

User__Core
Beginner
1,319 Views

Hovewer no.
I affirm what LinX after MKL 11.1.3.305 having errors. This errors be if use any quantity of flow which different from standard quantity of cores
This not fixed and now , this easy have.
So , LinX  not for correct tests , because have not fixed errors.

0 Kudos
Gennady_F_Intel
Moderator
1,319 Views

I don't think this is a problem with mkl but if this case affects your application, please submit the case into Intel Online Service Center and describe all technical details and business needs around this question.

0 Kudos
Reply