Intel® oneAPI Math Kernel Library
Ask questions and share information with other developers who use Intel® Math Kernel Library.

LinX after MKL 11.2.2.010 have error!

User__Core
Beginner
2,036 Views

Hi)

LinX after MKL 11.2.2.010 have error!

w_lpk_p_11.2.2.010.zip
 
c:\test>linpack_xeon64.exe
Input data or print help ? Type [data]/help :
 
Number of equations to solve (problem size): 8135
Leading dimension of array: 8136
Number of trials to run: 10
Data alignment value (in Kbytes): 4
Current date/time: Sun Aug 05 19:59:41 2018
 
CPU frequency:    3.399 GHz
Number of CPUs: 1
Number of cores: 6
Number of threads: 12
 
Parameters are set to:
 
Number of tests: 1
Number of equations to solve (problem size) : 8135
Leading dimension of array                  : 8136
Number of trials to run                     : 10
Data alignment value (in Kbytes)            : 4
 
Maximum memory requested that can be used=1066524512, at the size=8135
 
=================== Timing linear equation system solver ===================
 
Size   LDA    Align. Time(s)    GFlops   Residual     Residual(norm) Check
8135   8136   4    2.411      148.9028 4.503162e-011 2.387924e-002   pass
8135   8136   4    2.378      150.9828 4.503162e-011 2.387924e-002   pass
8135   8136   4    2.262      158.7518 4.503162e-011 2.387924e-002   pass
8135   8136   4    2.329      154.1468 4.503162e-011 2.387924e-002   pass
8135   8136   4    2.496      143.8229 4.503162e-011 2.387924e-002   pass
8135   8136   4    2.258      159.0080 4.503162e-011 2.387924e-002   pass
8135   8136   4    2.358      152.2624 4.503162e-011 2.387924e-002   pass
8135   8136   4    2.323      154.5865 4.503162e-011 2.387924e-002   pass
8135   8136   4    2.427      147.9225 4.503162e-011 2.387924e-002   pass
8135   8136   4    2.346      153.0549 4.503162e-011 2.387924e-002   pass
 
Performance Summary (GFlops)
 
Size   LDA    Align.  Average  Maximal
8135   8136   4     152.3442 159.0080
 
Residual checks PASSED
 
End of tests
  
c:\test>
w_mklb_p_11.1.3.005.zip
 
c:\test>linpack_xeon64.exe
Input data or print help ? Type [data]/help :
 
Number of equations to solve (problem size): 8135
Leading dimension of array: 8136
Number of trials to run: 10
Data alignment value (in Kbytes): 4
Current date/time: Sun Aug 05 19:56:41 2018
 
CPU frequency:    3.399 GHz
Number of CPUs: 1
Number of cores: 6
Number of threads: 6
 
Parameters are set to:
 
Number of tests: 1
Number of equations to solve (problem size) : 8135
Leading dimension of array                  : 8136
Number of trials to run                     : 10
Data alignment value (in Kbytes)            : 4
 
Maximum memory requested that can be used=529719136, at the size=8135
 
=================== Timing linear equation system solver ===================
 
Size   LDA    Align. Time(s)    GFlops   Residual     Residual(norm) Check
8135   8136   4     2.225      161.3400 7.189260e-011 3.812300e-002   pass
8135   8136   4     2.094      171.4791 7.189260e-011 3.812300e-002   pass
8135   8136   4     1.935      185.5586 7.189260e-011 3.812300e-002   pass
8135   8136   4     2.024      177.3962 7.189260e-011 3.812300e-002   pass
8135   8136   4     2.230      160.9874 7.189260e-011 3.812300e-002   pass
8135   8136   4     2.053      174.8977 7.189260e-011 3.812300e-002   pass
8135   8136   4     2.233      160.7589 7.189260e-011 3.812300e-002   pass
8135   8136   4     1.717      209.0689 7.189260e-011 3.812300e-002   pass
8135   8136   4     2.127      168.7899 7.189260e-011 3.812300e-002   pass
8135   8136   4     2.128      168.7352 7.189260e-011 3.812300e-002   pass
 
Performance Summary (GFlops)
 
Size   LDA    Align.  Average  Maximal
8135   8136   4      173.9012 209.0689
 
Residual checks PASSED
 
End of tests
 
c:\test>
w_mklb_p_2018.2.010.zip
 
c:\test>linpack_xeon64.exe
Input data or print help ? Type [data]/help :
 
Number of equations to solve (problem size): 8135
Leading dimension of array: 8136
Number of trials to run: 10
Data alignment value (in Kbytes): 4
Current date/time: Sun Aug 05 20:09:57 2018
 
CPU frequency:    3.339 GHz
Number of CPUs: 1
Number of cores: 6
Number of threads: 6
 
Parameters are set to:
 
Number of tests: 1
Number of equations to solve (problem size) : 8135
Leading dimension of array                  : 8136
Number of trials to run                     : 10
Data alignment value (in Kbytes)            : 4
 
Maximum memory requested that can be used=529719136, at the size=8135
 
=================== Timing linear equation system solver ===================
 
Size   LDA    Align. Time(s)    GFlops   Residual     Residual(norm) Check
8135   8136   4     2.231      160.9364 6.790949e-011 3.601085e-002   pass
8135   8136   4     2.084      172.3152 6.790949e-011 3.601085e-002   pass
8135   8136   4     2.117      169.6332 6.790949e-011 3.601085e-002   pass
8135   8136   4     2.132      168.3802 6.790949e-011 3.601085e-002   pass
8135   8136   4     2.069      173.5236 6.790949e-011 3.601085e-002   pass
8135   8136   4     2.000      179.5242 6.790949e-011 3.601085e-002   pass
8135   8136   4     2.109      170.2193 6.790949e-011 3.601085e-002   pass
8135   8136   4     2.018      177.8865 6.790949e-011 3.601085e-002   pass
8135   8136   4     1.964      182.8301 6.790949e-011 3.601085e-002   pass
8135   8136   4     2.050      175.0993 6.790949e-011 3.601085e-002   pass
 
Performance Summary (GFlops)
 
Size   LDA    Align.  Average  Maximal
8135   8136   4      173.0348 182.8301
 
Residual checks PASSED
 
End of tests
 

If run w_mklb_p_2018.2.010.zip as 12 Thread or as 5, 9,10, 11,12 Threads, in any shell  will be this:

Intel(R) LINPACK 64-bit data - LinX 0.6.5

Current date/time: Sat Aug 18 00:36:31 2018

CPU frequency:    3.398 GHz
Number of CPUs: 1
Number of cores: 6
Number of threads: 12

Parameters are set to:

Number of tests: 1

Number of equations to solve (problem size) : 8135
Leading dimension of array                  : 8136
Number of trials to run                     : 10  
Data alignment value (in Kbytes)            : 4   
Maximum memory requested that can be used=529657696, at the size=8135

=================== Timing linear equation system solver ===================

Size   LDA    Align. Time(s)    GFlops   Residual     Residual(norm) Check
8135   8136   4      1.956      183.5389 6.526518e-011 3.460863e-002   pass
8135   8136   4      2.187      164.2005 6.619277e-011 3.510051e-002   pass
8135   8136   4      2.872      125.0139 6.526518e-011 3.460863e-002   pass
8135   8136   4      2.100      170.9365 5.353940e-011 2.839072e-002   pass
8135   8136   4      2.231      160.9549 7.749093e-011 4.109167e-002   pass
8135   8136   4      2.702      132.8717 6.938300e-011 3.679222e-002   pass
8135   8136   4      2.145      167.3753 6.537959e-011 3.466930e-002   pass
8135   8136   4      2.661      134.9388 6.526518e-011 3.460863e-002   pass
8135   8136   4      2.043      175.7050 6.526518e-011 3.460863e-002   pass
8135   8136   4      2.353      152.5804 6.553075e-011 3.474946e-002   pass

Performance Summary (GFlops)

Size   LDA    Align.  Average  Maximal
8135   8136   4       156.8116 183.5389

Residual checks PASSED

End of tests

Please, fix it!

In first I random , pointed out 524288 as data aligment

0 Kudos
6 Replies
Ying_H_Intel
Employee
2,036 Views

Dear users, 

​do you mean the performance  is dropped  

Or the thread number is not right one?
Number of cores: 6
Number of threads: 12  (MKL may only use the physical cpu threads by default)

​Could you please tell how do you  run w_mklb_p_2018.2.010.zip as 12 Thread or as 5, 9,10, 11,12 Threads?

yhu5@dell-r640:~/mkl2018_issue/lapackf/linpack$ ./xlinpack_xeon64
Input data or print help ? Type [data]/help :
Number of equations to solve (problem size): 8135
Leading dimension of array: 8136
Number of trials to run: 10
Data alignment value (in Kbytes): 4
Current date/time: Thu Aug 30 00:30:11 2018
CPU frequency:    2.992 GHz
Number of CPUs: 2
Number of cores: 16
Number of threads: 16
Parameters are set to:
Number of tests: 1
Number of equations to solve (problem size) : 8135
Leading dimension of array                  : 8136
Number of trials to run                     : 10
Data alignment value (in Kbytes)            : 4
Maximum memory requested that can be used=529657696, at the size=8135
=================== Timing linear equation system solver ===================
Size   LDA    Align. Time(s)    GFlops   Residual     Residual(norm) Check
8135   8136   4      1.806      198.8366 6.680470e-11 3.542500e-02   pass
8135   8136   4      1.599      224.5952 6.680470e-11 3.542500e-02   pass
8135   8136   4      1.649      217.7127 6.680470e-11 3.542500e-02   pass
8135   8136   4      1.660      216.2920 6.680470e-11 3.542500e-02   pass
8135   8136   4      1.569      228.8110 6.680470e-11 3.542500e-02   pass
8135   8136   4      1.562      229.8294 6.680470e-11 3.542500e-02   pass
8135   8136   4      1.641      218.7681 6.680470e-11 3.542500e-02   pass
8135   8136   4      1.564      229.5655 6.680470e-11 3.542500e-02   pass
8135   8136   4      1.548      231.8641 6.680470e-11 3.542500e-02   pass
8135   8136   4      1.574      228.1587 6.680470e-11 3.542500e-02   pass
Performance Summary (GFlops)
Size   LDA    Align.  Average  Maximal
8135   8136   4       222.4433 231.8641
Residual checks PASSED
End of tests


Thanks
​Ying

0 Kudos
User__Core
Beginner
2,036 Views

Oh yes , I'm can tell to you this )
Easy I'm use Shell from LinX 0.6.5
https://yadi.sk/d/6U010N0r3akPaZ
If rewrite files in folder x64 from newest versions version run w_mklb_p_2018.2.010.zip . and run Linx.exe then press settings and set value in Number of threads as 5 , 7 , 9 , 10 ,11 , 12 ,13 , 14 o any other value will be error.

Include new MKLB

0 Kudos
titov__vladimir
Beginner
2,036 Views

It’s a case of non-standard command line arguments usage.

0 Kudos
Ying_H_Intel
Employee
2,036 Views

Hi  Core, 

Do you mean the ./xlinpack_xeon64 don't use the thread you set in LinX environment,  the read color part in the image? 

I run under command line, it runs ok.   and about the threading, the main reason may be that : 

The Intel Optimized LINPACK Benchmark determines the optimal number of OpenMP threads to use. To run a different number, you can set the OMP_NUM_THREADS or MKL_NUM_THREADS environment variable inside a sample script. If you run the Intel Optimized LINPACK Benchmark without setting the number of threads, it defaults to the number of physical cores.

Best Regards,
Ying 

(Virus scan in progress ...)
User__Core
Beginner
2,036 Views

Hovewer no.
I affirm what LinX after MKL 11.1.3.305 having errors. This errors be if use any quantity of flow which different from standard quantity of cores
This not fixed and now , this easy have.
So , LinX  not for correct tests , because have not fixed errors.

0 Kudos
Gennady_F_Intel
Moderator
2,036 Views

I don't think this is a problem with mkl but if this case affects your application, please submit the case into Intel Online Service Center and describe all technical details and business needs around this question.

0 Kudos
Reply