<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic in addition to this routine, in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/lapack-function-cpbtrs-slower-in-mkl-18-0-vs-14-0/m-p/1160987#M27915</link>
    <description>&lt;P&gt;In addition to this routine, some performance degradation in MKL PARDISO has been fixed in MKL 2019.&lt;/P&gt;</description>
    <pubDate>Tue, 20 Nov 2018 11:51:30 GMT</pubDate>
    <dc:creator>Gennady_F_Intel</dc:creator>
    <dc:date>2018-11-20T11:51:30Z</dc:date>
    <item>
      <title>lapack function cpbtrs slower in mkl 18.0 vs 14.0</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/lapack-function-cpbtrs-slower-in-mkl-18-0-vs-14-0/m-p/1160980#M27908</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;

&lt;P&gt;I am seeing a slowdown of the cpbtrs function in MKL 18.0 compared with MKL 14.0. My system is a Xeon E3-1240 v3.&lt;/P&gt;

&lt;P&gt;The following (single-threaded) code seems to run more than 2x slower with 18.0:&lt;/P&gt;

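&lt;PRE class="brush:fortran; class-name:dark;"&gt;         program cpbtrs_timing
         implicit none
         integer :: niter, n, nbd, ldb, iter, status
         complex, allocatable :: a(:,:), b(:,:)
         external CPBTRS

         niter = 100000    ! number of independent solves
         n   = 60          ! matrix order
         nbd = 11          ! rows of band storage (kd + 1)
         ldb = 181         ! leading dimension of b (must be at least n)

         allocate(a(nbd*n,niter))
         allocate(b(ldb,niter))

         a = cmplx(0.1,0.1)
         b = cmplx(0.5,0.5)

         ! solve with the banded factor stored in a(:,iter); kd = nbd - 1, nrhs = 1
         do iter = 1 ,niter
            call CPBTRS('U', n,nbd -1, 1, a(:,iter), nbd, b(:,iter),ldb, status)
         enddo
         end program cpbtrs_timing&lt;/PRE&gt;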

&lt;P&gt;The link command I used with ifort 18.0:&lt;/P&gt;

&lt;PRE class="brush:; class-name:dark;"&gt;$INTEL_HOME/ifort -I$MKL_HOME/include/ cpbtrs.f90 
-Wl,--start-group -Wl,-Bstatic -L$MKL_HOME_LIB/lib 
-lmkl_core -lmkl_intel_lp64 -lmkl_intel_thread -lmkl_lapack95_lp64 -liomp5 -Wl,--end-group&lt;/PRE&gt;

</description>
      <pubDate>Tue, 24 Jul 2018 23:45:33 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/lapack-function-cpbtrs-slower-in-mkl-18-0-vs-14-0/m-p/1160980#M27908</guid>
      <dc:creator>gn164</dc:creator>
      <dc:date>2018-07-24T23:45:33Z</dc:date>
    </item>
    <item>
      <title>You link with threaded</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/lapack-function-cpbtrs-slower-in-mkl-18-0-vs-14-0/m-p/1160981#M27909</link>
      <description>&lt;P&gt;You are linking with the threaded version of MKL 2018. Did you set MKL_NUM_THREADS=1 to run this test in single-threaded mode?&lt;/P&gt;

&lt;P&gt;Is that Linux or Windows? Regarding MKL 14: there is no such version. Could you look in mkl.h and give us the exact version of MKL?&lt;/P&gt;

&lt;P&gt;Something like this:&lt;/P&gt;

&lt;DIV&gt;&lt;SPAN style="font-size: 13.008px;"&gt;#define __INTEL_MKL__ 11&lt;/SPAN&gt;&lt;/DIV&gt;

&lt;DIV&gt;&lt;SPAN style="font-size: 13.008px;"&gt;#define __INTEL_MKL_MINOR__ 0&lt;/SPAN&gt;&lt;/DIV&gt;

&lt;DIV&gt;&lt;SPAN style="font-size: 13.008px;"&gt;#define __INTEL_MKL_UPDATE__ 2&lt;/SPAN&gt;&lt;/DIV&gt;</description>
      <pubDate>Wed, 25 Jul 2018 02:24:42 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/lapack-function-cpbtrs-slower-in-mkl-18-0-vs-14-0/m-p/1160981#M27909</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2018-07-25T02:24:42Z</dc:date>
    </item>
    <item>
      <title> </title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/lapack-function-cpbtrs-slower-in-mkl-18-0-vs-14-0/m-p/1160982#M27910</link>
      <description>&lt;P&gt;Greetings Gennady,&lt;/P&gt;

&lt;P&gt;This is Linux. I have set MKL_NUM_THREADS to 1 but I see no difference in the timings.&lt;/P&gt;

&lt;P&gt;The versions I am comparing are:&lt;/P&gt;

&lt;P&gt;#define __INTEL_MKL__ 11&lt;/P&gt;

&lt;P&gt;#define __INTEL_MKL_MINOR__ 1&lt;/P&gt;

&lt;P&gt;#define __INTEL_MKL_UPDATE__ 1&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;#define __INTEL_MKL__ 2018&lt;/P&gt;

&lt;P&gt;#define __INTEL_MKL_MINOR__ 0&lt;/P&gt;

&lt;P&gt;#define __INTEL_MKL_UPDATE__ 2&lt;/P&gt;
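
&lt;P&gt;For reference, the three macros can be combined into a single version string. The following is only a minimal Python sketch: the helper name mkl_version and the inlined header text are illustrative assumptions, and a real mkl.h may lay these lines out differently.&lt;/P&gt;

```python
import re

# Hypothetical helper: pulls the three version macros out of header text
# and joins them as "major.minor.update".
def mkl_version(header_text):
    pattern = r"#define\s+(__INTEL_MKL(?:_MINOR|_UPDATE)?__)\s+(\d+)"
    found = dict(re.findall(pattern, header_text))
    keys = ("__INTEL_MKL__", "__INTEL_MKL_MINOR__", "__INTEL_MKL_UPDATE__")
    return ".".join(found[key] for key in keys)

# Macro values copied from the two installations compared in this post.
old_header = """
#define __INTEL_MKL__ 11
#define __INTEL_MKL_MINOR__ 1
#define __INTEL_MKL_UPDATE__ 1
"""
new_header = """
#define __INTEL_MKL__ 2018
#define __INTEL_MKL_MINOR__ 0
#define __INTEL_MKL_UPDATE__ 2
"""

print(mkl_version(old_header), "vs", mkl_version(new_header))
```

&lt;P&gt;With the values listed above, this identifies the two builds as 11.1.1 and 2018.0.2.&lt;/P&gt;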

&lt;P&gt;If it helps, here are the profiles of the test program linked against each version:&lt;/P&gt;

&lt;P&gt;&lt;STRONG&gt;&amp;nbsp;mkl 11.1.1&lt;/STRONG&gt;&lt;/P&gt;

&lt;PRE class="brush:; class-name:dark;"&gt;time   seconds   seconds    calls  ms/call  ms/call  name

40.28      0.29     0.29        1   290.00   290.00  MAIN__
26.39      0.48     0.19                             mkl_blas_avx_ctbsv_vial1
16.67      0.60     0.12                             mkl_blas_avx_xcdotc
12.50      0.69     0.09                             mkl_blas_avx_xcaxpy_a
 1.39      0.70     0.01                             mkl_blas_ctbsv
 1.39      0.71     0.01                             mkl_lapack_cpbtrs
 1.39      0.72     0.01                             mkl_lapack_ilaenv

&lt;/PRE&gt;

&lt;P&gt;&lt;STRONG&gt;&amp;nbsp;mkl 2018&lt;/STRONG&gt;&lt;/P&gt;

&lt;PRE class="brush:; class-name:dark;"&gt;14.29      0.28     0.28        1   280.00   280.00  MAIN__
13.27      0.54     0.26                             mkl_blas_avx_cgemm_pst
  8.67      0.71     0.17                             mkl_lapack_xcpbtrs
  8.16      0.87     0.16                             mkl_blas_avx_ctrmv_in
  6.63      1.00     0.13                             mkl_blas_avx_ctrsv_ucn
  5.61      1.11     0.11                             mkl_blas_avx_ctrsv_unn
  5.10      1.21     0.10                             mkl_blas_avx_xcaxpy
  5.10      1.31     0.10                             mkl_lapack_ilaenv
  4.85      1.41     0.10                             mkl_blas_avx_ctrsv
  4.08      1.49     0.08                             mkl_blas_avx_xscopy
  3.32      1.55     0.07                             mkl_blas_avx_xctrmv
  2.55      1.60     0.05                             mkl_blas_ctrsv
  2.04      1.64     0.04                             mkl_blas_cgemm
  2.04      1.68     0.04                             mkl_blas_cgemm_omp_driver_v1
  1.53      1.71     0.03                             mkl_blas_xctrmv
  1.28      1.74     0.03                             mkl_blas_avx_xccopy
  1.28      1.76     0.03                             mkl_blas_xcgemm
  1.02      1.78     0.02                             LY16_A16_j2_i8gas_1
  1.02      1.80     0.02                             mkl_blas_avx_xcgemm
  1.02      1.82     0.02                             mkl_blas_cgemm_host
  1.02      1.84     0.02                             mkl_serv_cbwr_get
  0.51      1.85     0.01                             LY16_A16_j2gas_1
  0.51      1.86     0.01                             Lend_Y16_A16_j2gas_1
  0.51      1.87     0.01                             mkl_blas_avx_cgemm_get_optimal_kernel
  0.51      1.88     0.01                             mkl_blas_avx_cgemm_zero_desc
  0.51      1.89     0.01                             mkl_blas_avx_cgemv_n_even
  0.51      1.90     0.01                             mkl_blas_avx_xcgemv
  0.51      1.91     0.01                             mkl_blas_cgemv
  0.51      1.92     0.01                             mkl_blas_ctrmv
  0.51      1.93     0.01                             mkl_blas_xcgemv
  0.51      1.94     0.01                             mkl_lapack_cpbtrs
  0.26      1.95     0.01                             mkl_blas_avx_xctrmv_in_thread
  0.26      1.95     0.01                             mkl_blas_get_kernel_api_version
  0.26      1.96     0.01                             mkl_serv_get_num_stripes
  0.26      1.96     0.01                             mkl_serv_omp_in_parallel&lt;/PRE&gt;
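
&lt;P&gt;To put a number on the difference, the two listings can be reduced to the time spent outside MAIN__. The following is a rough Python sketch, assuming the listings are gprof-style flat profiles (percent, cumulative seconds, self seconds, then the symbol name last) and assuming that the self time of MAIN__ is array setup rather than solver work. The function names are made up for illustration.&lt;/P&gt;

```python
def parse_flat_profile(text):
    """Parse flat-profile rows into (cumulative_s, self_s, name) tuples."""
    rows = []
    for line in text.strip().splitlines():
        fields = line.split()
        if not fields:
            continue
        # Columns: percent, cumulative seconds, self seconds, ..., name (last)
        rows.append((float(fields[1]), float(fields[2]), fields[-1]))
    return rows


def seconds_outside_main(rows):
    """Total sampled time minus the self time of MAIN__ (assumed to be setup)."""
    total = max(cumulative for cumulative, _, _ in rows)
    main_self = sum(self_s for _, self_s, name in rows if name == "MAIN__")
    return total - main_self


# Rows copied from the MKL 11.1.1 listing above.
OLD_PROFILE = """
40.28      0.29     0.29        1   290.00   290.00  MAIN__
26.39      0.48     0.19                             mkl_blas_avx_ctbsv_vial1
16.67      0.60     0.12                             mkl_blas_avx_xcdotc
12.50      0.69     0.09                             mkl_blas_avx_xcaxpy_a
 1.39      0.70     0.01                             mkl_blas_ctbsv
 1.39      0.71     0.01                             mkl_lapack_cpbtrs
 1.39      0.72     0.01                             mkl_lapack_ilaenv
"""

# From the MKL 2018 listing above; only the first and last rows are kept,
# because the last row already carries the cumulative total.
NEW_PROFILE = """
14.29      0.28     0.28        1   280.00   280.00  MAIN__
 0.26      1.96     0.01                             mkl_serv_omp_in_parallel
"""

old_s = seconds_outside_main(parse_flat_profile(OLD_PROFILE))
new_s = seconds_outside_main(parse_flat_profile(NEW_PROFILE))
print(f"MKL 11.1.1: {old_s:.2f} s, MKL 2018: {new_s:.2f} s, ratio {new_s / old_s:.1f}x")
```

&lt;P&gt;Under those assumptions this gives about 0.43 s for 11.1.1 against 1.68 s for 2018, a ratio of roughly 3.9x. The sampling resolution is 0.01 s, so the figure is only indicative.&lt;/P&gt;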

</description>
      <pubDate>Wed, 25 Jul 2018 21:49:27 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/lapack-function-cpbtrs-slower-in-mkl-18-0-vs-14-0/m-p/1160982#M27910</guid>
      <dc:creator>gn164</dc:creator>
      <dc:date>2018-07-25T21:49:27Z</dc:date>
    </item>
    <item>
      <title> </title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/lapack-function-cpbtrs-slower-in-mkl-18-0-vs-14-0/m-p/1160983#M27911</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;

&lt;P&gt;As a follow-up, a similar slowdown can be observed in the cpbtrf function in MKL 18.0.&lt;/P&gt;</description>
      <pubDate>Thu, 02 Aug 2018 13:05:54 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/lapack-function-cpbtrs-slower-in-mkl-18-0-vs-14-0/m-p/1160983#M27911</guid>
      <dc:creator>gn164</dc:creator>
      <dc:date>2018-08-02T13:05:54Z</dc:date>
    </item>
    <item>
      <title>we confirmed this issue the</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/lapack-function-cpbtrs-slower-in-mkl-18-0-vs-14-0/m-p/1160984#M27912</link>
      <description>&lt;P&gt;We have confirmed this issue and escalated it. This thread will be updated as soon as possible.&lt;/P&gt;</description>
      <pubDate>Fri, 03 Aug 2018 03:42:40 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/lapack-function-cpbtrs-slower-in-mkl-18-0-vs-14-0/m-p/1160984#M27912</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2018-08-03T03:42:40Z</dc:date>
    </item>
    <item>
      <title>the fix of the problem</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/lapack-function-cpbtrs-slower-in-mkl-18-0-vs-14-0/m-p/1160985#M27913</link>
      <description>&lt;P&gt;The fix for this problem is available in the latest MKL 2019 Update 1, which was released recently. Could you please try this update and let us know how it works on your side?&lt;/P&gt;</description>
      <pubDate>Mon, 19 Nov 2018 05:39:17 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/lapack-function-cpbtrs-slower-in-mkl-18-0-vs-14-0/m-p/1160985#M27913</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2018-11-19T05:39:17Z</dc:date>
    </item>
    <item>
      <title>Greetings Gennady,</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/lapack-function-cpbtrs-slower-in-mkl-18-0-vs-14-0/m-p/1160986#M27914</link>
      <description>&lt;P&gt;Greetings Gennady,&lt;/P&gt;&lt;P&gt;Thank you for the fix.&lt;/P&gt;&lt;P&gt;Do you know if there are any other MKL (LAPACK or non-LAPACK) functions that are slower in MKL 18.0 and could be affected by the fix made in MKL 2019 Update 1?&lt;/P&gt;</description>
      <pubDate>Mon, 19 Nov 2018 12:05:09 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/lapack-function-cpbtrs-slower-in-mkl-18-0-vs-14-0/m-p/1160986#M27914</guid>
      <dc:creator>gn164</dc:creator>
      <dc:date>2018-11-19T12:05:09Z</dc:date>
    </item>
    <item>
      <title>in addition to this routine,</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/lapack-function-cpbtrs-slower-in-mkl-18-0-vs-14-0/m-p/1160987#M27915</link>
      <description>&lt;P&gt;In addition to this routine, some performance degradation in MKL PARDISO has been fixed in MKL 2019.&lt;/P&gt;</description>
      <pubDate>Tue, 20 Nov 2018 11:51:30 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/lapack-function-cpbtrs-slower-in-mkl-18-0-vs-14-0/m-p/1160987#M27915</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2018-11-20T11:51:30Z</dc:date>
    </item>
  </channel>
</rss>

