<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic MKL on multiple CPU XEON in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-on-multiple-CPU-XEON/m-p/1572440#M35830</link>
    <description>&lt;P&gt;I have issue (total loss of precision) when using MKL calling DGESV() to invert a matrix in Fortran 90 on dual Dual core XEON Gold 6248R with 1TB RAM system and compiling for multiple thread execution. For instance if I link my program using the following libraries&lt;/P&gt;&lt;P&gt;..\intel64_win\mkl_intel_lp64.lib&lt;BR /&gt;..\intel\intel64_win\mkl_core.lib&lt;BR /&gt;..\intel\intel64_win\mkl_tbb_thread_dll.lib&lt;/P&gt;&lt;P&gt;the code would run using all the computer cores and threads (96) and will only produce valid results if the size on the matrix is &amp;lt;7999 elements. At n=8000 the solution would be just noise.&lt;/P&gt;&lt;P&gt;One interesting feature if I use the exact same libraries on a different system with single CPU (ex Intel 13th gen with 64GB Ram) the code would run just fine without any loss of precision for any matrix size.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;If instead I link the code on the Dual core XEON Gold 6248R against following library&lt;/P&gt;&lt;P&gt;..\intel\intel64_win\mkl_rt.lib&lt;/P&gt;&lt;P&gt;the code would run using all the physical cores&amp;nbsp; (48) at about 80% of their speed and will produce valid results regardless the size of the matrix but at lower speed since half of the threads aren't used.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Any possible solutions on how to run my code using all the available 96 threads ?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Source code and bat files use for compilation are attached&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Thu, 15 Feb 2024 01:43:23 GMT</pubDate>
    <dc:creator>FrancoFDF</dc:creator>
    <dc:date>2024-02-15T01:43:23Z</dc:date>
    <item>
      <title>MKL on multiple CPU XEON</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-on-multiple-CPU-XEON/m-p/1572440#M35830</link>
      <description>&lt;P&gt;I have issue (total loss of precision) when using MKL calling DGESV() to invert a matrix in Fortran 90 on dual Dual core XEON Gold 6248R with 1TB RAM system and compiling for multiple thread execution. For instance if I link my program using the following libraries&lt;/P&gt;&lt;P&gt;..\intel64_win\mkl_intel_lp64.lib&lt;BR /&gt;..\intel\intel64_win\mkl_core.lib&lt;BR /&gt;..\intel\intel64_win\mkl_tbb_thread_dll.lib&lt;/P&gt;&lt;P&gt;the code would run using all the computer cores and threads (96) and will only produce valid results if the size on the matrix is &amp;lt;7999 elements. At n=8000 the solution would be just noise.&lt;/P&gt;&lt;P&gt;One interesting feature if I use the exact same libraries on a different system with single CPU (ex Intel 13th gen with 64GB Ram) the code would run just fine without any loss of precision for any matrix size.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;If instead I link the code on the Dual core XEON Gold 6248R against following library&lt;/P&gt;&lt;P&gt;..\intel\intel64_win\mkl_rt.lib&lt;/P&gt;&lt;P&gt;the code would run using all the physical cores&amp;nbsp; (48) at about 80% of their speed and will produce valid results regardless the size of the matrix but at lower speed since half of the threads aren't used.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Any possible solutions on how to run my code using all the available 96 threads ?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Source code and bat files use for compilation are attached&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 15 Feb 2024 01:43:23 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-on-multiple-CPU-XEON/m-p/1572440#M35830</guid>
      <dc:creator>FrancoFDF</dc:creator>
      <dc:date>2024-02-15T01:43:23Z</dc:date>
    </item>
    <item>
      <title>Re: MKL on multiple CPU XEON</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-on-multiple-CPU-XEON/m-p/1572865#M35841</link>
      <description>&lt;P&gt;MKL uses the physical cores by default. You see the developer guide for more details.&amp;nbsp; If you are interested in using hyperthreads also, then you could set MKL_DYNAMIC environment variables.&lt;/P&gt;
&lt;P&gt;Example, the local system, &amp;nbsp;– 40 physycal cores, 80 hyperthreading :&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Default behavior: - 40 threads have been used, see the verbose outpus:&lt;/P&gt;
&lt;P&gt;MKL_VERBOSE=2 ./a.out 2000&lt;/P&gt;
&lt;P&gt;MKL_VERBOSE oneMKL 2024.0 Product build 20231011 for Intel(R) 64 architecture Intel(R) Advanced Vector Extensions 512 (Intel(R) AVX-512) enabled processors, Lnx 2.40GHz lp64 intel_thread&lt;/P&gt;
&lt;P&gt;MKL_VERBOSE DGEMM(N,N,2000,2000,2000,0x7ffd8d611e18,0x14d86a5d4080,2000,0x14d86874f080,2000,0x7ffd8d611e28,0x14d8668ca080,2000) 65.43ms CNR:OFF Dyn:1 FastMM:1 TID:0&amp;nbsp; &lt;STRONG&gt;NThr:40&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;And here when we explicitly say MKL to use HT threads as well:&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;MKL_DYNAMIC=FALSE&lt;/STRONG&gt; MKL_VERBOSE=2 ./a.out 2000&lt;/P&gt;
&lt;P&gt;MKL_VERBOSE oneMKL 2024.0 Product build 20231011 for Intel(R) 64 architecture Intel(R) Advanced Vector Extensions 512 (Intel(R) AVX-512) enabled processors, Lnx 2.40GHz lp64 intel_thread&lt;/P&gt;
&lt;P&gt;MKL_VERBOSE DGEMM(N,N,2000,2000,2000,0x7ffe77399688,0x1526551d4080,2000,0x15265334f080,2000,0x7ffe77399698,0x1526514ca080,2000) 50.29ms CNR:OFF Dyn:0 FastMM:1 TID:0 &amp;nbsp;&lt;STRONG&gt;NThr:80&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;--Gennady&lt;/P&gt;</description>
      <pubDate>Fri, 16 Feb 2024 07:31:15 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-on-multiple-CPU-XEON/m-p/1572865#M35841</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2024-02-16T07:31:15Z</dc:date>
    </item>
    <item>
      <title>Re: MKL on multiple CPU XEON</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-on-multiple-CPU-XEON/m-p/1572867#M35842</link>
      <description>&lt;P&gt;and one more - check the current (latest ) version of MKL ( 2024.0 ) if the&amp;nbsp;&lt;SPAN&gt;DGESV() problem still exists when, as you reported, the problem size &amp;gt;8K&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 16 Feb 2024 07:33:53 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-on-multiple-CPU-XEON/m-p/1572867#M35842</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2024-02-16T07:33:53Z</dc:date>
    </item>
    <item>
      <title>Re: MKL on multiple CPU XEON</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-on-multiple-CPU-XEON/m-p/1573241#M35848</link>
      <description>&lt;P&gt;Ok thank you for both replies, to follow your latest comment instead of dealing with the old installation I update my compiler by installing the following :&lt;/P&gt;&lt;P&gt;w_Base_Kit_2024.0.1.45&lt;/P&gt;&lt;P&gt;w_fortran-compiler_p_2024.0.2.27&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;integrating with visual studio 2022. Initially the integration in visual studio fail, so following other people comments on the forum I downgraded from visual studio 2022.17.8 back to visual studio 2022.0.x.&lt;/P&gt;&lt;P&gt;After that all the installation went well no errors and no complains, but when I start visual studio I get the following message&lt;/P&gt;&lt;P&gt;The 'oneApiPackage' package did not load correctly (see attached picture) so I am wondering what went wrong ?&lt;/P&gt;&lt;P&gt;After that I will try to see if the 8k bug disappear in the new MKL&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;Franco&lt;/P&gt;</description>
      <pubDate>Sun, 18 Feb 2024 08:32:09 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-on-multiple-CPU-XEON/m-p/1573241#M35848</guid>
      <dc:creator>FrancoFDF</dc:creator>
      <dc:date>2024-02-18T08:32:09Z</dc:date>
    </item>
    <item>
      <title>Re:MKL on multiple CPU XEON</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-on-multiple-CPU-XEON/m-p/1573440#M35852</link>
      <description>&lt;P&gt;please create the new thread to follow with new topic and set the new TAG there . The current topic is resolved.&lt;/P&gt;&lt;P&gt;--Gennady&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Mon, 19 Feb 2024 05:32:32 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-on-multiple-CPU-XEON/m-p/1573440#M35852</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2024-02-19T05:32:32Z</dc:date>
    </item>
  </channel>
</rss>

