<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re:Difference in computed result with avx-512 compared to computed with avx2 in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Difference-in-computed-result-with-avx-512-compared-to-computed/m-p/1343537#M32428</link>
    <description>&lt;P&gt;This issue is closed and we will no longer respond to this thread.&amp;nbsp;If you require additional assistance from Intel, please start a new thread.&amp;nbsp;Any further interaction in this thread will be considered community only.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;BR /&gt;</description>
    <pubDate>Mon, 13 Dec 2021 03:42:19 GMT</pubDate>
    <dc:creator>Gennady_F_Intel</dc:creator>
    <dc:date>2021-12-13T03:42:19Z</dc:date>
    <item>
      <title>Difference in computed result with avx-512 compared to computed with avx2</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Difference-in-computed-result-with-avx-512-compared-to-computed/m-p/1341247#M32388</link>
      <description>&lt;P&gt;I am working on a computation intensive application which uses Intel mkl library for the heavy lifting.&lt;BR /&gt;We discovered that the unit tests compute slightly different results when run on a processor with AVX-512 extensions compared to when the test suite runs on a machine which only has AVX2, enough difference to trip over the accepted error bound.&lt;BR /&gt;&lt;BR /&gt;On one machine the application loads:&lt;/P&gt;
&lt;P&gt;Intel\mkl_avx512.dll (2019.0.5.1)&lt;BR /&gt;Intel\mkl_vml_avx512.dll (2019.0.5.1)&lt;/P&gt;
&lt;P&gt;On another machine it loads:&lt;BR /&gt;Intel\mkl_avx2.dll (2019.0.5.1)&lt;BR /&gt;Intel\mkl_vml_avx2.dll (2019.0.5.1)&lt;BR /&gt;&lt;BR /&gt;In both cases the OS is Windows 10 64 bit.&lt;BR /&gt;I know that by setting the environment variable&amp;nbsp;MKL_ENABLE_INSTRUCTIONS:&lt;BR /&gt;MKL_ENABLE_INSTRUCTIONS=AVX2&lt;BR /&gt;you can restrict the usage of the instruction extension set to AVX2.&lt;BR /&gt;And this&amp;nbsp; removes the difference in computed results.&lt;BR /&gt;&lt;BR /&gt;But we would like to understand what is causing the difference in computation.&lt;BR /&gt;Are there instructions in AVX-512 which give different values compared to the&amp;nbsp;&lt;BR /&gt;AVX2 counter part ?&lt;BR /&gt;Is fma implemented / used differently in the ... _512.dll's compared to the ... _avx2.dll's for version&amp;nbsp;2019.0.5.1 on&amp;nbsp;Windows 10 64 bit ?&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 03 Dec 2021 13:38:06 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Difference-in-computed-result-with-avx-512-compared-to-computed/m-p/1341247#M32388</guid>
      <dc:creator>Joe_the_dev</dc:creator>
      <dc:date>2021-12-03T13:38:06Z</dc:date>
    </item>
    <item>
      <title>Re:Difference in computed result with avx-512 compared to computed with avx2</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Difference-in-computed-result-with-avx-512-compared-to-computed/m-p/1341441#M32391</link>
      <description>&lt;P&gt;we would recommend checking the mkl developer guide as well as the knowledge base articles follow the links:&lt;/P&gt;&lt;P&gt;&lt;A href="https://www.intel.com/content/www/us/en/develop/documentation/onemkl-linux-developer-guide/top/obtaining-numerically-reproducible-results/get-started-with-conditional-num-reproducibility.html" target="_blank"&gt;https://www.intel.com/content/www/us/en/develop/documentation/onemkl-linux-developer-guide/top/obtaining-numerically-reproducible-results/get-started-with-conditional-num-reproducibility.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="https://www.intel.com/content/www/us/en/developer/articles/technical/introduction-to-the-conditional-numerical-reproducibility-cnr.html" target="_blank"&gt;https://www.intel.com/content/www/us/en/developer/articles/technical/introduction-to-the-conditional-numerical-reproducibility-cnr.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Call &lt;SPAN style="font-family: monospace;"&gt;mkl_cbwr_set(MKL_CBWR_AVX2) or &lt;/SPAN&gt;Set the environment variable: &lt;SPAN style="font-family: monospace;"&gt;export MKL_CBWR = AVX2&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: monospace;"&gt;will allow you to see the same results from run to run on avx2 and avx-512 based systems when the #of threads would be the same.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: monospace;"&gt;-Gennady&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Sat, 04 Dec 2021 06:41:52 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Difference-in-computed-result-with-avx-512-compared-to-computed/m-p/1341441#M32391</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2021-12-04T06:41:52Z</dc:date>
    </item>
    <item>
      <title>Re: Difference in computed result with avx-512 compared to computed with avx2</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Difference-in-computed-result-with-avx-512-compared-to-computed/m-p/1341851#M32393</link>
      <description>&lt;P&gt;Thanks, yes I read those and set &lt;SPAN&gt;MKL_CBWR&lt;/SPAN&gt;:&lt;BR /&gt;&lt;SPAN&gt;MKL_CBWR = AVX2,STRICT&lt;BR /&gt;&lt;/SPAN&gt;But that loads the avx2 dll's as well,&amp;nbsp;&lt;BR /&gt;so the same as with:&lt;BR /&gt;&lt;SPAN&gt;MKL_ENABLE_INSTRUCTIONS=AVX2.&lt;/SPAN&gt;&lt;BR /&gt;But is it not possible to get a better matching result, but computed with the avx512 dll's ?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 06 Dec 2021 19:46:07 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Difference-in-computed-result-with-avx-512-compared-to-computed/m-p/1341851#M32393</guid>
      <dc:creator>Joe_the_dev</dc:creator>
      <dc:date>2021-12-06T19:46:07Z</dc:date>
    </item>
    <item>
      <title>Re:Difference in computed result with avx-512 compared to computed with avx2</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Difference-in-computed-result-with-avx-512-compared-to-computed/m-p/1341982#M32396</link>
      <description>&lt;P&gt;&amp;gt;&amp;gt; &lt;SPAN style="font-family: Arial, sans-serif; font-size: 10pt;"&gt;But is it not possible to get a better matching result, but computed with the avx512 dll's ?&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: Arial, sans-serif; font-size: 10pt;"&gt;no, it is not possible.&lt;/SPAN&gt;&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Tue, 07 Dec 2021 04:34:57 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Difference-in-computed-result-with-avx-512-compared-to-computed/m-p/1341982#M32396</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2021-12-07T04:34:57Z</dc:date>
    </item>
    <item>
      <title>Re:Difference in computed result with avx-512 compared to computed with avx2</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Difference-in-computed-result-with-avx-512-compared-to-computed/m-p/1343537#M32428</link>
      <description>&lt;P&gt;This issue is closed and we will no longer respond to this thread.&amp;nbsp;If you require additional assistance from Intel, please start a new thread.&amp;nbsp;Any further interaction in this thread will be considered community only.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Mon, 13 Dec 2021 03:42:19 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Difference-in-computed-result-with-avx-512-compared-to-computed/m-p/1343537#M32428</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2021-12-13T03:42:19Z</dc:date>
    </item>
  </channel>
</rss>

