<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Performance of MKL FFTs on Westmere and Sandy Bridge in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Performance-of-MKL-FFTs-on-Westemere-and-Sandybridge/m-p/953880#M15375</link>
    <description>&lt;P&gt;Why are MKL FFTs 3-5 times faster on a Sandy Bridge node than on a Westmere node?&lt;/P&gt;
&lt;P&gt;I can think of two possible reasons:&lt;/P&gt;
&lt;P&gt;1. Sandy Bridge has more CPU cores than Westmere.&lt;/P&gt;
&lt;P&gt;2. Maybe MKL is smart enough to use the AVX support available on Sandy Bridge.&lt;/P&gt;
&lt;P&gt;Can anyone explain why the performance of MKL differs between Sandy Bridge and Westmere?&lt;/P&gt;</description>
    <pubDate>Mon, 25 Feb 2013 23:26:04 GMT</pubDate>
    <dc:creator>subramanian__vedapra</dc:creator>
    <dc:date>2013-02-25T23:26:04Z</dc:date>
    <item>
      <title>Performance of MKL FFTs on Westmere and Sandy Bridge</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Performance-of-MKL-FFTs-on-Westemere-and-Sandybridge/m-p/953880#M15375</link>
      <description>&lt;P&gt;Why are MKL FFTs 3-5 times faster on a Sandy Bridge node than on a Westmere node?&lt;/P&gt;
&lt;P&gt;I can think of two possible reasons:&lt;/P&gt;
&lt;P&gt;1. Sandy Bridge has more CPU cores than Westmere.&lt;/P&gt;
&lt;P&gt;2. Maybe MKL is smart enough to use the AVX support available on Sandy Bridge.&lt;/P&gt;
&lt;P&gt;Can anyone explain why the performance of MKL differs between Sandy Bridge and Westmere?&lt;/P&gt;</description>
      <pubDate>Mon, 25 Feb 2013 23:26:04 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Performance-of-MKL-FFTs-on-Westemere-and-Sandybridge/m-p/953880#M15375</guid>
      <dc:creator>subramanian__vedapra</dc:creator>
      <dc:date>2013-02-25T23:26:04Z</dc:date>
    </item>
    <item>
      <title>&gt;&gt;...Can anyone explain why</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Performance-of-MKL-FFTs-on-Westemere-and-Sandybridge/m-p/953881#M15376</link>
      <description>&amp;gt;&amp;gt;...Can anyone explain why the performance of MKL differs between Sandybridge and Westmere?

I'd like to request some additional information. Could you provide the exact CPU models for both systems? For example, on a Windows platform the &lt;STRONG&gt;msinfo32.exe&lt;/STRONG&gt; utility displays that information. On other platforms you could use a &lt;STRONG&gt;CPUid&lt;/STRONG&gt; utility.

Thanks in advance.</description>
      <pubDate>Wed, 27 Feb 2013 15:15:34 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Performance-of-MKL-FFTs-on-Westemere-and-Sandybridge/m-p/953881#M15376</guid>
      <dc:creator>SergeyKostrov</dc:creator>
      <dc:date>2013-02-27T15:15:34Z</dc:date>
    </item>
    <item>
      <title>&gt;&gt;&gt;Why is the performance of</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Performance-of-MKL-FFTs-on-Westemere-and-Sandybridge/m-p/953882#M15377</link>
      <description>&lt;P&gt;&amp;gt;&amp;gt;&amp;gt;Why is the performance of MKL FFTs on Sandybridge node is 3-5 times faster than Westmere node&amp;gt;&amp;gt;&amp;gt;&lt;/P&gt;
&lt;P&gt;As you stated in your post, the improvement is very likely due to the newer architecture. The main improvement is probably the wider 256-bit AVX units, which give twice the peak floating-point throughput per cycle compared with 128-bit SSE. If the uops have no interdependencies, Port 0 and Port 1 can issue one AVX multiply and one AVX add in the same cycle. This can be combined, in that same cycle, with two load/store operations performed by Port 2 and Port 3. Nehalem/Westmere could issue only one load and one store per cycle on its ports.&lt;/P&gt;</description>
      <pubDate>Thu, 28 Feb 2013 06:56:45 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Performance-of-MKL-FFTs-on-Westemere-and-Sandybridge/m-p/953882#M15377</guid>
      <dc:creator>Bernard</dc:creator>
      <dc:date>2013-02-28T06:56:45Z</dc:date>
    </item>
    <item>
      <title>&gt;&gt;...2. May be MKL is smart</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Performance-of-MKL-FFTs-on-Westemere-and-Sandybridge/m-p/953883#M15378</link>
      <description>&amp;gt;&amp;gt;...2. May be MKL is smart enough to use the &lt;STRONG&gt;AVX&lt;/STRONG&gt; support available on Sandybridge...

The &lt;STRONG&gt;Release Notes&lt;/STRONG&gt; contain many details about performance improvements in &lt;STRONG&gt;MKL&lt;/STRONG&gt;:

software.intel.com/en-us/articles/intel-mkl-110-release-notes

Unfortunately, I didn't see any notes there about the use of &lt;STRONG&gt;AVX&lt;/STRONG&gt; for &lt;STRONG&gt;FFT&lt;/STRONG&gt;.</description>
      <pubDate>Fri, 01 Mar 2013 04:41:26 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Performance-of-MKL-FFTs-on-Westemere-and-Sandybridge/m-p/953883#M15378</guid>
      <dc:creator>SergeyKostrov</dc:creator>
      <dc:date>2013-03-01T04:41:26Z</dc:date>
    </item>
    <item>
      <title>Vedaprakash,</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Performance-of-MKL-FFTs-on-Westemere-and-Sandybridge/m-p/953884#M15379</link>
      <description>&lt;P&gt;Vedaprakash,&lt;/P&gt;
&lt;P&gt;Please&amp;nbsp;provide details regarding FFT lengths, type of signal (complex/real), computing environment (cluster/single computer),&amp;nbsp;and the processors that you are comparing (e.g. "Xeon E7-8867L",&amp;nbsp;"Core&amp;nbsp;i7-3820QM", etc.)&lt;/P&gt;
&lt;P&gt;Thanks,&lt;/P&gt;
&lt;P&gt;Evgueni.&lt;/P&gt;</description>
      <pubDate>Fri, 01 Mar 2013 05:32:02 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Performance-of-MKL-FFTs-on-Westemere-and-Sandybridge/m-p/953884#M15379</guid>
      <dc:creator>Evgueni_P_Intel</dc:creator>
      <dc:date>2013-03-01T05:32:02Z</dc:date>
    </item>
  </channel>
</rss>

