<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Is DFTI_NUMBER_OF_TRANSFORMS data-parallel? in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Is-DFTI-NUMBER-OF-TRANSFORMS-data-parallel/m-p/1174397#M28825</link>
    <description>&lt;P&gt;If I set DFTI_NUMBER_OF_TRANSFORMS to 4 on an AVX computer, or 8 on an AVX-512 KNL, will MKL's &lt;SPAN class="kwd"&gt;DftiComputeForward/Backward compute the FFTs of similar but independent, non-overlapping arrays simultaneously in SIMD, or sequentially one after the other?&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;Thanks&lt;/P&gt;</description>
    <pubDate>Mon, 28 Aug 2017 18:44:09 GMT</pubDate>
    <dc:creator>M_A_1</dc:creator>
    <dc:date>2017-08-28T18:44:09Z</dc:date>
    <item>
      <title>Is DFTI_NUMBER_OF_TRANSFORMS data-parallel?</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Is-DFTI-NUMBER-OF-TRANSFORMS-data-parallel/m-p/1174397#M28825</link>
      <description>&lt;P&gt;If I set DFTI_NUMBER_OF_TRANSFORMS to 4 on an AVX computer, or 8 on an AVX-512 KNL, will MKL's &lt;SPAN class="kwd"&gt;DftiComputeForward/Backward compute the FFTs of similar but independent, non-overlapping arrays simultaneously in SIMD, or sequentially one after the other?&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Mon, 28 Aug 2017 18:44:09 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Is-DFTI-NUMBER-OF-TRANSFORMS-data-parallel/m-p/1174397#M28825</guid>
      <dc:creator>M_A_1</dc:creator>
      <dc:date>2017-08-28T18:44:09Z</dc:date>
    </item>
    <item>
      <title>Hi,</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Is-DFTI-NUMBER-OF-TRANSFORMS-data-parallel/m-p/1174398#M28826</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;

&lt;P&gt;There's no direct relationship between the value of&amp;nbsp;&lt;SPAN style="font-size: 12px;"&gt;DFTI_NUMBER_OF_TRANSFORMS and SIMD. DFTI_NUMBER_OF_TRANSFORMS is for performing a number of FFTs with a single call. It is similar to writing a for loop that performs the forward/backward FFT N times.&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;MKL FFT provides configuration settings to control parallel processing. When MKL is in parallel mode, you can use&amp;nbsp;DFTI_THREAD_LIMIT to choose whether each transform of a single batched call (DFTI_NUMBER_OF_TRANSFORMS&amp;gt;1) runs in parallel or sequentially.&lt;/P&gt;

&lt;P&gt;By default, FFT processing is parallel for large sizes but sequential for small transforms. If you are computing a batch of small transforms, each transform runs sequentially. If you are computing a batch of large transforms and DFTI_THREAD_LIMIT != 1, each transform runs in parallel.&lt;/P&gt;
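
&lt;P&gt;As a rough illustration of the "one call instead of a for loop" semantics described above, here is a minimal pure-Python sketch. It is not MKL code: the helpers dft and batched_dft and their parameters are hypothetical names chosen for this example, the naive DFT only stands in for a real FFT, and the distance argument merely mirrors the idea behind DFTI_INPUT_DISTANCE. It says nothing about how MKL vectorizes or threads the work internally.&lt;/P&gt;

```python
import cmath


def dft(x):
    """Naive O(n^2) forward DFT of one sequence (stand-in for one FFT call)."""
    n = len(x)
    return [
        sum(x[j] * cmath.exp(-2j * cmath.pi * j * k / n) for j in range(n))
        for k in range(n)
    ]


def batched_dft(buf, length, howmany, distance):
    """Hypothetical batched call: transform `howmany` sequences of `length`
    elements stored in one flat buffer, each starting `distance` elements
    after the previous one."""
    out = []
    for t in range(howmany):
        start = t * distance
        out.extend(dft(buf[start:start + length]))
    return out


# Four independent, non-overlapping 8-point signals packed back to back.
length, howmany = 8, 4
buf = [complex(t + 1, j) for t in range(howmany) for j in range(length)]

# One "batched" call over the whole buffer ...
batched = batched_dft(buf, length, howmany, distance=length)

# ... versus an explicit user-side loop of single transforms.
looped = []
for t in range(howmany):
    looped.extend(dft(buf[t * length:(t + 1) * length]))

print(batched == looped)
```

&lt;P&gt;Both paths produce the same numbers. The batched form only moves the loop inside the library call, which is where a library is free to decide how to schedule the individual transforms.&lt;/P&gt;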

&lt;P&gt;Best regards,&lt;BR /&gt;
	Fiona&lt;/P&gt;</description>
      <pubDate>Wed, 30 Aug 2017 06:33:31 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Is-DFTI-NUMBER-OF-TRANSFORMS-data-parallel/m-p/1174398#M28826</guid>
      <dc:creator>Zhen_Z_Intel</dc:creator>
      <dc:date>2017-08-30T06:33:31Z</dc:date>
    </item>
    <item>
      <title>Thanks for the explanation!</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Is-DFTI-NUMBER-OF-TRANSFORMS-data-parallel/m-p/1174399#M28827</link>
      <description>&lt;P&gt;Thanks for the explanation! I'm wondering whether, for a number of small single-precision FFTs, e.g. 24x24, it would be more or less efficient to run them in parallel in SIMD (within each thread), in particular with AVX-512. Has this been investigated by Intel?&lt;/P&gt;

&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Wed, 30 Aug 2017 16:15:00 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Is-DFTI-NUMBER-OF-TRANSFORMS-data-parallel/m-p/1174399#M28827</guid>
      <dc:creator>M_A_1</dc:creator>
      <dc:date>2017-08-30T16:15:00Z</dc:date>
    </item>
    <item>
      <title>Hi,</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Is-DFTI-NUMBER-OF-TRANSFORMS-data-parallel/m-p/1174400#M28828</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;

&lt;P&gt;If you are computing a batch of small transforms, where function call overhead makes up a noticeable part of the transform time, computing the whole batch in a single call with DFTI_NUMBER_OF_TRANSFORMS&amp;nbsp;would probably be more efficient. Thanks.&lt;/P&gt;

&lt;P&gt;Best regards,&lt;BR /&gt;
	Fiona&lt;/P&gt;</description>
      <pubDate>Tue, 05 Sep 2017 02:15:41 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Is-DFTI-NUMBER-OF-TRANSFORMS-data-parallel/m-p/1174400#M28828</guid>
      <dc:creator>Zhen_Z_Intel</dc:creator>
      <dc:date>2017-09-05T02:15:41Z</dc:date>
    </item>
  </channel>
</rss>

