<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Using DFTI with larger numbers of processors in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Using-DFTI-with-larger-numbers-of-processors/m-p/881609#M9621</link>
    <description>
&lt;BR /&gt;The program is crashing on its first call to the Intel MKL libraries...so there is no loop to accumulate memory.&lt;BR /&gt;&lt;BR /&gt;OK, I'll try upgrading MKL.&lt;BR /&gt;</description>
    <pubDate>Fri, 20 Nov 2009 20:35:07 GMT</pubDate>
    <dc:creator>Justin_D_1</dc:creator>
    <dc:date>2009-11-20T20:35:07Z</dc:date>
    <item>
      <title>Using DFTI with larger numbers of processors</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Using-DFTI-with-larger-numbers-of-processors/m-p/881603#M9615</link>
      <description>I've written an MPI code that uses the DFTI interface to compute FFTs. It's a domain-decomposition type of problem&lt;BR /&gt;in which each processor solves its own group of FFTs. Everything works fine for NP=1,2,32,64,128 but fails&lt;BR /&gt;when NP=256 with the following error:&lt;BR /&gt; &lt;BR /&gt;DFTI_MKL_INTERNAL_ERROR&lt;BR /&gt;&lt;BR /&gt;The code I'm using is the same regardless of the number of processors (the FFT function itself is just called less often).&lt;BR /&gt;The line that fails is the commit-descriptor call, and it fails the first time it is called:&lt;BR /&gt;&lt;BR /&gt;type(DFTI_DESCRIPTOR), POINTER :: DFTI_HANDLE&lt;BR /&gt;...&lt;BR /&gt;STATUS = DftiCreateDescriptor( DFTI_HANDLE, DFTI_DOUBLE, DFTI_COMPLEX, 1, 192)&lt;BR /&gt;STATUS = DftiCommitDescriptor( DFTI_HANDLE )&lt;BR /&gt;&lt;BR /&gt;I've tried both static and dynamic linking, and neither helps. I'm using the sequential (num threads = 1) version.&lt;BR /&gt;&lt;BR /&gt;Static&lt;BR /&gt;-i_dynamic -lmkl_core -lmkl_sequential -lmkl_intel_lp64&lt;BR /&gt;&lt;BR /&gt;Dynamic&lt;BR /&gt;#$MKLPATH/libmkl_solver_lp64_sequential.a -Wl,--start-group $MKLPATH/libmkl_intel_lp64.a $MKLPATH/libmkl_sequential.a $MKLPATH/libmkl_core.a -Wl,--end-group -i_dynamic&lt;BR /&gt;&lt;BR /&gt;Also, things mostly work OK for a smaller number of FFT points (e.g. 32), but not for 192 or 256.&lt;BR /&gt;&lt;BR /&gt;I've compiled with "-check all" and nothing is flagged, so I think the code is OK.&lt;BR /&gt;&lt;BR /&gt;Does this problem sound familiar to anyone?&lt;BR /&gt;&lt;BR /&gt;thx&lt;BR /&gt;&lt;BR /&gt;jrd</description>
      <pubDate>Wed, 18 Nov 2009 22:50:31 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Using-DFTI-with-larger-numbers-of-processors/m-p/881603#M9615</guid>
      <dc:creator>Justin_D_1</dc:creator>
      <dc:date>2009-11-18T22:50:31Z</dc:date>
    </item>
    <item>
      <title>Re: Using DFTI with larger numbers of processors</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Using-DFTI-with-larger-numbers-of-processors/m-p/881604#M9616</link>
      <description>&lt;DIV style="margin: 0px; height: auto;"&gt;&lt;/DIV&gt;
&lt;BR /&gt;Davis, which MKL and MPI versions are you using? &lt;BR /&gt;--Gennady&lt;BR /&gt;</description>
      <pubDate>Thu, 19 Nov 2009 05:35:18 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Using-DFTI-with-larger-numbers-of-processors/m-p/881604#M9616</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2009-11-19T05:35:18Z</dc:date>
    </item>
    <item>
      <title>Re: Using DFTI with larger numbers of processors</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Using-DFTI-with-larger-numbers-of-processors/m-p/881605#M9617</link>
      <description>
&lt;BR /&gt;ifort Intel Fortran Compiler for applications running on Intel 64, Version 10.1 Build 20080312 Package ID: l_fc_p_10.1.015&lt;BR /&gt;&lt;BR /&gt;MKL 10.0.2.018&lt;BR /&gt;&lt;BR /&gt;MPI mvapich_intel10-0.9.9 currently, but I also tried openmpi_intel-1.2.7&lt;BR /&gt;&lt;BR /&gt;Also, I stripped out everything in my program so that all it does is commit and then free the descriptor. This does work.&lt;BR /&gt;So it looks like it could be a stack size limit problem. Within the shell my stack is unlimited, but perhaps there is&lt;BR /&gt;some stack-related environment variable that needs to be set. I tried setting KMP_STACKSIZE to a large value, per a previous post I saw:&lt;BR /&gt;&lt;BR /&gt;KMP_STACKSIZE=10000000000&lt;BR /&gt;export KMP_STACKSIZE&lt;BR /&gt;&lt;BR /&gt;but that did not help either.&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Thu, 19 Nov 2009 15:07:44 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Using-DFTI-with-larger-numbers-of-processors/m-p/881605#M9617</guid>
      <dc:creator>Justin_D_1</dc:creator>
      <dc:date>2009-11-19T15:07:44Z</dc:date>
    </item>
    <item>
      <title>Re: Using DFTI with larger numbers of processors</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Using-DFTI-with-larger-numbers-of-processors/m-p/881606#M9618</link>
      <description>&lt;DIV style="margin:0px;"&gt;&lt;/DIV&gt;
Davis,&lt;BR /&gt;&lt;BR /&gt;My question may seem strange to you but...&lt;BR /&gt;Are you sure all your MPI processes are actually running on their respective nodes?&lt;BR /&gt;To see which nodes you are actually running on, you can replace the name of your executable file with "uname -n".&lt;BR /&gt;&lt;BR /&gt;Best regards,&lt;BR /&gt;-Vladimir&lt;BR /&gt;</description>
      <pubDate>Fri, 20 Nov 2009 09:28:27 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Using-DFTI-with-larger-numbers-of-processors/m-p/881606#M9618</guid>
      <dc:creator>Vladimir_Petrov__Int</dc:creator>
      <dc:date>2009-11-20T09:28:27Z</dc:date>
    </item>
    <item>
      <title>Re: Using DFTI with larger numbers of processors</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Using-DFTI-with-larger-numbers-of-processors/m-p/881607#M9619</link>
      <description>
&lt;BR /&gt;I am already requesting that MPI provide the machine name, so I can check this fairly easily. For a 256-process simulation, I am using&lt;BR /&gt;120 unique physical machines (4 cores per machine). Of those 120 machines, the number of cores in use per machine is:&lt;BR /&gt;&lt;BR /&gt;1 core per machine: 38 machines&lt;BR /&gt;2 cores per machine: 28 machines&lt;BR /&gt;3 cores per machine: 54 machines&lt;BR /&gt;&lt;BR /&gt;Is that what you were looking for?&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Fri, 20 Nov 2009 13:27:09 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Using-DFTI-with-larger-numbers-of-processors/m-p/881607#M9619</guid>
      <dc:creator>Justin_D_1</dc:creator>
      <dc:date>2009-11-20T13:27:09Z</dc:date>
    </item>
    <item>
      <title>Re: Using DFTI with larger numbers of processors</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Using-DFTI-with-larger-numbers-of-processors/m-p/881608#M9620</link>
      <description>&lt;DIV style="margin:0px;"&gt;&lt;/DIV&gt;
&lt;BR /&gt;Davis,&lt;BR /&gt;&lt;BR /&gt;If you don't link with libiomp5, then setting KMP_STACKSIZE perhaps has no effect.&lt;BR /&gt;&lt;BR /&gt;The version of MKL that you are using has two memory leak problems in DFTI that are fixed in later releases. Hypothetically, these problems could cause DftiCommitDescriptor to produce DFTI_MKL_INTERNAL_ERROR in a long run or when memory is tight. The memory leak can accumulate only if DftiCreate/Commit/Compute/Free is called in a loop. If the descriptor is created only a few times, then this is likely not the cause. &lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;Thanks&lt;BR /&gt;Dima&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Fri, 20 Nov 2009 13:49:06 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Using-DFTI-with-larger-numbers-of-processors/m-p/881608#M9620</guid>
      <dc:creator>Dmitry_B_Intel</dc:creator>
      <dc:date>2009-11-20T13:49:06Z</dc:date>
    </item>
    <item>
      <title>Re: Using DFTI with larger numbers of processors</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Using-DFTI-with-larger-numbers-of-processors/m-p/881609#M9621</link>
      <description>
&lt;BR /&gt;The program is crashing on its first call to the Intel MKL libraries...so there is no loop to accumulate memory.&lt;BR /&gt;&lt;BR /&gt;OK, I'll try upgrading MKL.&lt;BR /&gt;</description>
      <pubDate>Fri, 20 Nov 2009 20:35:07 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Using-DFTI-with-larger-numbers-of-processors/m-p/881609#M9621</guid>
      <dc:creator>Justin_D_1</dc:creator>
      <dc:date>2009-11-20T20:35:07Z</dc:date>
    </item>
    <item>
      <title>Re: Using DFTI with larger numbers of processors</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Using-DFTI-with-larger-numbers-of-processors/m-p/881610#M9622</link>
      <description>
&lt;BR /&gt;Davis, please let us know whether the problem persists with the new version.&lt;BR /&gt;--Gennady&lt;BR /&gt;</description>
      <pubDate>Sat, 21 Nov 2009 18:18:05 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Using-DFTI-with-larger-numbers-of-processors/m-p/881610#M9622</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2009-11-21T18:18:05Z</dc:date>
    </item>
  </channel>
</rss>

