<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Quote:Sergey Kostrov wrote: in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Problem-with-MKL-Scalapack-PDGETRF/m-p/973879#M16941</link>
    <description>&lt;P&gt;&lt;/P&gt;&lt;BLOCKQUOTE&gt;Sergey Kostrov wrote:&lt;BR /&gt;&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Also, did you do any testing in Non-MPI environment?&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Thanks for the reply. Yes, it works fine.&lt;/P&gt;</description>
    <pubDate>Sat, 10 Aug 2013 15:09:20 GMT</pubDate>
    <dc:creator>siegfriedyoung</dc:creator>
    <dc:date>2013-08-10T15:09:20Z</dc:date>
    <item>
      <title>Problem with MKL Scalapack PDGETRF</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Problem-with-MKL-Scalapack-PDGETRF/m-p/973877#M16939</link>
      <description>&lt;P&gt;Hi all,&lt;/P&gt;
&lt;P&gt;I am trying to use&amp;nbsp;MKL PBLAS/ScaLAPACK routine pdgetrf to do the LU decompostion. I wrote a simple test fortran program and it worked well with 2*2 processes on the cluster. However, when I tried to use more processes, like 'mpiexec -n 16', The program got stuck.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;One possible reason might be that the BLAS spawns too many threads which lead to a performance disaster ( for ref:&amp;nbsp;&lt;A href="https://icl.cs.utk.edu/lapack-forum/viewtopic.php?f=6&amp;amp;t=3371"&gt;https://icl.cs.utk.edu/lapack-forum/viewtopic.php?f=6&amp;amp;t=3371&lt;/A&gt;&amp;nbsp;). So I tried to export OMP_NUM_THREADS=1 or MKL_NUM_THREADS=1, set different combinations of pbs -l select=:ncpu:mpiprocs: to submit the job. But none of them solved the problem.&lt;/P&gt;
&lt;P&gt;I have no idea now why it is fine with 2*2 procs but fails with 4*4 or more procs, hope someone here can help me. Any suggestion would be greatly appreciated.&lt;/P&gt;
&lt;P&gt;Cluster compiler info:&lt;/P&gt;
&lt;P&gt;Intel® Fortran Composer 13.0.1 and MPICH 3.0.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Sieg&lt;/P&gt;</description>
      <pubDate>Fri, 09 Aug 2013 18:30:08 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Problem-with-MKL-Scalapack-PDGETRF/m-p/973877#M16939</guid>
      <dc:creator>siegfriedyoung</dc:creator>
      <dc:date>2013-08-09T18:30:08Z</dc:date>
    </item>
    <item>
      <title>&gt;&gt;...https://icl.cs.utk.edu</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Problem-with-MKL-Scalapack-PDGETRF/m-p/973878#M16940</link>
      <description>&amp;gt;&amp;gt;...https://icl.cs.utk.edu/lapack-forum/viewtopic.php?f=6&amp;amp;t=3371 )...

It is a quote from a post on another forum:

&amp;gt;&amp;gt;Re: Scaling parallel, banded LU decomposition and solve
&amp;gt;&amp;gt;by &lt;STRONG&gt;Julien Langou&lt;/STRONG&gt; » Tue May 29, 2012 7:14 pm 
&amp;gt;&amp;gt;
&amp;gt;&amp;gt;...
&amp;gt;&amp;gt;Note:
&amp;gt;&amp;gt;OMP_NUM_THREADS takes care of most of the BLAS libraries but not all, &lt;STRONG&gt;some have their own environment variable to control&lt;/STRONG&gt;
&amp;gt;&amp;gt;&lt;STRONG&gt;the number of threads&lt;/STRONG&gt; they are running on...
&amp;gt;&amp;gt;...

Please ask &lt;STRONG&gt;Julien Langou&lt;/STRONG&gt; to provide additional technical details about these internal environment variables.

Also, did you do any testing in Non-MPI environment?</description>
      <pubDate>Sat, 10 Aug 2013 01:30:02 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Problem-with-MKL-Scalapack-PDGETRF/m-p/973878#M16940</guid>
      <dc:creator>SergeyKostrov</dc:creator>
      <dc:date>2013-08-10T01:30:02Z</dc:date>
    </item>
    <item>
      <title>Quote:Sergey Kostrov wrote:</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Problem-with-MKL-Scalapack-PDGETRF/m-p/973879#M16941</link>
      <description>&lt;P&gt;&lt;/P&gt;&lt;BLOCKQUOTE&gt;Sergey Kostrov wrote:&lt;BR /&gt;&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Also, did you do any testing in Non-MPI environment?&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Thanks for the reply. Yes, it works fine.&lt;/P&gt;</description>
      <pubDate>Sat, 10 Aug 2013 15:09:20 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Problem-with-MKL-Scalapack-PDGETRF/m-p/973879#M16941</guid>
      <dc:creator>siegfriedyoung</dc:creator>
      <dc:date>2013-08-10T15:09:20Z</dc:date>
    </item>
    <item>
      <title>Problem solved. It seems the</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Problem-with-MKL-Scalapack-PDGETRF/m-p/973880#M16942</link>
      <description>&lt;P&gt;Problem solved. It seems the MKL scalapack packed in composer 13.0.1 doesn't get along with MPICH3.0. After switching to the Intel MPI, the pdgetrf routine works fine with more procs.&lt;/P&gt;
&lt;P&gt;Havn't try MPICH2 and Newer release of MKL. Hope the issue has been solved already.&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 13 Aug 2013 23:35:11 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Problem-with-MKL-Scalapack-PDGETRF/m-p/973880#M16942</guid>
      <dc:creator>siegfriedyoung</dc:creator>
      <dc:date>2013-08-13T23:35:11Z</dc:date>
    </item>
  </channel>
</rss>

