<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic the issue is escalated and in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-example-terminates-when-using-72-or-more-cpus/m-p/1082299#M22852</link>
    <description>&lt;P&gt;the issue is escalated and the fix of the problem is targeted to be released the next update.&lt;/P&gt;</description>
    <pubDate>Thu, 04 Aug 2016 04:09:04 GMT</pubDate>
    <dc:creator>Gennady_F_Intel</dc:creator>
    <dc:date>2016-08-04T04:09:04Z</dc:date>
    <item>
      <title>Pardiso example terminates when using 72 or more cpus</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-example-terminates-when-using-72-or-more-cpus/m-p/1082298#M22851</link>
      <description>&lt;P&gt;I studied the example cl_solver_sym_sp_0_based_c.c in cluster_sparse_solverc/source . I compiled it using:&lt;/P&gt;

&lt;P&gt;make libintel64 example=cl_solver_sym_sp_0_based_c&lt;/P&gt;

&lt;P&gt;It runs fine . However the matrix is too small to look at performance. So I modified the example to read in a 3million^2 matrix from a text file. When I run it with 24 cpus ( 1 host ), it factors the matrix in 30 second. When I run it with 48 cpus ( 2 hosts ) it factors it in 20 seconds. This is great! But when I run it with 72 or more cpus, I keep getting this after the reordering stage:&lt;/P&gt;

&lt;P&gt;Reordering completed ...&lt;BR /&gt;
	===================================================================================&lt;BR /&gt;
	=&amp;nbsp;&amp;nbsp; BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES&lt;BR /&gt;
	=&amp;nbsp;&amp;nbsp; PID 106418 RUNNING AT cforge201&lt;BR /&gt;
	=&amp;nbsp;&amp;nbsp; EXIT CODE: 11&lt;BR /&gt;
	=&amp;nbsp;&amp;nbsp; CLEANING UP REMAINING PROCESSES&lt;BR /&gt;
	=&amp;nbsp;&amp;nbsp; YOU CAN IGNORE THE BELOW CLEANUP MESSAGES&lt;BR /&gt;
	==================================================================================&lt;/P&gt;

&lt;P&gt;The command I am using is:&lt;/P&gt;

&lt;P&gt;mpirun -np 3 -machinefile ./hostfile ./cl_solver_sym_sp_0_based_c.exe&lt;/P&gt;

&lt;P&gt;Where hostfile contains:&lt;/P&gt;

&lt;P&gt;cforge200:1&lt;BR /&gt;
	cforge201:1&lt;BR /&gt;
	cforge202:1&lt;/P&gt;

&lt;P&gt;Here are my example files to see if issue is reproduceable:&lt;/P&gt;

&lt;P&gt;cl_solver_sym_sp_0_based_c.c - Edit all the occurences of *.txt to the path where the files are on your system&lt;/P&gt;

&lt;P&gt;&lt;A href="https://www.dropbox.com/s/ndkzi9zojxuh1xo/cl_solver_sym_sp_0_based_c.c?dl=0" target="_blank"&gt;https://www.dropbox.com/s/ndkzi9zojxuh1xo/cl_solver_sym_sp_0_based_c.c?dl=0&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;ia, ja, a, and b data in text files:&lt;/P&gt;

&lt;P&gt;&lt;A href="https://www.dropbox.com/s/3dkhbillyso03kc/ia_ja_a_b_data.tar.gz?dl=0" target="_blank"&gt;https://www.dropbox.com/s/3dkhbillyso03kc/ia_ja_a_b_data.tar.gz?dl=0&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;Curious what kind of performance improvement you get when running with MPI on 12, 24, 48, and 72 cpus!&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 03 Aug 2016 21:59:55 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-example-terminates-when-using-72-or-more-cpus/m-p/1082298#M22851</guid>
      <dc:creator>Ferris_H_</dc:creator>
      <dc:date>2016-08-03T21:59:55Z</dc:date>
    </item>
    <item>
      <title>the issue is escalated and</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-example-terminates-when-using-72-or-more-cpus/m-p/1082299#M22852</link>
      <description>&lt;P&gt;the issue is escalated and the fix of the problem is targeted to be released the next update.&lt;/P&gt;</description>
      <pubDate>Thu, 04 Aug 2016 04:09:04 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-example-terminates-when-using-72-or-more-cpus/m-p/1082299#M22852</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2016-08-04T04:09:04Z</dc:date>
    </item>
    <item>
      <title>Excellent. Since I had two</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-example-terminates-when-using-72-or-more-cpus/m-p/1082300#M22853</link>
      <description>&lt;P&gt;Excellent. Since I had two problems with Pardiso, I was not sure which problem the fix was meant for. It sounds like it is meant for the one where it can not run on more than 2 hosts ( 48 cpus ).&lt;/P&gt;</description>
      <pubDate>Thu, 04 Aug 2016 04:14:40 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-example-terminates-when-using-72-or-more-cpus/m-p/1082300#M22853</guid>
      <dc:creator>Ferris_H_</dc:creator>
      <dc:date>2016-08-04T04:14:40Z</dc:date>
    </item>
  </channel>
</rss>

