<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Hi, in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/PARDISO-optimization/m-p/975913#M17072</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;

&lt;P&gt;Could you provide information of value of n and provide output of pardiso if you set msglvl to 1?&lt;/P&gt;

&lt;P&gt;Thanks,&lt;/P&gt;

&lt;P&gt;Alex&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Wed, 09 Apr 2014 17:37:57 GMT</pubDate>
    <dc:creator>Alexander_K_Intel2</dc:creator>
    <dc:date>2014-04-09T17:37:57Z</dc:date>
    <item>
      <title>PARDISO optimization?</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/PARDISO-optimization/m-p/975910#M17069</link>
      <description>&lt;P&gt;Here goes,&lt;/P&gt;

&lt;P&gt;Relevant information (I hope):&lt;/P&gt;

&lt;UL&gt;
	&lt;LI&gt;Using Pardiso to solve linear set of equations&amp;nbsp;&lt;STRONG&gt;Ax = b&lt;/STRONG&gt;&lt;/LI&gt;
	&lt;LI&gt;matrix&amp;nbsp;&lt;STRONG&gt;A &lt;/STRONG&gt;can be&amp;nbsp;extremely large &amp;amp; sparse, nonsymmetric, and is also put into csr format using mkl routine&lt;/LI&gt;
&lt;/UL&gt;

&lt;P&gt;Below is the section of code where I call Pardiso, it works great everything is fine. I was really hoping someone could look at my set up and let me know if there is something I can do to make it even faster, or if it is as fast as it's going to get.&lt;/P&gt;

&lt;P&gt;I also use &lt;STRONG&gt;mkl_set_num_threads(n)&lt;/STRONG&gt; above the code to make use of multiple cores according to the desired &lt;STRONG&gt;n&lt;/STRONG&gt;.&lt;/P&gt;

&lt;P&gt;&lt;U&gt;&lt;STRONG&gt;CODE&lt;/STRONG&gt;&lt;/U&gt;&lt;/P&gt;

&lt;P&gt;//&amp;nbsp;&amp;nbsp; &amp;nbsp;Call pardiso solver&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;MKL_INT pt[64], iparm[64];&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;for(i = 0; i &amp;lt; 64; i++) {&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;pt&lt;I&gt; = 0;&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;iparm&lt;I&gt; = 0;&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;}&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;MKL_INT *perm;&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;perm = (MKL_INT*)mkl_malloc(m*sizeof(MKL_INT),16);&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;if (perm == NULL)&amp;nbsp;&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;{&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;cout &amp;lt;&amp;lt; "&amp;gt;&amp;gt;&amp;gt; error allocating perm" &amp;lt;&amp;lt; endl;&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;return (0);&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;}&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;&lt;/I&gt;&lt;/I&gt;&lt;/P&gt;

&lt;P&gt;MKL_INT maxfct, mnum, mtype, phase, nrhs, msglvl, error;&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;maxfct = 1;&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;mnum = 1;&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;mtype = 11;&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;nrhs = 1;&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;msglvl = 1;&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;iparm[0] = 1;&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;iparm[1] = 3;&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;iparm[26] = 1;&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;iparm[34] = 1;&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;iparm[60] = 0;&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;error = 0;&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;//&amp;nbsp;&amp;nbsp; &amp;nbsp;Pardiso Direct Solver&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;phase = 13;&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;pardiso (pt, &amp;amp;maxfct, &amp;amp;mnum, &amp;amp;mtype, &amp;amp;phase, &amp;amp;n, acsr, ia, ja, perm, &amp;amp;nrhs, iparm, &amp;amp;msglvl, b, solution, &amp;amp;error);&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;if (error != 0) {&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;cout &amp;lt;&amp;lt; "ERROR during symbolic factorization: " &amp;lt;&amp;lt; error &amp;lt;&amp;lt; endl;&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;}&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;Again all I am really seeking is whether or not my call could be any better, or if this is optimal. Let me know if there is any additional information needed, and I will be checking my email frequently today to respond quickly.&lt;/P&gt;

&lt;P&gt;Sincerely, Jared&lt;/P&gt;</description>
      <pubDate>Wed, 09 Apr 2014 14:03:57 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/PARDISO-optimization/m-p/975910#M17069</guid>
      <dc:creator>Jared_W_</dc:creator>
      <dc:date>2014-04-09T14:03:57Z</dc:date>
    </item>
    <item>
      <title>Hi Jared,</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/PARDISO-optimization/m-p/975911#M17070</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;SPAN style="font-family: Arial, Helvetica, sans-serif; font-size: 12px; line-height: 18px;"&gt;Jared,&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-family: Arial, Helvetica, sans-serif; font-size: 12px; line-height: 18px;"&gt;What kind of optimal you need to have? If you want to achieve better performance it is recommended to switch off matrix checker (iparm[26])&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-family: Arial, Helvetica, sans-serif; font-size: 12px; line-height: 18px;"&gt;Thanks,&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-family: Arial, Helvetica, sans-serif; font-size: 12px; line-height: 18px;"&gt;Alex&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 09 Apr 2014 16:35:35 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/PARDISO-optimization/m-p/975911#M17070</guid>
      <dc:creator>Alexander_K_Intel2</dc:creator>
      <dc:date>2014-04-09T16:35:35Z</dc:date>
    </item>
    <item>
      <title>Alex,</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/PARDISO-optimization/m-p/975912#M17071</link>
      <description>&lt;P&gt;Alex,&lt;/P&gt;

&lt;P&gt;Thanks for the quick response. I just want to make sure it solves the matrix A the fastest possible. I probably should have mentioned one thing I thought was interesting was it seems that the specification of&amp;nbsp;&lt;STRONG&gt;mkl_set_num_threads(n)&lt;/STRONG&gt;&amp;nbsp;isn't affecting how quickly pardiso solves. Does this seem reasonable based on the set up I described?&lt;/P&gt;</description>
      <pubDate>Wed, 09 Apr 2014 17:29:37 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/PARDISO-optimization/m-p/975912#M17071</guid>
      <dc:creator>Jared_W_</dc:creator>
      <dc:date>2014-04-09T17:29:37Z</dc:date>
    </item>
    <item>
      <title>Hi,</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/PARDISO-optimization/m-p/975913#M17072</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;

&lt;P&gt;Could you provide information of value of n and provide output of pardiso if you set msglvl to 1?&lt;/P&gt;

&lt;P&gt;Thanks,&lt;/P&gt;

&lt;P&gt;Alex&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 09 Apr 2014 17:37:57 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/PARDISO-optimization/m-p/975913#M17072</guid>
      <dc:creator>Alexander_K_Intel2</dc:creator>
      <dc:date>2014-04-09T17:37:57Z</dc:date>
    </item>
    <item>
      <title>I have attached two cases, n</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/PARDISO-optimization/m-p/975914#M17073</link>
      <description>&lt;P&gt;I have attached two cases, n = 2 and n = 8.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 09 Apr 2014 18:08:07 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/PARDISO-optimization/m-p/975914#M17073</guid>
      <dc:creator>Jared_W_</dc:creator>
      <dc:date>2014-04-09T18:08:07Z</dc:date>
    </item>
    <item>
      <title>HI,</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/PARDISO-optimization/m-p/975915#M17074</link>
      <description>&lt;P&gt;HI,&lt;/P&gt;

&lt;P&gt;First of all your number of threads changes - number of&amp;nbsp;&lt;/P&gt;

&lt;PRE style="color: rgb(0, 0, 0); line-height: normal; word-wrap: break-word; white-space: pre-wrap;"&gt;&amp;lt; Parallel Direct Factorization with #processors: &amp;gt;&lt;/PRE&gt;

&lt;P&gt;is different.&lt;/P&gt;

&lt;P&gt;Secondary - as i see factorization time decrease when number of threads increased that expected. In any case you matrix is quite small that's why not all part of PARDISO decrease significantly after using additional threads. Also, what version of MKL do you use?&lt;/P&gt;

&lt;P&gt;Thanks,&lt;/P&gt;

&lt;P&gt;Alex&lt;/P&gt;</description>
      <pubDate>Wed, 09 Apr 2014 18:48:07 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/PARDISO-optimization/m-p/975915#M17074</guid>
      <dc:creator>Alexander_K_Intel2</dc:creator>
      <dc:date>2014-04-09T18:48:07Z</dc:date>
    </item>
    <item>
      <title>Ok, so that makes sense for</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/PARDISO-optimization/m-p/975916#M17075</link>
      <description>&lt;P&gt;Ok, so that makes sense for the factorization time to decrease since I changed the #processors from 2 to 8. So just so I have a scale on things, the 30K X 30K matrix is small relatively?&lt;/P&gt;</description>
      <pubDate>Wed, 09 Apr 2014 19:30:36 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/PARDISO-optimization/m-p/975916#M17075</guid>
      <dc:creator>Jared_W_</dc:creator>
      <dc:date>2014-04-09T19:30:36Z</dc:date>
    </item>
    <item>
      <title>You are correct. But i have a</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/PARDISO-optimization/m-p/975917#M17076</link>
      <description>&lt;P&gt;You are correct. But i have a question - if you are worried about part of seconds time than you call pardiso several time - am I correct? May I ask you about model of using PARDISO in your code to help you modify model of using that could help in reducing overall time.&lt;/P&gt;

&lt;P&gt;Thanks,&lt;/P&gt;

&lt;P&gt;Alex&lt;/P&gt;</description>
      <pubDate>Thu, 10 Apr 2014 09:17:38 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/PARDISO-optimization/m-p/975917#M17076</guid>
      <dc:creator>Alexander_K_Intel2</dc:creator>
      <dc:date>2014-04-10T09:17:38Z</dc:date>
    </item>
    <item>
      <title>Alex,</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/PARDISO-optimization/m-p/975918#M17077</link>
      <description>&lt;P&gt;Alex,&lt;/P&gt;

&lt;P&gt;You're right, I am calling it several times. It is the solver in my finite element code for viscous, incompressible fluid flow. At my current largest mesh (30k degrees of freedom), it is actually solving quite fast at approximately 0.8 s, but this entire process iterates over and over until convergence. I have been looking at other parts of my code and it seems other spots are adding up to the majority of the time. One in particular though is the &lt;STRONG&gt;ddnscsr&lt;/STRONG&gt; routine. It is taking approximately &lt;STRONG&gt;2 seconds&lt;/STRONG&gt; to &lt;STRONG&gt;convert sparse format to csr&lt;/STRONG&gt;, does this seem right to you?&lt;/P&gt;</description>
      <pubDate>Thu, 10 Apr 2014 10:53:22 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/PARDISO-optimization/m-p/975918#M17077</guid>
      <dc:creator>Jared_W_</dc:creator>
      <dc:date>2014-04-10T10:53:22Z</dc:date>
    </item>
    <item>
      <title>HI,</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/PARDISO-optimization/m-p/975919#M17078</link>
      <description>&lt;P&gt;HI,&lt;/P&gt;

&lt;P&gt;I can't recommend using converters on each step in such algorithm. It can significantly improve performance if you will calculate elements of matrix in csr format, but not in dense and convert it. Actually, your matrix structure didn't change, is it true?&lt;/P&gt;

&lt;P&gt;Thanks,&lt;/P&gt;

&lt;P&gt;Alex&lt;/P&gt;</description>
      <pubDate>Fri, 11 Apr 2014 02:33:15 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/PARDISO-optimization/m-p/975919#M17078</guid>
      <dc:creator>Alexander_K_Intel2</dc:creator>
      <dc:date>2014-04-11T02:33:15Z</dc:date>
    </item>
  </channel>
</rss>

