<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Hi, in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Speed-of-solvers-with-CSR-format/m-p/1041612#M20728</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 12px; line-height: 1.5;"&gt;&amp;gt; &amp;nbsp;I wonder that using full element CSR format is much faster than having half CSR format.&lt;/SPAN&gt;&lt;/P&gt;

&lt;P style="font-size: 12px;"&gt;May i ask about time result in for different phase and input parameters? Such fact can be describe, for example, by additional iterative step that have been done for symmetric case because of pivot or 2 level algorithm of factorization that is default for nonsymmetric case and can provide performance improvement for many-core processors.&amp;nbsp;&lt;/P&gt;

&lt;P style="font-size: 12px;"&gt;&lt;SPAN style="line-height: 1.5;"&gt;&amp;nbsp;&amp;gt;Also, there is&amp;nbsp;a example with jacobi precondition CG in MKL folder.&lt;/SPAN&gt;&lt;/P&gt;

&lt;P style="font-size: 12px;"&gt;&amp;gt;Are there another preconditioned CG examples?&lt;/P&gt;

&lt;P style="font-size: 12px;"&gt;In current version of MKL there is only one preconditioner example that you mentioned that covered main step of CG RCI routines.&lt;/P&gt;

&lt;P style="font-size: 12px;"&gt;&amp;nbsp;&lt;/P&gt;

&lt;P style="font-size: 12px;"&gt;Thanks,&lt;/P&gt;

&lt;P style="font-size: 12px;"&gt;Alex&lt;/P&gt;</description>
    <pubDate>Tue, 17 Jun 2014 16:20:10 GMT</pubDate>
    <dc:creator>Alexander_K_Intel2</dc:creator>
    <dc:date>2014-06-17T16:20:10Z</dc:date>
    <item>
      <title>Speed of solvers with CSR format</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Speed-of-solvers-with-CSR-format/m-p/1041609#M20725</link>
      <description>&lt;P&gt;&lt;SPAN style="font-size: 1em; line-height: 1.5;"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 1em; line-height: 1.5;"&gt;&amp;nbsp;I have been using MKL in composer 2013. Especially, pardiso and preconditioned conjugate gradient solver with CSR format to solve symmetric matrices&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;I wonder that using full element CSR format is much faster than having half CSR format.&lt;/P&gt;

&lt;P&gt;&amp;nbsp;Also, there is&lt;SPAN style="font-size: 1em; line-height: 1.5;"&gt;&amp;nbsp;a example with jacobi precondition CG in MKL folder.&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;Are there another preconditioned CG examples?&lt;/P&gt;</description>
      <pubDate>Tue, 17 Jun 2014 01:55:12 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Speed-of-solvers-with-CSR-format/m-p/1041609#M20725</guid>
      <dc:creator>Jaekyung_S_</dc:creator>
      <dc:date>2014-06-17T01:55:12Z</dc:date>
    </item>
    <item>
      <title>I think there's no another</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Speed-of-solvers-with-CSR-format/m-p/1041610#M20726</link>
      <description>I think there's no another format...only CSR</description>
      <pubDate>Tue, 17 Jun 2014 01:58:55 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Speed-of-solvers-with-CSR-format/m-p/1041610#M20726</guid>
      <dc:creator>Vidura_Dhananjaya</dc:creator>
      <dc:date>2014-06-17T01:58:55Z</dc:date>
    </item>
    <item>
      <title>  </title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Speed-of-solvers-with-CSR-format/m-p/1041611#M20727</link>
      <description>&lt;P&gt;&amp;nbsp;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;ex)&lt;/P&gt;

&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;A 2x2 matrix &amp;nbsp; &amp;nbsp; &amp;nbsp;A = &amp;nbsp;1 &amp;nbsp;2&lt;/P&gt;

&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2 &amp;nbsp; 3&lt;/P&gt;

&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;In pardiso solver( or PCG)&amp;nbsp;&lt;FONT color="#000000" face="gulim"&gt;&lt;SPAN style="line-height: normal;"&gt;I wonder that&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/P&gt;

&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;using &amp;nbsp;a = { 1, 2, 2, 3 } &amp;nbsp;in mtype 11 is much faster than using mtype 2 with a={1,2,3}&lt;/P&gt;

&lt;P&gt;&amp;nbsp; &amp;nbsp; I&amp;nbsp;&lt;SPAN style="color: rgb(0, 0, 0); font-family: gulim; line-height: normal; font-size: 1em;"&gt;Actual matrix size is larger.&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 17 Jun 2014 02:33:00 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Speed-of-solvers-with-CSR-format/m-p/1041611#M20727</guid>
      <dc:creator>Jaekyung_S_</dc:creator>
      <dc:date>2014-06-17T02:33:00Z</dc:date>
    </item>
    <item>
      <title>Hi,</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Speed-of-solvers-with-CSR-format/m-p/1041612#M20728</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 12px; line-height: 1.5;"&gt;&amp;gt; &amp;nbsp;I wonder that using full element CSR format is much faster than having half CSR format.&lt;/SPAN&gt;&lt;/P&gt;

&lt;P style="font-size: 12px;"&gt;May i ask about time result in for different phase and input parameters? Such fact can be describe, for example, by additional iterative step that have been done for symmetric case because of pivot or 2 level algorithm of factorization that is default for nonsymmetric case and can provide performance improvement for many-core processors.&amp;nbsp;&lt;/P&gt;

&lt;P style="font-size: 12px;"&gt;&lt;SPAN style="line-height: 1.5;"&gt;&amp;nbsp;&amp;gt;Also, there is&amp;nbsp;a example with jacobi precondition CG in MKL folder.&lt;/SPAN&gt;&lt;/P&gt;

&lt;P style="font-size: 12px;"&gt;&amp;gt;Are there another preconditioned CG examples?&lt;/P&gt;

&lt;P style="font-size: 12px;"&gt;In current version of MKL there is only one preconditioner example that you mentioned that covered main step of CG RCI routines.&lt;/P&gt;

&lt;P style="font-size: 12px;"&gt;&amp;nbsp;&lt;/P&gt;

&lt;P style="font-size: 12px;"&gt;Thanks,&lt;/P&gt;

&lt;P style="font-size: 12px;"&gt;Alex&lt;/P&gt;</description>
      <pubDate>Tue, 17 Jun 2014 16:20:10 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Speed-of-solvers-with-CSR-format/m-p/1041612#M20728</guid>
      <dc:creator>Alexander_K_Intel2</dc:creator>
      <dc:date>2014-06-17T16:20:10Z</dc:date>
    </item>
    <item>
      <title>Alex,</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Speed-of-solvers-with-CSR-format/m-p/1041613#M20729</link>
      <description>&lt;P&gt;Alex,&lt;/P&gt;

&lt;P&gt;I'm using this solver to solve a pentadiagonal matrix.&lt;/P&gt;

&lt;P&gt;The link to the solver:&amp;nbsp;http://sepwww.stanford.edu/sep/claudio/Research/Prst_ExpRefl/ShtPSPI/intel/mkl/10.0.3.020/examples/solver/source/dcsrilu0_exampl2.f&lt;/P&gt;

&lt;P&gt;However, I realized the solver is selective. I wouldn't know if I'm not setting one of the parameters right. I can solve a &amp;nbsp;matrix size of 80 X 80. Anything after that, I get the "stack overflow" error message. If I bump it to 256 X 256; it solves it but after that again, it returns an error. If I increase it yet again to 512 X 512, it produces the right result. Is it a memory problem?&lt;/P&gt;

&lt;P&gt;I use a core i3, 6G RAM computer.&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Thu, 03 Jul 2014 22:04:19 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Speed-of-solvers-with-CSR-format/m-p/1041613#M20729</guid>
      <dc:creator>Million_P_</dc:creator>
      <dc:date>2014-07-03T22:04:19Z</dc:date>
    </item>
    <item>
      <title>It shouldn't be surprising if</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Speed-of-solvers-with-CSR-format/m-p/1041614#M20730</link>
      <description>&lt;P&gt;It shouldn't be surprising if you need to raise stack settings.&amp;nbsp; Both the overall shell stack limit (set on Windows by stack reserve and commiit) and the thread stack size (e.g. OMP_STACKSIZE) may be in play.&amp;nbsp; Intel default for OMP_STACK_SIZE is 4MB in 64-bit mode (2MB for 32-bit mode).&amp;nbsp; With ifort you also have the option to move some of your dynamic allocation to heap by /heap-arrays.&lt;/P&gt;</description>
      <pubDate>Fri, 04 Jul 2014 01:42:47 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Speed-of-solvers-with-CSR-format/m-p/1041614#M20730</guid>
      <dc:creator>TimP</dc:creator>
      <dc:date>2014-07-04T01:42:47Z</dc:date>
    </item>
    <item>
      <title>As Tim hinted try to change</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Speed-of-solvers-with-CSR-format/m-p/1041615#M20731</link>
      <description>&lt;P&gt;As Tim hinted try to change stack settings.Default thread stack size is 1MB.&lt;/P&gt;

&lt;P&gt;&lt;A href="http://msdn.microsoft.com/en-us/library/windows/desktop/ms686774(v=vs.85).aspx" target="_blank"&gt;http://msdn.microsoft.com/en-us/library/windows/desktop/ms686774(v=vs.85).aspx&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Sat, 05 Jul 2014 08:10:19 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Speed-of-solvers-with-CSR-format/m-p/1041615#M20731</guid>
      <dc:creator>Bernard</dc:creator>
      <dc:date>2014-07-05T08:10:19Z</dc:date>
    </item>
    <item>
      <title>Iliya is correct that</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Speed-of-solvers-with-CSR-format/m-p/1041616#M20732</link>
      <description>Iliya is correct that Microsoft used to fix thread stacks at 1 MB but this doesn't apply to mkl. Still it's likely you may need a reasonable increase.</description>
      <pubDate>Sat, 05 Jul 2014 10:33:57 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Speed-of-solvers-with-CSR-format/m-p/1041616#M20732</guid>
      <dc:creator>TimP</dc:creator>
      <dc:date>2014-07-05T10:33:57Z</dc:date>
    </item>
    <item>
      <title> &gt;&gt;&gt;can solve a  matrix size</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Speed-of-solvers-with-CSR-format/m-p/1041617#M20733</link>
      <description>&lt;P&gt;&lt;SPAN style="font-size: 12px; line-height: 14.399999618530273px;"&gt;&amp;nbsp;&amp;gt;&amp;gt;&amp;gt;can solve a &amp;nbsp;matrix size of 80 X 80. Anything after that, I get the "stack overflow" error message. If I bump it to 256 X 256; it solves it but after that again, it returns an error. If I increase it yet again to 512 X 512, it produces the right result. Is it a memory problem&amp;gt;&amp;gt;&amp;gt;&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 12px; line-height: 14.399999618530273px;"&gt;Without dump file is hard to find the exact reason for the stack overflow.My assumption is that in some case the allocation is crossing guard page at small stack size and this can be a reason for stack overflow error.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Sat, 05 Jul 2014 11:17:37 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Speed-of-solvers-with-CSR-format/m-p/1041617#M20733</guid>
      <dc:creator>Bernard</dc:creator>
      <dc:date>2014-07-05T11:17:37Z</dc:date>
    </item>
    <item>
      <title>Hi Million P.</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Speed-of-solvers-with-CSR-format/m-p/1041618#M20734</link>
      <description>&lt;P&gt;Hi Million P.&lt;/P&gt;

&lt;P&gt;Could you please provide us a test case, like 256x256 matrix so we can investigate your problem?&lt;/P&gt;

&lt;P&gt;Best Regards,&lt;/P&gt;

&lt;P&gt;Ying&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 08 Jul 2014 05:34:33 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Speed-of-solvers-with-CSR-format/m-p/1041618#M20734</guid>
      <dc:creator>Ying_H_Intel</dc:creator>
      <dc:date>2014-07-08T05:34:33Z</dc:date>
    </item>
  </channel>
</rss>

