<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Hi Daniel, in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Scalability-of-dense-symmetric-indefinite-factorization/m-p/1144573#M26595</link>
    <description>Hi Daniel,
Do you mean on both machine, the thread scale is limited to thread 8?  it is not expected. we publish some factorization benchmark like dgetrf &lt;A href="https://software.intel.com/en-us/mkl/features/benchmarks" target="_blank"&gt;https://software.intel.com/en-us/mkl/features/benchmarks&lt;/A&gt; on xeon and xeon phi. for your reference.  And if need, please submit the exact issue to &lt;A href="https://supporttickets.intel.com/?lang=en-US" target="_blank"&gt;https://supporttickets.intel.com/?lang=en-US&lt;/A&gt; with your reproduce matrix. 

Best Regards,
Ying</description>
    <pubDate>Thu, 01 Mar 2018 01:26:18 GMT</pubDate>
    <dc:creator>Ying_H_Intel</dc:creator>
    <dc:date>2018-03-01T01:26:18Z</dc:date>
    <item>
      <title>Scalability of dense symmetric indefinite factorization</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Scalability-of-dense-symmetric-indefinite-factorization/m-p/1144572#M26594</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;

&lt;P&gt;I am trying to speed up the factorization of a dense symmetric indefinite matrix, the size of my matrices is usually between 10 k and 20 k.&lt;/P&gt;

&lt;P&gt;I am using LAPACK (dsytrf) and MKL 2018 and I run it on a supercomputer node with two Intel Xeon E5-2680 v3 Haswell CPUs (2 x 12 Cores, 2,5 GHz). I also tried a node with&amp;nbsp; Intel Xeon Phi 7250-F Knights Landing&amp;nbsp; CPU and 68 cores, 1.4 GHz. The problem is that the factorization does not seem to scale very well with the number of threads I use: with up to 8 threads I see some improvement (the run time is halfed) but after that there is even a slowdown.&lt;/P&gt;

&lt;P&gt;Is this something that is to be expected from this MKL routine? And if so, do you know of any alternative that scales better?&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;Thanks&lt;/P&gt;

&lt;P&gt;Daniel&lt;/P&gt;</description>
      <pubDate>Sun, 25 Feb 2018 20:05:17 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Scalability-of-dense-symmetric-indefinite-factorization/m-p/1144572#M26594</guid>
      <dc:creator>Rehfeldt__Daniel</dc:creator>
      <dc:date>2018-02-25T20:05:17Z</dc:date>
    </item>
    <item>
      <title>Hi Daniel,</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Scalability-of-dense-symmetric-indefinite-factorization/m-p/1144573#M26595</link>
      <description>Hi Daniel,
Do you mean on both machine, the thread scale is limited to thread 8?  it is not expected. we publish some factorization benchmark like dgetrf &lt;A href="https://software.intel.com/en-us/mkl/features/benchmarks" target="_blank"&gt;https://software.intel.com/en-us/mkl/features/benchmarks&lt;/A&gt; on xeon and xeon phi. for your reference.  And if need, please submit the exact issue to &lt;A href="https://supporttickets.intel.com/?lang=en-US" target="_blank"&gt;https://supporttickets.intel.com/?lang=en-US&lt;/A&gt; with your reproduce matrix. 

Best Regards,
Ying</description>
      <pubDate>Thu, 01 Mar 2018 01:26:18 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Scalability-of-dense-symmetric-indefinite-factorization/m-p/1144573#M26595</guid>
      <dc:creator>Ying_H_Intel</dc:creator>
      <dc:date>2018-03-01T01:26:18Z</dc:date>
    </item>
    <item>
      <title>Hi Ying,</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Scalability-of-dense-symmetric-indefinite-factorization/m-p/1144574#M26596</link>
      <description>&lt;P&gt;Hi Ying,&lt;/P&gt;

&lt;P&gt;thanks for your help. On both machines the factorization does not scale beyond 8 threads.I will submit the matrix to the support, as you suggested.&lt;/P&gt;

&lt;P&gt;Best&lt;/P&gt;

&lt;P&gt;Daniel&lt;/P&gt;</description>
      <pubDate>Thu, 01 Mar 2018 10:16:19 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Scalability-of-dense-symmetric-indefinite-factorization/m-p/1144574#M26596</guid>
      <dc:creator>Rehfeldt__Daniel</dc:creator>
      <dc:date>2018-03-01T10:16:19Z</dc:date>
    </item>
    <item>
      <title>Hi Daniel,</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Scalability-of-dense-symmetric-indefinite-factorization/m-p/1144575#M26597</link>
      <description>&lt;P&gt;Hi Daniel,&lt;/P&gt;

&lt;P&gt;We have been working on improving this functionality in terms of performance and scalability, the optimizations will be available in one of the next releases.&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 02 Mar 2018 05:58:25 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Scalability-of-dense-symmetric-indefinite-factorization/m-p/1144575#M26597</guid>
      <dc:creator>Denis_S_Intel</dc:creator>
      <dc:date>2018-03-02T05:58:25Z</dc:date>
    </item>
    <item>
      <title>Hi Denis,</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Scalability-of-dense-symmetric-indefinite-factorization/m-p/1144576#M26598</link>
      <description>&lt;P&gt;Hi Denis,&lt;/P&gt;

&lt;P&gt;thanks for the information. Do you have an idea on how long this will take (several months, a year, etc)? I am not familiar with your release cycles. Would you recommend to try an LU factorization until then? According to your benchmarks that seem to scale beyond 8 threads.&lt;/P&gt;</description>
      <pubDate>Fri, 02 Mar 2018 09:07:52 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Scalability-of-dense-symmetric-indefinite-factorization/m-p/1144576#M26598</guid>
      <dc:creator>Rehfeldt__Daniel</dc:creator>
      <dc:date>2018-03-02T09:07:52Z</dc:date>
    </item>
    <item>
      <title>Hi Daniel,</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Scalability-of-dense-symmetric-indefinite-factorization/m-p/1144577#M26599</link>
      <description>&lt;P&gt;Hi Daniel,&lt;/P&gt;

&lt;P&gt;The new release is expected this month. As for LU factorization, yes I think it's a good way to try LU instead of LDLT until the new release is available.&lt;BR /&gt;
	May I ask you what are you going to do with the results once you have them?&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 02 Mar 2018 23:43:43 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Scalability-of-dense-symmetric-indefinite-factorization/m-p/1144577#M26599</guid>
      <dc:creator>Denis_S_Intel</dc:creator>
      <dc:date>2018-03-02T23:43:43Z</dc:date>
    </item>
    <item>
      <title>Hi Denis,</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Scalability-of-dense-symmetric-indefinite-factorization/m-p/1144578#M26600</link>
      <description>&lt;P&gt;Hi Denis,&lt;/P&gt;

&lt;P&gt;thanks again. Will your enhancements be in the release notes? Otherwise could you let me know once it has been released?&lt;/P&gt;

&lt;P&gt;I will try the LU factorization then. Would you expect it to scale to 68 Cores (lets say for a 10 k matrix)?&lt;/P&gt;

&lt;P&gt;I use the factorization for solving two to four linear systems with different right hand sides.&lt;/P&gt;</description>
      <pubDate>Sat, 03 Mar 2018 07:56:03 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Scalability-of-dense-symmetric-indefinite-factorization/m-p/1144578#M26600</guid>
      <dc:creator>Rehfeldt__Daniel</dc:creator>
      <dc:date>2018-03-03T07:56:03Z</dc:date>
    </item>
    <item>
      <title>Hi Daniel,</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Scalability-of-dense-symmetric-indefinite-factorization/m-p/1144579#M26601</link>
      <description>&lt;P&gt;Hi Daniel,&lt;/P&gt;

&lt;P&gt;Yes, the&amp;nbsp;enhancements will be in the release notes and yes, the LU factorization shows good scalability.&lt;/P&gt;</description>
      <pubDate>Tue, 06 Mar 2018 20:44:38 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Scalability-of-dense-symmetric-indefinite-factorization/m-p/1144579#M26601</guid>
      <dc:creator>Denis_S_Intel</dc:creator>
      <dc:date>2018-03-06T20:44:38Z</dc:date>
    </item>
  </channel>
</rss>

