<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic matrix inverse FLOPS in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/matrix-inverse-FLOPS/m-p/969574#M16464</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;

&lt;P&gt;How many floating-point operations (FLOPs) are required to invert a 16x16 MKL_Complex8 matrix using cpotrf and then cpotri?&lt;/P&gt;

&lt;P&gt;How many CPU clock cycles should it take on an Atom E3826 CPU and on an i5-3470 CPU?&lt;/P&gt;

&lt;P&gt;Is there any performance difference between a 32-bit Linux operating system and a 64-bit Linux operating system (for those specific CPUs)?&lt;/P&gt;

&lt;P&gt;Thanks, Nimrod&lt;/P&gt;</description>
    <pubDate>Thu, 03 Apr 2014 05:22:32 GMT</pubDate>
    <dc:creator>Nimrod_H_</dc:creator>
    <dc:date>2014-04-03T05:22:32Z</dc:date>
    <item>
      <title>matrix inverse FLOPS</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/matrix-inverse-FLOPS/m-p/969574#M16464</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;

&lt;P&gt;How many floating-point operations (FLOPs) are required to invert a 16x16 MKL_Complex8 matrix using cpotrf and then cpotri?&lt;/P&gt;

&lt;P&gt;How many CPU clock cycles should it take on an Atom E3826 CPU and on an i5-3470 CPU?&lt;/P&gt;

&lt;P&gt;Is there any performance difference between a 32-bit Linux operating system and a 64-bit Linux operating system (for those specific CPUs)?&lt;/P&gt;

&lt;P&gt;Thanks, Nimrod&lt;/P&gt;</description>
      <pubDate>Thu, 03 Apr 2014 05:22:32 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/matrix-inverse-FLOPS/m-p/969574#M16464</guid>
      <dc:creator>Nimrod_H_</dc:creator>
      <dc:date>2014-04-03T05:22:32Z</dc:date>
    </item>
    <item>
      <title>Hi Nimrod,</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/matrix-inverse-FLOPS/m-p/969575#M16465</link>
      <description>&lt;P&gt;Hi Nimrod,&lt;/P&gt;

&lt;P&gt;The approximate flop count for (S/D)POTRF is 1/3*N^3 and for (S/D)POTRI is 2/3*N^3; for the complex case, multiply these by four.&lt;BR /&gt;
	More precise formulas for the complex case, which make sense for such a small size, are:&lt;/P&gt;

&lt;P&gt;CPOTRF_FLOPS = 6 * N * (N * (N * 1./6. + .5) + 1./3.) + 2 * N * 1./6. * (N * N - 1.);&lt;/P&gt;

&lt;P&gt;CPOTRI_FLOPS = 6 * N * (N * (N * 1./3. + 1.) + 2./3.) + 2 * N * (N * (N * 1./3. - .5) + 1./6.)&lt;/P&gt;
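
&lt;P&gt;As a rough check, the two formulas above can be evaluated for the 16x16 case from the question. This is a minimal Python sketch, not MKL code: the function names cpotrf_flops and cpotri_flops are made up for illustration, and the weights 6 and 2 are assumed to be the usual real-flop costs of one complex multiplication and one complex addition.&lt;/P&gt;

```python
def cpotrf_flops(n):
    """Flop estimate for CPOTRF, transcribed from the formula above."""
    mults = n * (n * (n / 6.0 + 0.5) + 1.0 / 3.0)  # multiplication count
    adds = n / 6.0 * (n * n - 1.0)                 # addition count
    return 6.0 * mults + 2.0 * adds


def cpotri_flops(n):
    """Flop estimate for CPOTRI, transcribed from the formula above."""
    mults = n * (n * (n / 3.0 + 1.0) + 2.0 / 3.0)  # multiplication count
    adds = n * (n * (n / 3.0 - 0.5) + 1.0 / 6.0)   # addition count
    return 6.0 * mults + 2.0 * adds


n = 16  # matrix order from the original question
precise = cpotrf_flops(n) + cpotri_flops(n)
# Leading-order estimate: four times (1/3 + 2/3) * N^3
approx = 4.0 * (1.0 / 3.0 + 2.0 / 3.0) * n ** 3

print("CPOTRF:", round(cpotrf_flops(n)))
print("CPOTRI:", round(cpotri_flops(n)))
print("precise total:", round(precise))
print("approximate total:", round(approx))
```

&lt;P&gt;For N = 16 this works out to 6256 flops for the factorization and 12272 for the inversion, 18528 in total, against 16384 from the leading-order estimate, so the lower-order terms add roughly 13% at this size.&lt;/P&gt;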


&lt;P&gt;There is usually a performance difference between 32-bit and 64-bit code, which comes from the larger register set in the Intel 64 architecture and other improvements in the x86-64 Application Binary Interface (ABI).&lt;/P&gt;

&lt;P&gt;Unfortunately, I don't have clock counts for these functions.&lt;/P&gt;


&lt;P&gt;W.B.R., Alexander&lt;/P&gt;</description>
      <pubDate>Thu, 03 Apr 2014 09:48:34 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/matrix-inverse-FLOPS/m-p/969575#M16465</guid>
      <dc:creator>Alexander_K_Intel3</dc:creator>
      <dc:date>2014-04-03T09:48:34Z</dc:date>
    </item>
  </channel>
</rss>

