<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: MKL Itanium 2 Performance in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-Itanium-2-Performance/m-p/930964#M13742</link>
    <description>Yes, that's based on retiring 2 fused multiply-add instructions per clock cycle.</description>
    <pubDate>Fri, 28 Apr 2006 23:38:25 GMT</pubDate>
    <dc:creator>TimP</dc:creator>
    <dc:date>2006-04-28T23:38:25Z</dc:date>
    <item>
      <title>MKL Itanium 2 Performance</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-Itanium-2-Performance/m-p/930961#M13739</link>
      <description>&lt;DIV&gt;I was looking at a variety of information and was trying to determine what the theoretical peak performance for an Itanium 2 (1.5 GHz) is.&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;From some information I found on various web sites, it would appear that for double precision floating point data, the peak would be 6 GFlops and 12 GFlops for single precision data (based on the 4 floating point units available for single precision). Is this correct?&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;However, there is a performance graph for DGEMM on one of the MKL pages which shows a performance of more than 6 GFlops (single CPU).&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;Can someone help clarify this?&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;Thanks,&lt;/DIV&gt;&lt;DIV&gt;Tim&lt;/DIV&gt;</description>
      <pubDate>Fri, 28 Apr 2006 00:01:52 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-Itanium-2-Performance/m-p/930961#M13739</guid>
      <dc:creator>tcrony70</dc:creator>
      <dc:date>2006-04-28T00:01:52Z</dc:date>
    </item>
    <item>
      <title>Re: MKL Itanium 2 Performance</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-Itanium-2-Performance/m-p/930962#M13740</link>
      <description>&lt;DIV&gt;I'll investigate the performance on our website. It may be an error (perhaps with the clock speed reported).&lt;/DIV&gt;
&lt;DIV&gt;&lt;/DIV&gt;
&lt;DIV&gt;For both single and double precision, the theoretical peak of an Itanium 2 processor is 6 GFLOPS.&lt;/DIV&gt;
&lt;DIV&gt;&lt;/DIV&gt;
&lt;DIV&gt;Todd&lt;/DIV&gt;</description>
      <pubDate>Fri, 28 Apr 2006 00:52:55 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-Itanium-2-Performance/m-p/930962#M13740</guid>
      <dc:creator>Todd_R_Intel</dc:creator>
      <dc:date>2006-04-28T00:52:55Z</dc:date>
    </item>
    <item>
      <title>Re: MKL Itanium 2 Performance</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-Itanium-2-Performance/m-p/930963#M13741</link>
      <description>&lt;DIV&gt;&lt;/DIV&gt;&lt;P&gt;Thanks for the response.&lt;/P&gt;&lt;P&gt;Does this mean that there are not 4 floating point units capable of doing single precision arithmetic as I saw in several places on the web or that only 2 can act in a any given cycle (since I would have to believe that they can do a fused multiply and add).&lt;/P&gt;</description>
      <pubDate>Fri, 28 Apr 2006 20:49:01 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-Itanium-2-Performance/m-p/930963#M13741</guid>
      <dc:creator>tcrony70</dc:creator>
      <dc:date>2006-04-28T20:49:01Z</dc:date>
    </item>
    <item>
      <title>Re: MKL Itanium 2 Performance</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-Itanium-2-Performance/m-p/930964#M13742</link>
      <description>Yes, that's based on retiring 2 fused multiply-add instructions per clock cycle.</description>
      <pubDate>Fri, 28 Apr 2006 23:38:25 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-Itanium-2-Performance/m-p/930964#M13742</guid>
      <dc:creator>TimP</dc:creator>
      <dc:date>2006-04-28T23:38:25Z</dc:date>
    </item>
    <item>
      <title>Re: MKL Itanium 2 Performance</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-Itanium-2-Performance/m-p/930965#M13743</link>
      <description>&lt;DIV&gt;So what would be the point of having 4 floating point units that can handle short precision if one can still only retire 2 floating point instructions per clock cycle?&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Sat, 29 Apr 2006 00:48:40 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-Itanium-2-Performance/m-p/930965#M13743</guid>
      <dc:creator>tcrony70</dc:creator>
      <dc:date>2006-04-29T00:48:40Z</dc:date>
    </item>
    <item>
      <title>Re: MKL Itanium 2 Performance</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-Itanium-2-Performance/m-p/930966#M13744</link>
      <description>You put Itanium 2 in the title. With Itanium 1, in principle, it was possible to execute SSE instructions. If your goal is to execute SSE code efficiently, surely Itanium 2 is not your vehicle.</description>
      <pubDate>Sat, 29 Apr 2006 03:40:47 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-Itanium-2-Performance/m-p/930966#M13744</guid>
      <dc:creator>TimP</dc:creator>
      <dc:date>2006-04-29T03:40:47Z</dc:date>
    </item>
    <item>
      <title>Re: MKL Itanium 2 Performance</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-Itanium-2-Performance/m-p/930967#M13745</link>
      <description>&lt;DIV&gt;&lt;/DIV&gt;
&lt;P&gt;The data you saw was erroneously labeled as being 1.5 GHz, but was, in fact 1.6 GHz, which explains why the performance exceeded 6 GFLOPS on DGEMM. The maximum performance in either single precision or double precision, as mentioned elsewhere, is 4 floating point operations per clock, or 2 FMAs per clock.&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 02 May 2006 01:34:53 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-Itanium-2-Performance/m-p/930967#M13745</guid>
      <dc:creator>Intel_C_Intel</dc:creator>
      <dc:date>2006-05-02T01:34:53Z</dc:date>
    </item>
  </channel>
</rss>

