<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Do ?axpy work for you? in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/How-to-use-fused-multiply-add-with-MKL/m-p/1172097#M28653</link>
    <description>Do ?axpy work for you?</description>
    <pubDate>Tue, 12 Dec 2017 15:22:13 GMT</pubDate>
    <dc:creator>TimP</dc:creator>
    <dc:date>2017-12-12T15:22:13Z</dc:date>
    <item>
      <title>How to use fused multiply–add with MKL?</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/How-to-use-fused-multiply-add-with-MKL/m-p/1172096#M28652</link>
      <description>I want to do basic a*x + b operation, where a, x and b are the vectors(or matrixes) with utilization of FMA processor capabilities. I think I&amp;nbsp;am using v?Mul +&amp;nbsp;v?Add I will get two separate operations. How to use FMA with the help of MKL and Intel compiler? Must I use FMA Intrinsics only?</description>
      <pubDate>Tue, 12 Dec 2017 14:08:28 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/How-to-use-fused-multiply-add-with-MKL/m-p/1172096#M28652</guid>
      <dc:creator>Konovalov__Pavel</dc:creator>
      <dc:date>2017-12-12T14:08:28Z</dc:date>
    </item>
    <item>
      <title>Do ?axpy work for you?</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/How-to-use-fused-multiply-add-with-MKL/m-p/1172097#M28653</link>
      <description>Do ?axpy work for you?</description>
      <pubDate>Tue, 12 Dec 2017 15:22:13 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/How-to-use-fused-multiply-add-with-MKL/m-p/1172097#M28653</guid>
      <dc:creator>TimP</dc:creator>
      <dc:date>2017-12-12T15:22:13Z</dc:date>
    </item>
    <item>
      <title>No, Tim. in ?axpy a is a</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/How-to-use-fused-multiply-add-with-MKL/m-p/1172098#M28654</link>
      <description>&lt;P&gt;No, Tim. in&amp;nbsp;&lt;SPAN style="font-size: 12px;"&gt;?axpy&amp;nbsp;a is a scalar, not a vector and there is no vector b.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 12 Dec 2017 16:04:11 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/How-to-use-fused-multiply-add-with-MKL/m-p/1172098#M28654</guid>
      <dc:creator>Konovalov__Pavel</dc:creator>
      <dc:date>2017-12-12T16:04:11Z</dc:date>
    </item>
    <item>
      <title>The C and Fortran compilers</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/How-to-use-fused-multiply-add-with-MKL/m-p/1172099#M28655</link>
      <description>&lt;P&gt;The C and Fortran compilers will generate FMA instructions from ordinary source code loops for which the FMA operation is appropriate, provided that&lt;/P&gt;

&lt;UL&gt;
	&lt;LI&gt;the target instruction set includes FMA (AVX2 or newer -- note that the default is SSE, which does not support FMA), and&lt;/LI&gt;
	&lt;LI&gt;the optimization level is high enough (at least O1, but preferably O2), and&lt;/LI&gt;
	&lt;LI&gt;you have not prohibited FMA with a different compiler flag (-no-fma or some options to the -fp-model flag).&lt;/LI&gt;
&lt;/UL&gt;

&lt;P&gt;There are a few cases involving reduction operations where the compiler will choose not to use FMA operations because it estimates that there will be a shorter critical path by splitting the operation (doing the multiplication earlier and the addition later).&lt;/P&gt;</description>
      <pubDate>Tue, 12 Dec 2017 19:13:39 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/How-to-use-fused-multiply-add-with-MKL/m-p/1172099#M28655</guid>
      <dc:creator>McCalpinJohn</dc:creator>
      <dc:date>2017-12-12T19:13:39Z</dc:date>
    </item>
    <item>
      <title>Thank you, John!  Is there</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/How-to-use-fused-multiply-add-with-MKL/m-p/1172100#M28656</link>
      <description>Thank you, John!  Is there any indication from the compiler ouput that loop is FMAsed :) ? like with vectorisation report</description>
      <pubDate>Tue, 12 Dec 2017 22:51:13 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/How-to-use-fused-multiply-add-with-MKL/m-p/1172100#M28656</guid>
      <dc:creator>Konovalov__Pavel</dc:creator>
      <dc:date>2017-12-12T22:51:13Z</dc:date>
    </item>
  </channel>
</rss>

