<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic It is not generally helpful in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Simple-vectcorization-question/m-p/1078175#M22665</link>
    <description>&lt;P&gt;It is not generally helpful to post the same question on multiple forums.&amp;nbsp; This question does not belong here in the Intel MKL forum, since it has nothing to do with MKL.&amp;nbsp; Appropriate forums might include:&lt;/P&gt;

&lt;UL&gt;
	&lt;LI&gt;&lt;A href="https://software.intel.com/en-us/forums/software-tuning-performance-optimization-platform-monitoring" target="_blank"&gt;https://software.intel.com/en-us/forums/software-tuning-performance-optimization-platform-monitoring&lt;/A&gt;&lt;/LI&gt;
	&lt;LI&gt;&lt;A href="https://software.intel.com/en-us/forums/intel-moderncode-for-parallel-architectures" target="_blank"&gt;https://software.intel.com/en-us/forums/intel-moderncode-for-parallel-architectures&lt;/A&gt;&lt;/LI&gt;
	&lt;LI&gt;&lt;A href="https://software.intel.com/en-us/forums/intel-many-integrated-core" target="_blank"&gt;https://software.intel.com/en-us/forums/intel-many-integrated-core&lt;/A&gt;&lt;/LI&gt;
	&lt;LI&gt;&lt;A href="https://software.intel.com/en-us/forums/intel-c-compiler" target="_blank"&gt;https://software.intel.com/en-us/forums/intel-c-compiler&lt;/A&gt;&lt;/LI&gt;
&lt;/UL&gt;

&lt;P&gt;The performance on this kernel is limited by a single core's memory bandwidth, as I explained in response to your post in the C Compiler Forum at &lt;A href="https://software.intel.com/en-us/forums/intel-c-compiler/topic/726771#comment-1902493" target="_blank"&gt;https://software.intel.com/en-us/forums/intel-c-compiler/topic/726771#comment-1902493&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Wed, 12 Apr 2017 16:22:16 GMT</pubDate>
    <dc:creator>McCalpinJohn</dc:creator>
    <dc:date>2017-04-12T16:22:16Z</dc:date>
    <item>
      <title>Simple vectcorization question</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Simple-vectcorization-question/m-p/1078170#M22660</link>
      <description>&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;Hi,&lt;/P&gt;

&lt;P&gt;I wrote a simple function&amp;nbsp;&lt;SPAN style="font-size: 13.008px;"&gt;and executed it on a KNL processor (68 cores, Flat Quadrature, using MCDRAM) using only &lt;STRONG&gt;one thread&lt;/STRONG&gt; and n=10,000,000. I execute this function 100 times and take the average, then calculate&amp;nbsp;the GFLOPS using the following formula gflops&amp;nbsp;= (1e-9 * 2.0 * n ) / execution time&lt;/SPAN&gt;&lt;/P&gt;

&lt;PRE class="brush:cpp;" style="font-size: 13.008px;"&gt;double multiplyAccum(long n,double *A, double *B)
{
    long i;
    double result = 0;
    #pragma novector
    //#pragma simd
    for ( i = 0; i &amp;lt; n; i++ )
    {
        result += A&lt;I&gt; * B&lt;I&gt;;
    }
    return result;
}&lt;/I&gt;&lt;/I&gt;&lt;/PRE&gt;

&lt;P&gt;1) When I use #pragma&amp;nbsp;novector,&amp;nbsp;I get 0.839571 GFLOPS/s&lt;/P&gt;

&lt;P&gt;This is the compiler report for the loop:&lt;/P&gt;

&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; remark #15319: loop was not vectorized: novector directive used&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; remark #25439: unrolled with remainder by 8 &amp;nbsp;&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; remark #25456: Number of Array Refs Scalar Replaced In Loop: 1&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; remark #25457: Number of partial sums replaced: 1&lt;/P&gt;

&lt;P&gt;When I use #pragma simd, I get &amp;nbsp;1.495788 GFLOPS/s&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 13.008px;"&gt;This is the compiler report for the loop:&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; remark #15388: vectorization support: reference A_34279 has aligned access &amp;nbsp; [ multiplyAccum.cpp(64,3) ]&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; remark #15388: vectorization support: reference B_34279 has aligned access &amp;nbsp; [ multiplyAccum.cpp(64,3) ]&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; remark #15305: vectorization support: vector length 8&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; remark #15399: vectorization support: unroll factor set to 8&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; remark #15309: vectorization support: normalized vectorization overhead 0.446&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; remark #15301: SIMD LOOP WAS VECTORIZED&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; remark #15448: unmasked aligned unit stride loads: 2&amp;nbsp;&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; remark #15475: --- begin vector loop cost summary ---&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; remark #15476: scalar loop cost: 9&amp;nbsp;&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; remark #15477: vector loop cost: 0.870&amp;nbsp;&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;STRONG&gt;remark #15478: estimated potential speedup: 10.280&amp;nbsp;&lt;/STRONG&gt;&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; remark #15488: --- end vector loop cost summary ---&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; remark #25015: Estimate of max trip count of loop=156250&lt;/P&gt;

&lt;P&gt;The potential speedup is 10X, while I only get 1.8X, What is the explanation for this ?&amp;nbsp;&lt;/P&gt;

&lt;PRE class="brush:cpp;"&gt;
&amp;nbsp;&lt;/PRE&gt;

&lt;P&gt;Thanks,&lt;/P&gt;</description>
      <pubDate>Fri, 07 Apr 2017 23:18:28 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Simple-vectcorization-question/m-p/1078170#M22660</guid>
      <dc:creator>Mohammad_A_</dc:creator>
      <dc:date>2017-04-07T23:18:28Z</dc:date>
    </item>
    <item>
      <title>You may find a non-specific</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Simple-vectcorization-question/m-p/1078171#M22661</link>
      <description>&lt;P&gt;You may find a non-specific answer by reading about Amdahl's Law and its later modifications.&lt;/P&gt;</description>
      <pubDate>Sat, 08 Apr 2017 08:55:01 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Simple-vectcorization-question/m-p/1078171#M22661</guid>
      <dc:creator>mecej4</dc:creator>
      <dc:date>2017-04-08T08:55:01Z</dc:date>
    </item>
    <item>
      <title>Thank you for your advice.</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Simple-vectcorization-question/m-p/1078172#M22662</link>
      <description>&lt;P&gt;Thank you for your advice.&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 1em;"&gt;I only measure the execution time of the 'for loop' and nothing else.&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN style="font-size: 1em;"&gt;How is&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN style="font-size: 12px;"&gt;Amdahl's&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN style="font-size: 1em;"&gt;law related ?&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN style="font-size: 1em;"&gt;If I measure the execution time of the whole program I would say that the serial part, data movements. etc .. have the dominant impact on performance. But in my case, I am only interested in the vectorized part of my program.&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 1em;"&gt;[Start timing]&lt;/SPAN&gt;&lt;/P&gt;

&lt;DIV class="line alt2" style="font-family: Consolas, &amp;quot;Bitstream Vera Sans Mono&amp;quot;, &amp;quot;Courier New&amp;quot;, Courier, monospace; font-size: 13.008px; color: rgb(96, 96, 96); margin: 0px !important; padding: 0px !important; border: 0px !important; outline: 0px !important; background: none rgb(248, 248, 248) !important; float: none !important; vertical-align: baseline !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: auto !important; line-height: 1.1em !important; min-height: auto !important;"&gt;
	&lt;TABLE style="border-collapse: collapse !important; border: 0px !important; font-size: 1em !important; margin-top: 0px !important; margin-bottom: 0px !important; width: auto !important; vertical-align: baseline !important; outline: 0px !important; background: none !important; float: none !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; line-height: 1.1em !important; min-height: auto !important;"&gt;
		&lt;TBODY style="border: 0px !important; outline: 0px !important; background: none !important; float: none !important; vertical-align: baseline !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: auto !important; line-height: 1.1em !important; font-size: 1em !important; min-height: auto !important;"&gt;
			&lt;TR style="border: 0px !important; outline: 0px !important; background: none !important; float: none !important; vertical-align: baseline !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: auto !important; line-height: 1.1em !important; font-size: 1em !important; min-height: auto !important;"&gt;
				&lt;TD class="content" style="border-width: 0px 0px 0px 3px !important; border-bottom-style: initial !important; padding: 0px 0px 0px 0.5em !important; border-top-style: initial !important; border-right-style: initial !important; border-left-style: solid !important; border-top-color: initial !important; border-right-color: initial !important; border-left-color: rgb(108, 226, 108) !important; border-image: initial !important; outline: 0px !important; background: none !important; float: none !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: auto !important; line-height: 1.1em !important; font-size: 1em !important; min-height: auto !important; color: rgb(0, 0, 0) !important;"&gt;&lt;CODE class="comments" style="font-family: Consolas, &amp;quot;Bitstream Vera Sans Mono&amp;quot;, &amp;quot;Courier New&amp;quot;, Courier, monospace !important; margin-top: 0px !important; margin-bottom: 0px !important; padding: 0px !important; border: 0px !important; outline: 0px !important; background: none !important; float: none !important; vertical-align: baseline !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: auto !important; line-height: 1.1em !important; min-height: auto !important; color: rgb(0, 130, 0) !important;"&gt;//#pragma simd&lt;/CODE&gt;&lt;/TD&gt;
			&lt;/TR&gt;
		&lt;/TBODY&gt;
	&lt;/TABLE&gt;
&lt;/DIV&gt;

&lt;DIV class="line alt1" style="font-family: Consolas, &amp;quot;Bitstream Vera Sans Mono&amp;quot;, &amp;quot;Courier New&amp;quot;, Courier, monospace; font-size: 13.008px; color: rgb(96, 96, 96); margin: 0px !important; padding: 0px !important; border: 0px !important; outline: 0px !important; background-image: none !important; background-position: initial !important; background-size: initial !important; background-repeat: initial !important; background-attachment: initial !important; background-origin: initial !important; background-clip: initial !important; float: none !important; vertical-align: baseline !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: auto !important; line-height: 1.1em !important; min-height: auto !important;"&gt;
	&lt;TABLE style="border-collapse: collapse !important; border: 0px !important; font-size: 1em !important; margin-top: 0px !important; margin-bottom: 0px !important; width: auto !important; vertical-align: baseline !important; outline: 0px !important; background: none !important; float: none !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; line-height: 1.1em !important; min-height: auto !important;"&gt;
		&lt;TBODY style="border: 0px !important; outline: 0px !important; background: none !important; float: none !important; vertical-align: baseline !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: auto !important; line-height: 1.1em !important; font-size: 1em !important; min-height: auto !important;"&gt;
			&lt;TR style="border: 0px !important; outline: 0px !important; background: none !important; float: none !important; vertical-align: baseline !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: auto !important; line-height: 1.1em !important; font-size: 1em !important; min-height: auto !important;"&gt;
				&lt;TD class="number" style="border-width: 0px !important; border-style: initial !important; border-top-color: initial !important; border-right-color: initial !important; border-left-color: initial !important; border-image: initial !important; padding: 0px !important; outline: 0px !important; background: none !important; float: none !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: 3em !important; line-height: 1.1em !important; font-size: 1em !important; min-height: auto !important; color: rgb(175, 175, 175) !important;"&gt;&lt;CODE style="font-family: Consolas, &amp;quot;Bitstream Vera Sans Mono&amp;quot;, &amp;quot;Courier New&amp;quot;, Courier, monospace !important; margin: 0px !important; padding: 0px 0.3em 0px 0px !important; border: 0px !important; outline: 0px !important; background: none !important; text-align: right !important; float: none !important; vertical-align: baseline !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: 2.7em !important; line-height: 1.1em !important; min-height: auto !important; display: block !important;"&gt;07&lt;/CODE&gt;&lt;/TD&gt;
				&lt;TD class="content" style="border-width: 0px 0px 0px 3px !important; border-bottom-style: initial !important; padding: 0px 0px 0px 0.5em !important; border-top-style: initial !important; border-right-style: initial !important; border-left-style: solid !important; border-top-color: initial !important; border-right-color: initial !important; border-left-color: rgb(108, 226, 108) !important; border-image: initial !important; outline: 0px !important; background: none !important; float: none !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: auto !important; line-height: 1.1em !important; font-size: 1em !important; min-height: auto !important; color: rgb(0, 0, 0) !important;"&gt;&lt;CODE class="spaces" style="font-family: Consolas, &amp;quot;Bitstream Vera Sans Mono&amp;quot;, &amp;quot;Courier New&amp;quot;, Courier, monospace !important; margin: 0px !important; padding: 0px !important; border: 0px !important; outline: 0px !important; background: none !important; float: none !important; vertical-align: baseline !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: auto !important; line-height: 1.1em !important; min-height: auto !important;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&lt;/CODE&gt;&lt;CODE class="keyword bold" style="font-family: Consolas, &amp;quot;Bitstream Vera Sans Mono&amp;quot;, &amp;quot;Courier New&amp;quot;, Courier, monospace !important; margin: 0px !important; padding: 0px !important; border: 0px !important; outline: 0px !important; background: none !important; float: none !important; vertical-align: baseline !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: auto !important; line-height: 1.1em !important; font-weight: bold !important; min-height: auto !important; color: rgb(0, 102, 153) !important;"&gt;for&lt;/CODE&gt;&amp;nbsp;&lt;CODE class="plain" style="font-family: Consolas, &amp;quot;Bitstream Vera Sans Mono&amp;quot;, &amp;quot;Courier New&amp;quot;, Courier, monospace !important; margin: 0px !important; padding: 0px !important; border: 0px !important; outline: 0px !important; background: none !important; float: none !important; vertical-align: baseline !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: auto !important; line-height: 1.1em !important; min-height: auto !important;"&gt;( i = 0; i &amp;lt; n; i++ )&lt;/CODE&gt;&lt;/TD&gt;
			&lt;/TR&gt;
		&lt;/TBODY&gt;
	&lt;/TABLE&gt;
&lt;/DIV&gt;

&lt;DIV class="line alt2" style="font-family: Consolas, &amp;quot;Bitstream Vera Sans Mono&amp;quot;, &amp;quot;Courier New&amp;quot;, Courier, monospace; font-size: 13.008px; color: rgb(96, 96, 96); margin: 0px !important; padding: 0px !important; border: 0px !important; outline: 0px !important; background: none rgb(248, 248, 248) !important; float: none !important; vertical-align: baseline !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: auto !important; line-height: 1.1em !important; min-height: auto !important;"&gt;
	&lt;TABLE style="border-collapse: collapse !important; border: 0px !important; font-size: 1em !important; margin-top: 0px !important; margin-bottom: 0px !important; width: auto !important; vertical-align: baseline !important; outline: 0px !important; background: none !important; float: none !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; line-height: 1.1em !important; min-height: auto !important;"&gt;
		&lt;TBODY style="border: 0px !important; outline: 0px !important; background: none !important; float: none !important; vertical-align: baseline !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: auto !important; line-height: 1.1em !important; font-size: 1em !important; min-height: auto !important;"&gt;
			&lt;TR style="border: 0px !important; outline: 0px !important; background: none !important; float: none !important; vertical-align: baseline !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: auto !important; line-height: 1.1em !important; font-size: 1em !important; min-height: auto !important;"&gt;
				&lt;TD class="number" style="border-width: 0px !important; border-style: initial !important; border-top-color: initial !important; border-right-color: initial !important; border-left-color: initial !important; border-image: initial !important; padding: 0px !important; outline: 0px !important; background: none !important; float: none !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: 3em !important; line-height: 1.1em !important; font-size: 1em !important; min-height: auto !important; color: rgb(175, 175, 175) !important;"&gt;&lt;CODE style="font-family: Consolas, &amp;quot;Bitstream Vera Sans Mono&amp;quot;, &amp;quot;Courier New&amp;quot;, Courier, monospace !important; margin: 0px !important; padding: 0px 0.3em 0px 0px !important; border: 0px !important; outline: 0px !important; background: none !important; text-align: right !important; float: none !important; vertical-align: baseline !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: 2.7em !important; line-height: 1.1em !important; min-height: auto !important; display: block !important;"&gt;08&lt;/CODE&gt;&lt;/TD&gt;
				&lt;TD class="content" style="border-width: 0px 0px 0px 3px !important; border-bottom-style: initial !important; padding: 0px 0px 0px 0.5em !important; border-top-style: initial !important; border-right-style: initial !important; border-left-style: solid !important; border-top-color: initial !important; border-right-color: initial !important; border-left-color: rgb(108, 226, 108) !important; border-image: initial !important; outline: 0px !important; background: none !important; float: none !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: auto !important; line-height: 1.1em !important; font-size: 1em !important; min-height: auto !important; color: rgb(0, 0, 0) !important;"&gt;&lt;CODE class="spaces" style="font-family: Consolas, &amp;quot;Bitstream Vera Sans Mono&amp;quot;, &amp;quot;Courier New&amp;quot;, Courier, monospace !important; margin: 0px !important; padding: 0px !important; border: 0px !important; outline: 0px !important; background: none !important; float: none !important; vertical-align: baseline !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: auto !important; line-height: 1.1em !important; min-height: auto !important;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&lt;/CODE&gt;&lt;CODE class="plain" style="font-family: Consolas, &amp;quot;Bitstream Vera Sans Mono&amp;quot;, &amp;quot;Courier New&amp;quot;, Courier, monospace !important; margin: 0px !important; padding: 0px !important; border: 0px !important; outline: 0px !important; background: none !important; float: none !important; vertical-align: baseline !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: auto !important; line-height: 1.1em !important; min-height: auto !important;"&gt;{&lt;/CODE&gt;&lt;/TD&gt;
			&lt;/TR&gt;
		&lt;/TBODY&gt;
	&lt;/TABLE&gt;
&lt;/DIV&gt;

&lt;DIV class="line alt1" style="font-family: Consolas, &amp;quot;Bitstream Vera Sans Mono&amp;quot;, &amp;quot;Courier New&amp;quot;, Courier, monospace; font-size: 13.008px; color: rgb(96, 96, 96); margin: 0px !important; padding: 0px !important; border: 0px !important; outline: 0px !important; background-image: none !important; background-position: initial !important; background-size: initial !important; background-repeat: initial !important; background-attachment: initial !important; background-origin: initial !important; background-clip: initial !important; float: none !important; vertical-align: baseline !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: auto !important; line-height: 1.1em !important; min-height: auto !important;"&gt;
	&lt;TABLE style="border-collapse: collapse !important; border: 0px !important; font-size: 1em !important; margin-top: 0px !important; margin-bottom: 0px !important; width: auto !important; vertical-align: baseline !important; outline: 0px !important; background: none !important; float: none !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; line-height: 1.1em !important; min-height: auto !important;"&gt;
		&lt;TBODY style="border: 0px !important; outline: 0px !important; background: none !important; float: none !important; vertical-align: baseline !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: auto !important; line-height: 1.1em !important; font-size: 1em !important; min-height: auto !important;"&gt;
			&lt;TR style="border: 0px !important; outline: 0px !important; background: none !important; float: none !important; vertical-align: baseline !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: auto !important; line-height: 1.1em !important; font-size: 1em !important; min-height: auto !important;"&gt;
				&lt;TD class="number" style="border-width: 0px !important; border-style: initial !important; border-top-color: initial !important; border-right-color: initial !important; border-left-color: initial !important; border-image: initial !important; padding: 0px !important; outline: 0px !important; background: none !important; float: none !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: 3em !important; line-height: 1.1em !important; font-size: 1em !important; min-height: auto !important; color: rgb(175, 175, 175) !important;"&gt;&lt;CODE style="font-family: Consolas, &amp;quot;Bitstream Vera Sans Mono&amp;quot;, &amp;quot;Courier New&amp;quot;, Courier, monospace !important; margin: 0px !important; padding: 0px 0.3em 0px 0px !important; border: 0px !important; outline: 0px !important; background: none !important; text-align: right !important; float: none !important; vertical-align: baseline !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: 2.7em !important; line-height: 1.1em !important; min-height: auto !important; display: block !important;"&gt;09&lt;/CODE&gt;&lt;/TD&gt;
				&lt;TD class="content" style="border-width: 0px 0px 0px 3px !important; border-bottom-style: initial !important; padding: 0px 0px 0px 0.5em !important; border-top-style: initial !important; border-right-style: initial !important; border-left-style: solid !important; border-top-color: initial !important; border-right-color: initial !important; border-left-color: rgb(108, 226, 108) !important; border-image: initial !important; outline: 0px !important; background: none !important; float: none !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: auto !important; line-height: 1.1em !important; font-size: 1em !important; min-height: auto !important; color: rgb(0, 0, 0) !important;"&gt;&lt;CODE class="spaces" style="font-family: Consolas, &amp;quot;Bitstream Vera Sans Mono&amp;quot;, &amp;quot;Courier New&amp;quot;, Courier, monospace !important; margin: 0px !important; padding: 0px !important; border: 0px !important; outline: 0px !important; background: none !important; float: none !important; vertical-align: baseline !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: auto !important; line-height: 1.1em !important; min-height: auto !important;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&lt;/CODE&gt;&lt;CODE class="plain" style="font-family: Consolas, &amp;quot;Bitstream Vera Sans Mono&amp;quot;, &amp;quot;Courier New&amp;quot;, Courier, monospace !important; margin: 0px !important; padding: 0px !important; border: 0px !important; outline: 0px !important; background: none !important; float: none !important; vertical-align: baseline !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: auto !important; line-height: 1.1em !important; min-height: auto !important;"&gt;result += A&lt;I&gt; * B&lt;I&gt;;&lt;/I&gt;&lt;/I&gt;&lt;/CODE&gt;&lt;I&gt;&lt;I&gt;&lt;/I&gt;&lt;/I&gt;&lt;/TD&gt;
			&lt;/TR&gt;
		&lt;/TBODY&gt;
	&lt;/TABLE&gt;
&lt;/DIV&gt;

&lt;DIV class="line alt2" style="font-family: Consolas, &amp;quot;Bitstream Vera Sans Mono&amp;quot;, &amp;quot;Courier New&amp;quot;, Courier, monospace; font-size: 13.008px; color: rgb(96, 96, 96); margin: 0px !important; padding: 0px !important; border: 0px !important; outline: 0px !important; background: none rgb(248, 248, 248) !important; float: none !important; vertical-align: baseline !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: auto !important; line-height: 1.1em !important; min-height: auto !important;"&gt;
	&lt;P&gt;&amp;nbsp;&lt;/P&gt;

	&lt;P&gt;[End Timing]&lt;/P&gt;

	&lt;P&gt;Thanks,&lt;/P&gt;

	&lt;P&gt;&amp;nbsp;&lt;/P&gt;

	&lt;TABLE style="border-collapse: collapse !important; border: 0px !important; font-size: 1em !important; margin-top: 0px !important; margin-bottom: 0px !important; width: auto !important; vertical-align: baseline !important; outline: 0px !important; background: none !important; float: none !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; line-height: 1.1em !important; min-height: auto !important;"&gt;
		&lt;TBODY style="border: 0px !important; outline: 0px !important; background: none !important; float: none !important; vertical-align: baseline !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: auto !important; line-height: 1.1em !important; font-size: 1em !important; min-height: auto !important;"&gt;
			&lt;TR style="border: 0px !important; outline: 0px !important; background: none !important; float: none !important; vertical-align: baseline !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: auto !important; line-height: 1.1em !important; font-size: 1em !important; min-height: auto !important;"&gt;
				&lt;TD class="number" style="border-width: 0px !important; border-style: initial !important; border-top-color: initial !important; border-right-color: initial !important; border-left-color: initial !important; border-image: initial !important; padding: 0px !important; outline: 0px !important; background: none !important; float: none !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: 3em !important; line-height: 1.1em !important; font-size: 1em !important; min-height: auto !important; color: rgb(175, 175, 175) !important;"&gt;&lt;CODE style="font-family: Consolas, &amp;quot;Bitstream Vera Sans Mono&amp;quot;, &amp;quot;Courier New&amp;quot;, Courier, monospace !important; margin: 0px !important; padding: 0px 0.3em 0px 0px !important; border: 0px !important; outline: 0px !important; background: none !important; text-align: right !important; float: none !important; vertical-align: baseline !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: 2.7em !important; line-height: 1.1em !important; min-height: auto !important; display: block !important;"&gt;10&lt;/CODE&gt;&lt;/TD&gt;
				&lt;TD class="content" style="border-width: 0px 0px 0px 3px !important; border-bottom-style: initial !important; padding: 0px 0px 0px 0.5em !important; border-top-style: initial !important; border-right-style: initial !important; border-left-style: solid !important; border-top-color: initial !important; border-right-color: initial !important; border-left-color: rgb(108, 226, 108) !important; border-image: initial !important; outline: 0px !important; background: none !important; float: none !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: auto !important; line-height: 1.1em !important; font-size: 1em !important; min-height: auto !important; color: rgb(0, 0, 0) !important;"&gt;&lt;CODE class="spaces" style="font-family: Consolas, &amp;quot;Bitstream Vera Sans Mono&amp;quot;, &amp;quot;Courier New&amp;quot;, Courier, monospace !important; margin: 0px !important; padding: 0px !important; border: 0px !important; outline: 0px !important; background: none !important; float: none !important; vertical-align: baseline !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: auto !important; line-height: 1.1em !important; min-height: auto !important;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&lt;/CODE&gt;&lt;CODE class="plain" style="font-family: Consolas, &amp;quot;Bitstream Vera Sans Mono&amp;quot;, &amp;quot;Courier New&amp;quot;, Courier, monospace !important; margin: 0px !important; padding: 0px !important; border: 0px !important; outline: 0px !important; background: none !important; float: none !important; vertical-align: baseline !important; position: static !important; left: auto !important; top: auto !important; right: auto !important; bottom: auto !important; height: auto !important; width: auto !important; line-height: 1.1em !important; min-height: auto !important;"&gt;}&lt;/CODE&gt;&lt;/TD&gt;
			&lt;/TR&gt;
		&lt;/TBODY&gt;
	&lt;/TABLE&gt;
&lt;/DIV&gt;</description>
      <pubDate>Sun, 09 Apr 2017 02:11:20 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Simple-vectcorization-question/m-p/1078172#M22662</guid>
      <dc:creator>Mohammad_A_</dc:creator>
      <dc:date>2017-04-09T02:11:20Z</dc:date>
    </item>
    <item>
      <title>#pragma simd requires the</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Simple-vectcorization-question/m-p/1078173#M22663</link>
      <description>&lt;P&gt;#pragma simd requires the reduction clause to be explicit. &amp;nbsp;Where is your mkl question?&lt;/P&gt;</description>
      <pubDate>Sun, 09 Apr 2017 12:12:15 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Simple-vectcorization-question/m-p/1078173#M22663</guid>
      <dc:creator>TimP</dc:creator>
      <dc:date>2017-04-09T12:12:15Z</dc:date>
    </item>
    <item>
      <title>Hi Mohammad ,</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Simple-vectcorization-question/m-p/1078174#M22664</link>
      <description>&lt;P&gt;Hi &lt;A href="https://software.intel.com/en-us/user/1404856"&gt;&lt;U&gt;&lt;FONT color="#0066cc"&gt;Mohammad &lt;/FONT&gt;&lt;/U&gt;&lt;/A&gt;,&lt;/P&gt;

&lt;P&gt;If you'd like to try MKL function,&amp;nbsp; you may try&amp;nbsp; replace the&amp;nbsp;function &amp;nbsp;&lt;FONT face="Courier New"&gt;multiplyAccum()&lt;/FONT&gt;&lt;/P&gt;

&lt;P&gt;with &lt;SPAN class="fontstyle0"&gt;&lt;FONT size="2"&gt;double cblas_ddot &lt;/FONT&gt;&lt;/SPAN&gt;&lt;SPAN class="fontstyle0" style="font-size: 9pt;"&gt;( )&amp;nbsp; &lt;/SPAN&gt;&lt;SPAN class="fontstyle0"&gt;&lt;FONT size="2"&gt;Computes a vector-vector dot product and compile it with &lt;/FONT&gt;&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN class="fontstyle0"&gt;&lt;FONT size="2"&gt;icc yourmainc.cpp&amp;nbsp; -mkl&amp;nbsp;&amp;nbsp;&amp;nbsp; and let us know if any result. &lt;/FONT&gt;&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;FYI: &amp;nbsp;mkl user guide&amp;nbsp; &lt;A href="https://software.intel.com/en-us/node/528582"&gt;&lt;U&gt;&lt;FONT color="#0066cc"&gt;&lt;/FONT&gt;&lt;/U&gt;&lt;/A&gt;&lt;A href="https://software.intel.com/en-us/node/528582" target="_blank"&gt;https://software.intel.com/en-us/node/528582&lt;/A&gt;&amp;nbsp; about the memory alignment etc to improve the performance.&lt;/P&gt;

&lt;P&gt;and the mkl developer manual: &lt;A href="https://software.intel.com/en-us/mkl-developer-reference-c"&gt;https://software.intel.com/en-us/mkl-developer-reference-c&lt;/A&gt;&lt;BR style="text-transform: none; line-height: normal; text-indent: 0px; letter-spacing: normal; font-style: normal; font-variant: normal; font-weight: normal; word-spacing: 0px; white-space: normal; orphans: 2; widows: 2; -webkit-text-size-adjust: auto; -webkit-text-stroke-width: 0px;" /&gt;
	&lt;SPAN class="fontstyle0"&gt;&lt;FONT size="2"&gt;double cblas_ddot &lt;/FONT&gt;&lt;/SPAN&gt;&lt;SPAN class="fontstyle0" style="font-size: 9pt;"&gt;(&lt;/SPAN&gt;&lt;FONT size="2"&gt;&lt;SPAN class="fontstyle0"&gt;const MKL_INT &lt;/SPAN&gt;&lt;SPAN class="fontstyle2"&gt;&lt;EM&gt;n&lt;/EM&gt;&lt;/SPAN&gt;&lt;SPAN class="fontstyle0"&gt;, const double &lt;/SPAN&gt;&lt;SPAN class="fontstyle2"&gt;&lt;EM&gt;*x&lt;/EM&gt;&lt;/SPAN&gt;&lt;SPAN class="fontstyle0"&gt;, const MKL_INT &lt;/SPAN&gt;&lt;SPAN class="fontstyle2"&gt;&lt;EM&gt;incx&lt;/EM&gt;&lt;/SPAN&gt;&lt;/FONT&gt;&lt;FONT size="2"&gt;&lt;SPAN class="fontstyle0"&gt;, const double&lt;/SPAN&gt;&lt;BR /&gt;
	&lt;SPAN class="fontstyle2"&gt;&lt;EM&gt;*y&lt;/EM&gt;&lt;/SPAN&gt;&lt;SPAN class="fontstyle0"&gt;, const MKL_INT &lt;/SPAN&gt;&lt;SPAN class="fontstyle2"&gt;&lt;EM&gt;incy&lt;/EM&gt;&lt;/SPAN&gt;&lt;/FONT&gt;&lt;SPAN class="fontstyle0" style="font-size: 9pt;"&gt;);&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;FONT size="2"&gt;Best Regards,&lt;/FONT&gt;&lt;/P&gt;

&lt;P&gt;&lt;FONT size="2"&gt;Ying &lt;/FONT&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 10 Apr 2017 01:31:05 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Simple-vectcorization-question/m-p/1078174#M22664</guid>
      <dc:creator>Ying_H_Intel</dc:creator>
      <dc:date>2017-04-10T01:31:05Z</dc:date>
    </item>
    <item>
      <title>It is not generally helpful</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Simple-vectcorization-question/m-p/1078175#M22665</link>
      <description>&lt;P&gt;It is not generally helpful to post the same question on multiple forums.&amp;nbsp; This question does not belong here in the Intel MKL forum, since it has nothing to do with MKL.&amp;nbsp; Appropriate forums might include:&lt;/P&gt;

&lt;UL&gt;
	&lt;LI&gt;&lt;A href="https://software.intel.com/en-us/forums/software-tuning-performance-optimization-platform-monitoring" target="_blank"&gt;https://software.intel.com/en-us/forums/software-tuning-performance-optimization-platform-monitoring&lt;/A&gt;&lt;/LI&gt;
	&lt;LI&gt;&lt;A href="https://software.intel.com/en-us/forums/intel-moderncode-for-parallel-architectures" target="_blank"&gt;https://software.intel.com/en-us/forums/intel-moderncode-for-parallel-architectures&lt;/A&gt;&lt;/LI&gt;
	&lt;LI&gt;&lt;A href="https://software.intel.com/en-us/forums/intel-many-integrated-core" target="_blank"&gt;https://software.intel.com/en-us/forums/intel-many-integrated-core&lt;/A&gt;&lt;/LI&gt;
	&lt;LI&gt;&lt;A href="https://software.intel.com/en-us/forums/intel-c-compiler" target="_blank"&gt;https://software.intel.com/en-us/forums/intel-c-compiler&lt;/A&gt;&lt;/LI&gt;
&lt;/UL&gt;

&lt;P&gt;The performance on this kernel is limited by a single core's memory bandwidth, as I explained in response to your post in the C Compiler Forum at &lt;A href="https://software.intel.com/en-us/forums/intel-c-compiler/topic/726771#comment-1902493" target="_blank"&gt;https://software.intel.com/en-us/forums/intel-c-compiler/topic/726771#comment-1902493&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 12 Apr 2017 16:22:16 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Simple-vectcorization-question/m-p/1078175#M22665</guid>
      <dc:creator>McCalpinJohn</dc:creator>
      <dc:date>2017-04-12T16:22:16Z</dc:date>
    </item>
  </channel>
</rss>

