<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Pablo, in Intel® Integrated Performance Primitives</title>
    <link>https://community.intel.com/t5/Intel-Integrated-Performance/ippiSqr-function-very-slow/m-p/994886#M22812</link>
    <description>Pablo, 
it might be because of ippiSqr_*_ is not threaded but ippiMul - is threaded.
please check the perf results between these functions when link with serial version of IPP.</description>
    <pubDate>Thu, 04 Oct 2012 05:20:55 GMT</pubDate>
    <dc:creator>Gennady_F_Intel</dc:creator>
    <dc:date>2012-10-04T05:20:55Z</dc:date>
    <item>
      <title>ippiSqr function very slow</title>
      <link>https://community.intel.com/t5/Intel-Integrated-Performance/ippiSqr-function-very-slow/m-p/994884#M22810</link>
      <description>&lt;P&gt;Hi,&lt;BR /&gt;&lt;BR /&gt;If I want to square an Ipp32f image I find that using ippiMul_32f_C1R is many times (~7x) faster than ippiSqr_32f_C1R.&lt;BR /&gt;&lt;BR /&gt;I am evaluating a trial version ippIP AVX (e9) version: 7.1.0 (r36264).&lt;BR /&gt;I use:&lt;BR /&gt;Intel(R) Xeon(R) CPU E31235 @ 3.20GHz&lt;BR /&gt;KMP_AFFINITY=verbose,granularity=core,compact,0,0&lt;BR /&gt;1 packages x 4 cores/pkg x 2 threads/core (4 total cores)&lt;BR /&gt;gcc version 4.6.3 (Ubuntu/Linaro 4.6.3-1ubuntu5)&lt;/P&gt;
&lt;P&gt;&lt;BR /&gt;I observe that ippiSqr uses more cores even with the affinity configuration above.&lt;BR /&gt;&lt;BR /&gt;Thanks,&lt;BR /&gt;Pablo&lt;/P&gt;</description>
      <pubDate>Thu, 27 Sep 2012 08:08:30 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Integrated-Performance/ippiSqr-function-very-slow/m-p/994884#M22810</guid>
      <dc:creator>Pablo_N_</dc:creator>
      <dc:date>2012-09-27T08:08:30Z</dc:date>
    </item>
    <item>
      <title>Pablo,</title>
      <link>https://community.intel.com/t5/Intel-Integrated-Performance/ippiSqr-function-very-slow/m-p/994885#M22811</link>
      <description>Pablo, 

Do you have a code snippet that we could use to replicate this issue? That would help a lot. 

Thanks,
Chuck</description>
      <pubDate>Mon, 01 Oct 2012 18:50:50 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Integrated-Performance/ippiSqr-function-very-slow/m-p/994885#M22811</guid>
      <dc:creator>Chuck_De_Sylva</dc:creator>
      <dc:date>2012-10-01T18:50:50Z</dc:date>
    </item>
    <item>
      <title>Pablo,</title>
      <link>https://community.intel.com/t5/Intel-Integrated-Performance/ippiSqr-function-very-slow/m-p/994886#M22812</link>
      <description>Pablo, 
it might be because of ippiSqr_*_ is not threaded but ippiMul - is threaded.
please check the perf results between these functions when link with serial version of IPP.</description>
      <pubDate>Thu, 04 Oct 2012 05:20:55 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Integrated-Performance/ippiSqr-function-very-slow/m-p/994886#M22812</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2012-10-04T05:20:55Z</dc:date>
    </item>
  </channel>
</rss>

