<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Performance is worse with hyperthreading on dual xeon w5580 in Intel® Moderncode for Parallel Architectures</title>
    <link>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Performance-is-worse-with-hyperthreading-on-dual-xeon-w5580/m-p/892045#M3818</link>
    <description>Our tests show that the live HD capture performance is much worse with hyperthreading than without it on the new HP Z800 system. &lt;BR /&gt;&lt;BR /&gt;The new Nahalem processor has small individual L2 cache (512K per core) with an relatively large L3 cache shared by the four cores. Our threads all execute data-hungary tasks. Do you think the small L2 cache may cause too many misses with hyperthreading?</description>
    <pubDate>Tue, 18 Aug 2009 17:58:34 GMT</pubDate>
    <dc:creator>shyaki</dc:creator>
    <dc:date>2009-08-18T17:58:34Z</dc:date>
    <item>
      <title>Performance is worse with hyperthreading on dual xeon w5580</title>
      <link>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Performance-is-worse-with-hyperthreading-on-dual-xeon-w5580/m-p/892045#M3818</link>
      <description>Our tests show that the live HD capture performance is much worse with hyperthreading than without it on the new HP Z800 system. &lt;BR /&gt;&lt;BR /&gt;The new Nahalem processor has small individual L2 cache (512K per core) with an relatively large L3 cache shared by the four cores. Our threads all execute data-hungary tasks. Do you think the small L2 cache may cause too many misses with hyperthreading?</description>
      <pubDate>Tue, 18 Aug 2009 17:58:34 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Performance-is-worse-with-hyperthreading-on-dual-xeon-w5580/m-p/892045#M3818</guid>
      <dc:creator>shyaki</dc:creator>
      <dc:date>2009-08-18T17:58:34Z</dc:date>
    </item>
    <item>
      <title>Re: Performance is worse with hyperthreading on dual xeon w5580</title>
      <link>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Performance-is-worse-with-hyperthreading-on-dual-xeon-w5580/m-p/892046#M3819</link>
      <description>&lt;DIV style="margin:0px;"&gt;
&lt;DIV id="quote_reply" style="width: 100%; margin-top: 5px;"&gt;
&lt;DIV style="margin-left:2px;margin-right:2px;"&gt;Quoting - &lt;A href="https://community.intel.com/en-us/profile/262284"&gt;shyaki&lt;/A&gt;&lt;/DIV&gt;
&lt;DIV style="background-color:#E5E5E5; padding:5px;border: 1px; border-style: inset;margin-left:2px;margin-right:2px;"&gt;&lt;EM&gt;Our tests show that the live HD capture performance is much worse with hyperthreading than without it on the new HP Z800 system. &lt;BR /&gt;&lt;BR /&gt;The new Nahalem processor has small individual L2 cache (512K per core) with an relatively large L3 cache shared by the four cores. Our threads all execute data-hungary tasks. Do you think the small L2 cache may cause too many misses with hyperthreading?&lt;/EM&gt;&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
It's certainly possible, if a thread of your application requires more than half the L1 or L2 cache, or 8 threads together require more than than the entire L3, that cache capacity problems would produce the effect you reported.&lt;BR /&gt;If there is an advantage to specifying which threads share cores and associated caches, you would need that specification (KMP_AFFINITY for Intel OpenMP).&lt;BR /&gt;Data intensive applications with good memory locality are likely not to benefit from HT, as you could use up the entire memory bandwidth with 1 thread per core.&lt;BR /&gt;</description>
      <pubDate>Tue, 18 Aug 2009 18:13:55 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Performance-is-worse-with-hyperthreading-on-dual-xeon-w5580/m-p/892046#M3819</guid>
      <dc:creator>TimP</dc:creator>
      <dc:date>2009-08-18T18:13:55Z</dc:date>
    </item>
    <item>
      <title>Re: Performance is worse with hyperthreading on dual xeon w5580</title>
      <link>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Performance-is-worse-with-hyperthreading-on-dual-xeon-w5580/m-p/892047#M3820</link>
      <description>&lt;DIV style="margin:0px;"&gt;&lt;/DIV&gt;
&lt;BR /&gt;If you use vtune you should be able to see the cache behavior reported. If you are not a vtune user it is a good tool and makes use of the on chip hardware profiling to measure details of what is happening on the processor. You could compare the reports single and multi-threaded and see what is going on.</description>
      <pubDate>Fri, 21 Aug 2009 00:27:39 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Performance-is-worse-with-hyperthreading-on-dual-xeon-w5580/m-p/892047#M3820</guid>
      <dc:creator>Tom_Spyrou</dc:creator>
      <dc:date>2009-08-21T00:27:39Z</dc:date>
    </item>
    <item>
      <title>Re: Performance is worse with hyperthreading on dual xeon w5580</title>
      <link>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Performance-is-worse-with-hyperthreading-on-dual-xeon-w5580/m-p/892048#M3821</link>
      <description>&lt;DIV style="margin:0px;"&gt;
&lt;DIV id="quote_reply" style="width: 100%; margin-top: 5px;"&gt;
&lt;DIV style="margin-left:2px;margin-right:2px;"&gt;Quoting - &lt;A href="https://community.intel.com/en-us/profile/262284"&gt;shyaki&lt;/A&gt;&lt;/DIV&gt;
&lt;DIV style="background-color:#E5E5E5; padding:5px;border: 1px; border-style: inset;margin-left:2px;margin-right:2px;"&gt;&lt;EM&gt;Our tests show that the live HD capture performance is much worse with hyperthreading than without it on the new HP Z800 system. &lt;BR /&gt;&lt;BR /&gt;The new Nahalem processor has small individual L2 cache (512K per core) with an relatively large L3 cache shared by the four cores. Our threads all execute data-hungary tasks. Do you think the small L2 cache may cause too many misses with hyperthreading?&lt;/EM&gt;&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;BR /&gt;
&lt;DIV&gt;I agree with the previous posts. You might find this guide usefulhttp://software.intel.com/en-us/articles/using-intel-vtune-performance-analyzer-to-optimize-software-on-intel-core-i7-processors/ to find out if cache misses harm your performance.&lt;/DIV&gt;</description>
      <pubDate>Fri, 21 Aug 2009 21:51:51 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Performance-is-worse-with-hyperthreading-on-dual-xeon-w5580/m-p/892048#M3821</guid>
      <dc:creator>Roman_D_Intel</dc:creator>
      <dc:date>2009-08-21T21:51:51Z</dc:date>
    </item>
  </channel>
</rss>

