<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Questions: in Intel® Moderncode for Parallel Architectures</title>
    <link>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Cache-performance-differences-between-nodes-running-same/m-p/1178993#M8102</link>
    <description>&lt;P&gt;Questions:&lt;/P&gt;

&lt;OL&gt;
	&lt;LI&gt;Are the "slow" nodes slow every time you run on them, or just slow for the duration of one job?&lt;/LI&gt;
	&lt;LI&gt;Do the nodes have Transparent Huge Pages enabled?&lt;/LI&gt;
&lt;/OL&gt;

&lt;P&gt;If the slow nodes are only slow for the duration of a run, then this looks like a standard cache conflict due to unlucky combinations of physical addresses.&lt;/P&gt;

&lt;UL&gt;
	&lt;LI&gt;Your data is consistent with cache conflicts in either the L2 or L3.&amp;nbsp;&lt;/LI&gt;
	&lt;LI&gt;Conflicts in the L2 cache are extremely common when using 4KiB pages.
		&lt;UL&gt;
			&lt;LI&gt;They arise because 3 bits of the L2 cache index are translated from contiguous virtual addresses to (pseudo-random) physical addresses.&amp;nbsp;&amp;nbsp; The cache can only hold its full capacity if the addresses being used are mapped to pages whose physical addresses have a completely uniform distribution of the values in these 3 bits.&lt;/LI&gt;
			&lt;LI&gt;No L2 cache index bits are translated when using 2MiB pages, so this type of cache conflict cannot happen (with contiguous addresses) when using 2MiB pages.&amp;nbsp;&lt;/LI&gt;
			&lt;LI&gt;Enabling Transparent Huge Pages is the easiest way to use 2MiB pages, but it is also possible to pre-allocate the 2MiB pages and use them for memory allocated by mmap(), shmget(), or by using libhugetlbfs.&lt;/LI&gt;
		&lt;/UL&gt;
	&lt;/LI&gt;
	&lt;LI&gt;Conflicts in the L3 cache are extremely rare in Haswell systems (due to the high associativity of the L3), but if an L3 conflict occurs, each cache line evicted from the L3 must be evicted from the L2 caches first.&amp;nbsp;&amp;nbsp;
		&lt;UL&gt;
			&lt;LI&gt;L3 cache indexing on Haswell is extremely complex, deliberately documented, and has only been "reverse-engineered" for a small number of configurations.&lt;/LI&gt;
		&lt;/UL&gt;
	&lt;/LI&gt;
	&lt;LI&gt;In either case, if slowdown is rare (but lasts the duration of the job), the easiest approach is to note that the job is running slow, kill it, and restart it.&lt;/LI&gt;
&lt;/UL&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Fri, 31 Aug 2018 19:21:02 GMT</pubDate>
    <dc:creator>McCalpinJohn</dc:creator>
    <dc:date>2018-08-31T19:21:02Z</dc:date>
    <item>
      <title>Cache performance differences between nodes running same application</title>
      <link>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Cache-performance-differences-between-nodes-running-same/m-p/1178990#M8099</link>
      <description>&lt;P&gt;&lt;SPAN style="font-size: 1em;"&gt;I am running a large application on an academic cluster (Haswell) and have observed performance differences between nodes running the exact same application on the exact same input data. I came to this point by comparing runs for a short simulated time vs a long simulated time and noticed that the time per iteration varied depending on the node (maybe that answers the question of why I am repeatedly running the same code with the same data.&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;Within the cluster now I have identified a set of 'slow' nodes and a smaller set of 'fast' nodes. I have been instrumenting a triply-nested set of lengthy floating point calculations with PAPI performance monitoring and have the following, slow node vs. fast node:&lt;/P&gt;

&lt;P&gt;&amp;nbsp;Total Instructions: &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;295627 vs. 295627&lt;BR /&gt;
	&amp;nbsp;Level 1 cache misses: &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 36258 vs. 36340&lt;BR /&gt;
	&amp;nbsp;Level 2 cache misses: &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 22726 vs. 12015&lt;BR /&gt;
	&amp;nbsp;Level 3 cache misses: &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 21771 vs. &amp;nbsp;1454&lt;/P&gt;

&lt;P&gt;The code being measured is:&lt;/P&gt;

&lt;P style="margin-bottom: 0px; font-stretch: normal; font-size: 10px; line-height: normal; font-family: Monaco; color: rgb(0, 0, 0); background-color: rgb(211, 211, 211);"&gt;&lt;SPAN style="font-variant-ligatures: no-common-ligatures"&gt;&amp;nbsp; &amp;nbsp;&amp;nbsp;do &amp;nbsp; &amp;nbsp; &amp;nbsp; k = lo(3), hi(3)&lt;/SPAN&gt;&lt;/P&gt;

&lt;P style="margin-bottom: 0px; font-stretch: normal; font-size: 10px; line-height: normal; font-family: Monaco; color: rgb(0, 0, 0); background-color: rgb(211, 211, 211);"&gt;&lt;SPAN style="font-variant-ligatures: no-common-ligatures"&gt;&amp;nbsp;&amp;nbsp; &amp;nbsp; &amp;nbsp; do&amp;nbsp; &amp;nbsp; j = lo(2), hi(2)&lt;/SPAN&gt;&lt;/P&gt;

&lt;P style="margin-bottom: 0px; font-stretch: normal; font-size: 10px; line-height: normal; font-family: Monaco; color: rgb(0, 0, 0); background-color: rgb(211, 211, 211);"&gt;&lt;SPAN style="font-variant-ligatures: no-common-ligatures"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; do i = lo(1), hi(1)&lt;/SPAN&gt;&lt;/P&gt;

&lt;P style="margin-bottom: 0px; font-stretch: normal; font-size: 10px; line-height: normal; font-family: Monaco; color: rgb(0, 0, 0); background-color: rgb(211, 211, 211);"&gt;&lt;SPAN style="font-variant-ligatures: no-common-ligatures"&gt;&amp;nbsp;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; y(i,j,k) = alpha*a(i,j,k)*x(i,j,k) &amp;amp;&lt;/SPAN&gt;&lt;/P&gt;

&lt;P style="margin-bottom: 0px; font-stretch: normal; font-size: 10px; line-height: normal; font-family: Monaco; color: rgb(0, 0, 0); background-color: rgb(211, 211, 211);"&gt;&lt;SPAN style="font-variant-ligatures: no-common-ligatures"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; - dhx * (bX(i+1,j,k)*(x(i+1,j,k) - x(i&amp;nbsp; ,j,k))&amp;nbsp; &amp;amp;&lt;/SPAN&gt;&lt;/P&gt;

&lt;P style="margin-bottom: 0px; font-stretch: normal; font-size: 10px; line-height: normal; font-family: Monaco; color: rgb(0, 0, 0); background-color: rgb(211, 211, 211);"&gt;&lt;SPAN style="font-variant-ligatures: no-common-ligatures"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;amp;&amp;nbsp; &amp;nbsp; &amp;nbsp; - bX(i&amp;nbsp; ,j,k)*(x(i&amp;nbsp; ,j,k) - x(i-1,j,k))) &amp;amp;&lt;/SPAN&gt;&lt;/P&gt;

&lt;P style="margin-bottom: 0px; font-stretch: normal; font-size: 10px; line-height: normal; font-family: Monaco; color: rgb(0, 0, 0); background-color: rgb(211, 211, 211);"&gt;&lt;SPAN style="font-variant-ligatures: no-common-ligatures"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; - dhy * (bY(i,j+1,k)*(x(i,j+1,k) - x(i,j&amp;nbsp; ,k))&amp;nbsp; &amp;amp;&lt;/SPAN&gt;&lt;/P&gt;

&lt;P style="margin-bottom: 0px; font-stretch: normal; font-size: 10px; line-height: normal; font-family: Monaco; color: rgb(0, 0, 0); background-color: rgb(211, 211, 211);"&gt;&lt;SPAN style="font-variant-ligatures: no-common-ligatures"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;amp;&amp;nbsp; &amp;nbsp; &amp;nbsp; - bY(i,j&amp;nbsp; ,k)*(x(i,j&amp;nbsp; ,k) - x(i,j-1,k))) &amp;amp;&lt;/SPAN&gt;&lt;/P&gt;

&lt;P style="margin-bottom: 0px; font-stretch: normal; font-size: 10px; line-height: normal; font-family: Monaco; color: rgb(0, 0, 0); background-color: rgb(211, 211, 211);"&gt;&lt;SPAN style="font-variant-ligatures: no-common-ligatures"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; - dhz * (bZ(i,j,k+1)*(x(i,j,k+1) - x(i,j,k&amp;nbsp; ))&amp;nbsp; &amp;amp;&lt;/SPAN&gt;&lt;/P&gt;

&lt;P style="margin-bottom: 0px; font-stretch: normal; font-size: 10px; line-height: normal; font-family: Monaco; color: rgb(0, 0, 0); background-color: rgb(211, 211, 211);"&gt;&lt;SPAN style="font-variant-ligatures: no-common-ligatures"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;amp;&amp;nbsp; &amp;nbsp; &amp;nbsp; - bZ(i,j,k&amp;nbsp; )*(x(i,j,k&amp;nbsp; ) - x(i,j,k-1)))&lt;/SPAN&gt;&lt;/P&gt;

&lt;P style="margin-bottom: 0px; font-stretch: normal; font-size: 10px; line-height: normal; font-family: Monaco; color: rgb(0, 0, 0); background-color: rgb(211, 211, 211);"&gt;&lt;SPAN style="font-variant-ligatures: no-common-ligatures"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; end do&lt;/SPAN&gt;&lt;/P&gt;

&lt;P style="margin-bottom: 0px; font-stretch: normal; font-size: 10px; line-height: normal; font-family: Monaco; color: rgb(0, 0, 0); background-color: rgb(211, 211, 211);"&gt;&lt;SPAN style="font-variant-ligatures: no-common-ligatures"&gt;&amp;nbsp;&amp;nbsp; &amp;nbsp; &amp;nbsp; end do&lt;/SPAN&gt;&lt;/P&gt;

&lt;P style="margin-bottom: 0px; font-stretch: normal; font-size: 10px; line-height: normal; font-family: Monaco; color: rgb(0, 0, 0); background-color: rgb(211, 211, 211);"&gt;&lt;SPAN style="font-variant-ligatures: no-common-ligatures"&gt;&amp;nbsp; &amp;nbsp; end do&lt;/SPAN&gt;&lt;/P&gt;

&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;

&lt;DIV&gt;&amp;nbsp;Is there a hardware explanation for this difference in cache performance?&lt;/DIV&gt;

&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;

&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 31 Aug 2018 17:36:29 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Cache-performance-differences-between-nodes-running-same/m-p/1178990#M8099</guid>
      <dc:creator>Page__Mike</dc:creator>
      <dc:date>2018-08-31T17:36:29Z</dc:date>
    </item>
    <item>
      <title>Additional measurement, slow</title>
      <link>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Cache-performance-differences-between-nodes-running-same/m-p/1178991#M8100</link>
      <description>&lt;P&gt;Additional measurement, slow vs. fast&lt;/P&gt;

&lt;P&gt;Inst TLB misses: &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;325 vs. 444&lt;BR /&gt;
	Data prefetch cache misses: &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;8111 vs. 8&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 31 Aug 2018 17:41:15 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Cache-performance-differences-between-nodes-running-same/m-p/1178991#M8100</guid>
      <dc:creator>Page__Mike</dc:creator>
      <dc:date>2018-08-31T17:41:15Z</dc:date>
    </item>
    <item>
      <title>Very important:</title>
      <link>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Cache-performance-differences-between-nodes-running-same/m-p/1178992#M8101</link>
      <description>&lt;P&gt;Very important:&lt;/P&gt;

&lt;P&gt;I am running the code in serial mode with exclusive use of the node.&lt;/P&gt;</description>
      <pubDate>Fri, 31 Aug 2018 17:55:01 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Cache-performance-differences-between-nodes-running-same/m-p/1178992#M8101</guid>
      <dc:creator>Page__Mike</dc:creator>
      <dc:date>2018-08-31T17:55:01Z</dc:date>
    </item>
    <item>
      <title>Questions:</title>
      <link>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Cache-performance-differences-between-nodes-running-same/m-p/1178993#M8102</link>
      <description>&lt;P&gt;Questions:&lt;/P&gt;

&lt;OL&gt;
	&lt;LI&gt;Are the "slow" nodes slow every time you run on them, or just slow for the duration of one job?&lt;/LI&gt;
	&lt;LI&gt;Do the nodes have Transparent Huge Pages enabled?&lt;/LI&gt;
&lt;/OL&gt;

&lt;P&gt;If the slow nodes are only slow for the duration of a run, then this looks like a standard cache conflict due to unlucky combinations of physical addresses.&lt;/P&gt;

&lt;UL&gt;
	&lt;LI&gt;Your data is consistent with cache conflicts in either the L2 or L3.&amp;nbsp;&lt;/LI&gt;
	&lt;LI&gt;Conflicts in the L2 cache are extremely common when using 4KiB pages.
		&lt;UL&gt;
			&lt;LI&gt;They arise because 3 bits of the L2 cache index are translated from contiguous virtual addresses to (pseudo-random) physical addresses.&amp;nbsp;&amp;nbsp; The cache can only hold its full capacity if the addresses being used are mapped to pages whose physical addresses have a completely uniform distribution of the values in these 3 bits.&lt;/LI&gt;
			&lt;LI&gt;No L2 cache index bits are translated when using 2MiB pages, so this type of cache conflict cannot happen (with contiguous addresses) when using 2MiB pages.&amp;nbsp;&lt;/LI&gt;
			&lt;LI&gt;Enabling Transparent Huge Pages is the easiest way to use 2MiB pages, but it is also possible to pre-allocate the 2MiB pages and use them for memory allocated by mmap(), shmget(), or by using libhugetlbfs.&lt;/LI&gt;
		&lt;/UL&gt;
	&lt;/LI&gt;
	&lt;LI&gt;Conflicts in the L3 cache are extremely rare in Haswell systems (due to the high associativity of the L3), but if an L3 conflict occurs, each cache line evicted from the L3 must be evicted from the L2 caches first.&amp;nbsp;&amp;nbsp;
		&lt;UL&gt;
			&lt;LI&gt;L3 cache indexing on Haswell is extremely complex, deliberately documented, and has only been "reverse-engineered" for a small number of configurations.&lt;/LI&gt;
		&lt;/UL&gt;
	&lt;/LI&gt;
	&lt;LI&gt;In either case, if slowdown is rare (but lasts the duration of the job), the easiest approach is to note that the job is running slow, kill it, and restart it.&lt;/LI&gt;
&lt;/UL&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 31 Aug 2018 19:21:02 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Cache-performance-differences-between-nodes-running-same/m-p/1178993#M8102</guid>
      <dc:creator>McCalpinJohn</dc:creator>
      <dc:date>2018-08-31T19:21:02Z</dc:date>
    </item>
    <item>
      <title>If virtualization is being</title>
      <link>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Cache-performance-differences-between-nodes-running-same/m-p/1178994#M8103</link>
      <description>&lt;P&gt;If virtualization is being use on the server your "node" might not have exclusive use of L3.&lt;/P&gt;

&lt;P&gt;Jim Dempsey&lt;/P&gt;</description>
      <pubDate>Sat, 01 Sep 2018 12:32:50 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Cache-performance-differences-between-nodes-running-same/m-p/1178994#M8103</guid>
      <dc:creator>jimdempseyatthecove</dc:creator>
      <dc:date>2018-09-01T12:32:50Z</dc:date>
    </item>
    <item>
      <title> </title>
      <link>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Cache-performance-differences-between-nodes-running-same/m-p/1178995#M8104</link>
      <description>&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;A sysadmin is looking into BIOS settings on some slow nodes now. I have asked him to look at the Transparent Huge Pages' setting.&lt;/P&gt;

&lt;P&gt;We are not using visualization on these nodes.&lt;/P&gt;</description>
      <pubDate>Mon, 03 Sep 2018 13:11:00 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Cache-performance-differences-between-nodes-running-same/m-p/1178995#M8104</guid>
      <dc:creator>Page__Mike</dc:creator>
      <dc:date>2018-09-03T13:11:00Z</dc:date>
    </item>
    <item>
      <title>We have determined that BIOS</title>
      <link>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Cache-performance-differences-between-nodes-running-same/m-p/1178996#M8105</link>
      <description>&lt;P&gt;We have determined that BIOS settings varied across the cluster and was causing the performance differences.&amp;nbsp;&lt;/P&gt;

&lt;P&gt;A new set will be uniformly installed across the cluster as part of maintenance.&lt;/P&gt;

&lt;P&gt;Testing the new settings shows that performance is uniform and even improved.&lt;/P&gt;</description>
      <pubDate>Wed, 05 Sep 2018 17:00:34 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Cache-performance-differences-between-nodes-running-same/m-p/1178996#M8105</guid>
      <dc:creator>Page__Mike</dc:creator>
      <dc:date>2018-09-05T17:00:34Z</dc:date>
    </item>
    <item>
      <title>Been there, done that, got</title>
      <link>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Cache-performance-differences-between-nodes-running-same/m-p/1178997#M8106</link>
      <description>&lt;P&gt;Been there, done that, got the T-shirt....&lt;/P&gt;

&lt;P&gt;(I did not really get a T-shirt....)&lt;/P&gt;</description>
      <pubDate>Wed, 05 Sep 2018 19:42:03 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Cache-performance-differences-between-nodes-running-same/m-p/1178997#M8106</guid>
      <dc:creator>McCalpinJohn</dc:creator>
      <dc:date>2018-09-05T19:42:03Z</dc:date>
    </item>
    <item>
      <title>All the t-shirt needs is a</title>
      <link>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Cache-performance-differences-between-nodes-running-same/m-p/1178998#M8107</link>
      <description>&lt;P&gt;&lt;SPAN style="font-size: 1em;"&gt;All the t-shirt needs is a pithy slogan.&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 1em;"&gt;How does on make BIOS pithy or interesting?&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 05 Sep 2018 19:56:41 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Cache-performance-differences-between-nodes-running-same/m-p/1178998#M8107</guid>
      <dc:creator>Page__Mike</dc:creator>
      <dc:date>2018-09-05T19:56:41Z</dc:date>
    </item>
  </channel>
</rss>

