- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Our tests show that the live HD capture performance is much worse with hyperthreading than without it on the new HP Z800 system.
The new Nahalem processor has small individual L2 cache (512K per core) with an relatively large L3 cache shared by the four cores. Our threads all execute data-hungary tasks. Do you think the small L2 cache may cause too many misses with hyperthreading?
The new Nahalem processor has small individual L2 cache (512K per core) with an relatively large L3 cache shared by the four cores. Our threads all execute data-hungary tasks. Do you think the small L2 cache may cause too many misses with hyperthreading?
Link Copied
3 Replies
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Quoting - shyaki
Our tests show that the live HD capture performance is much worse with hyperthreading than without it on the new HP Z800 system.
The new Nahalem processor has small individual L2 cache (512K per core) with an relatively large L3 cache shared by the four cores. Our threads all execute data-hungary tasks. Do you think the small L2 cache may cause too many misses with hyperthreading?
The new Nahalem processor has small individual L2 cache (512K per core) with an relatively large L3 cache shared by the four cores. Our threads all execute data-hungary tasks. Do you think the small L2 cache may cause too many misses with hyperthreading?
If there is an advantage to specifying which threads share cores and associated caches, you would need that specification (KMP_AFFINITY for Intel OpenMP).
Data intensive applications with good memory locality are likely not to benefit from HT, as you could use up the entire memory bandwidth with 1 thread per core.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
If you use vtune you should be able to see the cache behavior reported. If you are not a vtune user it is a good tool and makes use of the on chip hardware profiling to measure details of what is happening on the processor. You could compare the reports single and multi-threaded and see what is going on.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Quoting - shyaki
Our tests show that the live HD capture performance is much worse with hyperthreading than without it on the new HP Z800 system.
The new Nahalem processor has small individual L2 cache (512K per core) with an relatively large L3 cache shared by the four cores. Our threads all execute data-hungary tasks. Do you think the small L2 cache may cause too many misses with hyperthreading?
The new Nahalem processor has small individual L2 cache (512K per core) with an relatively large L3 cache shared by the four cores. Our threads all execute data-hungary tasks. Do you think the small L2 cache may cause too many misses with hyperthreading?
I agree with the previous posts. You might find this guide usefulhttp://software.intel.com/en-us/articles/using-intel-vtune-performance-analyzer-to-optimize-software-on-intel-core-i7-processors/ to find out if cache misses harm your performance.
Reply
Topic Options
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page