Hi,
I am running VTune for Linux on a system that has two hyperthreaded Pentium Xeon processors, which gives four virtual (logical) processors. I want to run a parallel application and measure the coherence traffic. Specifically, I want to measure the following:
a) The percentage of L2 read misses that are satisfied by reading data from another processor's cache rather than from main memory. In other words, I need the number of remote misses and the number of main memory misses.
Any help is appreciated.
Thanks,
smruti
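To make the requested metric concrete, here is a minimal sketch of the arithmetic only. It assumes the two counts (L2 read misses served from another processor's cache, and L2 read misses served from main memory) have already been collected somehow; the function name and parameters are hypothetical and do not correspond to any VTune event names or API.

```python
def remote_miss_percentage(remote_misses: int, memory_misses: int) -> float:
    """Return the percentage of L2 read misses served by another processor's cache.

    Both arguments are hypothetical, already-collected counts:
      remote_misses -- L2 read misses satisfied from a remote cache
      memory_misses -- L2 read misses satisfied from main memory
    """
    if remote_misses < 0 or memory_misses < 0:
        raise ValueError("miss counts must be non-negative")

    total_misses = remote_misses + memory_misses
    if total_misses == 0:
        # No L2 read misses were recorded, so report 0% rather than dividing by zero.
        return 0.0

    return 100.0 * remote_misses / total_misses


if __name__ == "__main__":
    # Example with made-up counts, for illustration only.
    print(remote_miss_percentage(250, 750))
```

The percentage is taken over the sum of the two counts, which assumes every L2 read miss falls into exactly one of the two categories.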
1 Reply
Smruti,
Doesn't the thread view show you what you need per logical CPU?
Sorry if I'm misunderstanding the question, but I think what you're describing is what you'll see if you collect L2 cache misses and then go to the thread view.
Try it and report back, OK?
Cheers,
jdg
