hi, I am running vtune for linux on a system that has two hyperthreaded pentium xeons. This means that there are four virtual processors. I wanted to run a parallel app. and measure the coherence traffic. In specific I wanted to measure the following:
a) The percentage of L2 read misses that are satisfied by reading data from another processor's cache, rather than reading it from main memory. Basically the number of remote misses and the number of main memory misses.