Branching, instruction-miss and prefetcher event counters

T_C
Beginner

Hi, would somebody be able to tell me which event counters I should add to a custom analysis for the following:

1) How successful the prefetcher is

2) L1 Instruction cache misses

3) Pipeline stalls regardless of cause

4) Direct Branch mispredictions

5) Indirect Branch mispredictions

Also, what is the difference between event names with and without the _PS suffix, for example:

BR_MISP_RETIRED.ALL_BRANCHES

BR_MISP_RETIRED.ALL_BRANCHES_PS

Thank you in advance,

 

3 Replies
David_A_Intel1
Employee

Hi T C:

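"PS" means precise. It has to do with when the instruction pointer is captured after the event counter triggers the interrupt: with a precise event, the hardware records the instruction pointer at the time of the event rather than when the interrupt is eventually serviced, so samples land much closer to the instruction that caused the event.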

We can't just "tell you" which events to use. First, it depends entirely on which processor you are collecting the data on. Second, not every processor supports every metric you are looking for. Your best bet is to consult the tuning guides at http://www.intel.com/vtune-tuning-guides. These guides are specific to each generation of microarchitecture and show which metrics are available, accurate, and useful for analyzing performance efficiency.

T_C
Beginner

Hi, thank you for the explanation and the guide. I am using Ivy Bridge (not Xeon). I downloaded the guide, but it doesn't seem to specifically mention how to measure how successful the prefetcher is.

Regarding "PS": if both a PS and a non-PS version of an event are available, should the PS one always be preferred?

Thanks,

Bernard
Valued Contributor I
>>>3) Pipeline stalls regardless of cause>>>

You can split the pipeline stall analysis into Front-End and Back-End stalls.
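
To make the Front-End/Back-End split concrete, here is a rough sketch of the level-1 "top-down" breakdown as it is usually described for Ivy Bridge class cores. Treat it as an illustration only: the event names and the 4-slots-per-cycle pipeline width are my assumptions for that microarchitecture and should be checked against the tuning guide for your processor, the function name `top_down_level1` is made up for this example, and the sample counts are invented numbers, not measurements.

```python
def top_down_level1(counts, slots_per_cycle=4):
    """Split pipeline slots into four level-1 categories.

    `counts` maps event names to raw counts collected over the same interval.
    Event names and slots_per_cycle=4 are assumptions for Ivy Bridge class
    cores; verify them for the processor you are actually profiling.
    """
    # Total issue slots available during the measured interval.
    slots = slots_per_cycle * counts["CPU_CLK_UNHALTED.THREAD"]

    # Slots where the front end failed to deliver uops to the back end.
    frontend_bound = counts["IDQ_UOPS_NOT_DELIVERED.CORE"] / slots

    # Slots wasted on uops that never retired, plus recovery after mispredicts.
    bad_speculation = (
        counts["UOPS_ISSUED.ANY"]
        - counts["UOPS_RETIRED.RETIRE_SLOTS"]
        + slots_per_cycle * counts["INT_MISC.RECOVERY_CYCLES"]
    ) / slots

    # Slots that did useful work.
    retiring = counts["UOPS_RETIRED.RETIRE_SLOTS"] / slots

    # Whatever is left is attributed to back-end stalls.
    backend_bound = 1.0 - (frontend_bound + bad_speculation + retiring)

    return {
        "frontend_bound": frontend_bound,
        "bad_speculation": bad_speculation,
        "retiring": retiring,
        "backend_bound": backend_bound,
    }


# Invented counts, purely to show the arithmetic.
sample_counts = {
    "CPU_CLK_UNHALTED.THREAD": 1000,
    "IDQ_UOPS_NOT_DELIVERED.CORE": 800,
    "UOPS_ISSUED.ANY": 2200,
    "UOPS_RETIRED.RETIRE_SLOTS": 2000,
    "INT_MISC.RECOVERY_CYCLES": 50,
}

for category, fraction in top_down_level1(sample_counts).items():
    print(f"{category}: {fraction:.1%}")
```

The four fractions sum to 1 by construction, so a large Front-End or Back-End share tells you which side of the pipeline to dig into next.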