Software Tuning, Performance Optimization & Platform Monitoring
Discussion around monitoring and software tuning methodologies, Performance Monitoring Unit (PMU) of Intel microprocessors, and platform monitoring

Null

Zara_g_
Beginner
97 Views

Customer asked to close it .

0 Kudos
1 Reply
TimP
Black Belt
97 Views

What specifically is your question?  You are running a benchmark where the bottleneck is expected to be in memory bandwidth.  With a non-precise event, you can't control whether VTune attributes the counts to the responsible instruction or to one which is waiting for that instruction to complete.

Are you trying to compare event rates?  If so, you need to set sample-after values to convenient ratios.  If you are comparing runs with and without mfence, it may help if you assure that the same sample-after values are used in each case.

Reply