Software Tuning, Performance Optimization & Platform Monitoring
Discussion regarding monitoring and software tuning methodologies, Performance Monitoring Unit (PMU) of Intel microprocessors, and platform updating.

Question on PCM function getCyclesLostDueL3CacheMisses()

Pradeep_R_
Beginner
544 Views

Hi, In the function getCyclesLostDueL3CacheMisses() defined in cpucounters.h of the PCM package, I see a 180. * L3_cycles/total_cycles computation as return value. Can someone please explain why there is a 180 there, and no a 100? Thanks, Pradeep.

0 Kudos
3 Replies
Roman_D_Intel
Employee
544 Views

Hi Pradeep,

This was an average memory access latency for a 2-socket system. Since this is only a rough estimation method we have deprecated this function and the L3CLK/L2CLK metrics in the upcoming PCM version.

Best regards,

Roman

0 Kudos
Pradeep_R_
Beginner
544 Views

Thanks for the clarification. If you're removing the L3clk/L2clk metrics, is there an alternate route to estimate the amount of time the cores spent waiting on an L3 miss (which would mostly be waiting on DDR, assuming good threading)?

Thanks,

Pradeep.

0 Kudos
Roman_D_Intel
Employee
544 Views

Hi Pradeep,

I could recommend the top-down method implemented in the "General Exploration" analysis of Intel® VTune™ Amplifier XE. It is a much more robust method to analyze CPU stalls (incl L3 cache miss stalls).

Best regards,

Roman

0 Kudos
Reply