Perf counters for measuring TLB miss rate

AkshayBaviskar · ‎02-11-2021

I want to measure following things for an application:

TLB miss rate
Number of cycles spent in Page Walks
Runtime in number of cycles

I have an Intel(R) Xeon(R) Silver 4110 CPU @ 2.10GHz system.

To calculate these I am using following perf counters:

Total number of memory references ( X ) = mem_inst_retired.all_loads:u + mem_inst_retired.all_stores:u
Total number of memory references that missed in TLB ( Y ) =

mem_inst_retired.stlb_miss_loads:u + mem_inst_retired.stlb_miss_stores:u

TLB miss rate = Y/X
Number of cycles spent in Page Walks = dtlb_store_misses.walk_pending:u + dtlb_load_misses.walk_pending:u
Runtime in number of cycles = cycles

I am confused between three parameters to count the total number of references that missed the TLB:

dtlb_load_misses.miss_causes_a_walk + dtlb_store_misses.miss_causes_a_walk
dtlb_load_misses.walk_completed + dtlb_store_misses.walk_completed
mem_inst_retired.stlb_miss_loads + mem_inst_retired.stlb_miss_stores

However, when I ran the sequential array access of size 64MB. { arr[i] = i;} I am getting following values for above counters: (with THP disabled)

dtlb_store_misses.miss_causes_a_walk = 154771

dtlb_store_misses.walk_completed = 116499

mem_inst_retired.stlb_miss_stores = 15566

When I double the array size to 128 MB and then to 256 MB. These counters are also getting doubled approximately. Since, 64 MB array has 16K pages, I see that mem_inst_retired.stlb_miss_stores is giving the closest value.

Also, I didn’t see any effect of Next-page prefetcher in this as mentioned in this post ( https://community.intel.com/t5/Software-Tuning-Performance/Inconsistency-in-TLB-miss-counters/td-p/1... ). So, I suppose that my machine which has a SkyLake architecture, doesn’t have NPP.

Could you please let me know if I have chosen the right counters for my measurements?

Thanks in advance!

Best Regards,

Akshay

SergioS_Intel · ‎02-12-2021

Hello AkshayBaviskar,

Thank you for contacting Intel Customer Support.

Please allow us to check on your request and we will get back to you as soon as possible.

Best regards,

Sergio S.

Intel Customer Support Technician

SergioS_Intel · ‎02-15-2021

Hello AkshayBaviskar,

Thank you for waiting for our updates.

In regard to your question, we would like to encourage you to post the technical information you are looking for in the following community site:

https://community.intel.com/t5/Software-Development-Topics/ct-p/software-dev-topics

They can help you with all your questions about measuring TLB rate.

Best regards,

Sergio S.

Intel Customer Support Technician

For firmware updates and troubleshooting tips, visit :https://intel.com/support/serverbios

AkshayBaviskar · ‎02-16-2021

Okay.

Thank you!

-Akshay