The L1 Bound::DTLB Overhead::Load STLB Hit metric is very high in my program, and I'm not sure what it means.
As far as I know, the load was stalled even though it did not miss the L1 data cache, because it was waiting on an STLB hit. What scenario or what kind of code would cause this problem?
Thank you for posting in Intel Communities.
Could you please share the following details so that we can understand the issue better?
- VTune version
- Processor details
- The type of analysis you are running in VTune
- The exact steps you followed and a sample reproducer (a sample application that is similar to the application you are trying to analyze)
- Any specific configurations or settings you applied
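
For reference, a reproducer can be quite small. The sketch below is only an illustration of an access pattern that tends to show up as Load STLB Hit: it reads one cache line from each of many 4 KiB pages, so the loads touch more pages than a first-level DTLB typically holds while the touched lines still fit in L1. The page size, line size, page count, and the helper names `line_offset`, `make_buffer`, and `touch_pages` are all assumptions for illustration, not values taken from your system or from VTune.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

#define PAGE_SIZE 4096u /* assumption: 4 KiB pages, no huge pages */
#define LINE_SIZE 64u   /* assumption: 64-byte cache lines */
#define NUM_PAGES 256u  /* assumption: more pages than first-level DTLB
                           entries, fewer than STLB entries */

/* Byte offset of the single cache line touched in a given page.
   The line index rotates from page to page so the touched lines are
   spread across L1 sets instead of all landing in the same set. */
static size_t line_offset(size_t page)
{
    size_t lines_per_page = PAGE_SIZE / LINE_SIZE;
    return page * PAGE_SIZE + (page % lines_per_page) * LINE_SIZE;
}

/* Allocate `pages` pages and fill every byte with 1 so that each page
   is actually backed by memory before the measured loop runs. */
static uint8_t *make_buffer(size_t pages)
{
    uint8_t *buf = malloc(pages * PAGE_SIZE);
    if (buf != NULL)
        memset(buf, 1, pages * PAGE_SIZE);
    return buf;
}

/* Read one byte per page, `rounds` times, and return the sum of the
   bytes read. The volatile qualifier keeps the compiler from removing
   or hoisting the loads. */
static uint64_t touch_pages(const volatile uint8_t *buf, size_t pages,
                            size_t rounds)
{
    assert(buf != NULL);
    uint64_t sum = 0;
    for (size_t r = 0; r < rounds; ++r)
        for (size_t p = 0; p < pages; ++p)
            sum += buf[line_offset(p)];
    return sum;
}
```

Calling `touch_pages(make_buffer(NUM_PAGES), NUM_PAGES, rounds)` from `main` with a large `rounds` value and profiling that binary should give a hot loop dominated by these loads. Whether it is reported as Load STLB Hit depends on the TLB sizes of your processor and on whether the buffer ends up on huge pages, so the constants may need adjusting.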