Intel® ISA Extensions
Use hardware-based isolation and memory encryption to provide more code protection in your solutions.

Weird BTS Performance

cwillems
Beginner
181 Views
Hi,

I am playing around with the BTS feature and observe a very huge performance penalty. It is said in several places that this is normal. On the other hand I see academic publications that use this feature which report only a very small overhead.

Therefore, I have performed several experiments on different CPUs with different DebugCtl settings and different kind of memory caching types.

What me confuses the most is the fact that experiments with only the TR-flag enabled are *much* slower than those with TR-flag *and* BTS-flag enabled. From my understanding enabling TR+BTS does "more" than only TR, in fact writing the BTM not only to the system bus but also to the DebugStore.

Am I wrong? What is the reason for this "strange" observation?

Thanks a lot for your help,
Carsten
0 Kudos
5 Replies
SHIH_K_Intel
Employee
181 Views
Branch Trace isdesigned to help tools to profile/diagnose. It can capture a lot of information, and the associated costs (delays) goes with the amount/freqency your tool ask the HW to capture. In-frequent sampling would incur smaller overhead. Doing it frequently would be like attaching a exhaust emission analyzer to the car's tail pipe, it won't be able to drive normally or normal gas mileage.
cwillems
Beginner
181 Views
Thank you for your answer.
Unfortunately, I cannot see how it is related to my particular question.

Cheers
Carsten
TimP
Black Belt
181 Views
Perhaps you didn't tell us how your question relates to this forum.
cwillems
Beginner
181 Views
Hi Tim,

I am sorry if I have used the wrong one. I just have searched for similar requests and found some in this forum. Is their a better suited for BTM/PerformanceMonitoring-related questions?

Cheers & Thanks
Carsten
Hussam_Mousa__Intel_
New Contributor II
181 Views
Can you please post your question to this forum:
Reply