cwillems
Beginner
93 Views

Weird BTS Performance

Hi,

I am experimenting with the BTS (Branch Trace Store) feature and observe a very large performance penalty. Several sources say that this is normal. On the other hand, I have seen academic publications that use this feature and report only a very small overhead.

Therefore, I have run several experiments on different CPUs, with different DebugCtl settings and different memory caching types.

What confuses me most is that experiments with only the TR flag enabled are *much* slower than those with the TR flag *and* the BTS flag enabled. As I understand it, enabling TR+BTS does "more" than TR alone: the BTMs are written not only to the system bus but also to the DebugStore.
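
For reference, the two configurations I am comparing can be sketched in terms of the DebugCtl bits. This is only a minimal sketch, assuming the IA32_DEBUGCTL layout given in the Intel SDM (MSR address 0x1D9, TR = bit 6, BTS = bit 7). The helper name `debugctl_value` is made up for illustration, and the actual MSR write (which needs ring 0, e.g. a kernel driver) is left out.

```c
#include <stdint.h>

/* IA32_DEBUGCTL MSR address (assumed from the Intel SDM). */
#define IA32_DEBUGCTL 0x1D9u

/* Branch-trace control bits in IA32_DEBUGCTL. */
#define DEBUGCTL_TR  (1ull << 6)  /* TR: generate branch trace messages (BTMs) */
#define DEBUGCTL_BTS (1ull << 7)  /* BTS: log BTMs to the buffer in the DS save area */

/* Hypothetical helper: build the DebugCtl value for one experiment.
 * It only computes the bit pattern; it does not touch the MSR. */
static uint64_t debugctl_value(int enable_tr, int enable_bts)
{
    uint64_t value = 0;
    if (enable_tr)
        value |= DEBUGCTL_TR;
    if (enable_bts)
        value |= DEBUGCTL_BTS;
    return value;
}
```

With this, `debugctl_value(1, 0)` gives 0x40 (the slow "TR only" case) and `debugctl_value(1, 1)` gives 0xC0 (the faster "TR+BTS" case).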

Am I wrong? What is the reason for this "strange" observation?

Thanks a lot for your help,
Carsten
5 Replies
SHIH_K_Intel
Employee

Branch Trace is designed to help tools profile and diagnose code. It can capture a lot of information, and the associated cost (delay) grows with the amount and frequency of data your tool asks the hardware to capture. Infrequent sampling incurs a smaller overhead. Capturing frequently is like attaching an exhaust emission analyzer to a car's tailpipe: the car won't drive normally or get its normal gas mileage.
cwillems
Beginner

Thank you for your answer.
Unfortunately, I cannot see how it is related to my particular question.

Cheers
Carsten
TimP
Black Belt

Perhaps you didn't tell us how your question relates to this forum.
cwillems
Beginner

Hi Tim,

I am sorry if I have used the wrong forum. I searched for similar requests and found some in this one. Is there a forum better suited to BTM/performance-monitoring questions?

Cheers & Thanks
Carsten
Hussam_Mousa__Intel_
New Contributor II

Can you please post your question to this forum: