I installed the latest VTune analyzer 8.0 and VTune rdc. After I did several profiling, I found this new version causes more overhead than the old one.
Here is the summary for the overhead caused by VTune:
Samp_Write_PC_File about 15% clock ticks
Prepare_Wait about 8% clockticks
Finish_To_Wait about 8% clock ticks
I remember the old VTune 7.2 caused 20~25% overhead and the older older VTune consumes only 5% overhead. I don't understand why VTune cause so much overhead. I was told vtl will be better and SEP will be better than vtl. It seems I have to use SEP now.
Message Edited by email@example.com on 02-15-200601:08 PM