Peter, Thank you for you advice.
My question is the event INST_RETIRED.ANY can collect every instruction retired? If Vtune collect event send by CPU, It will make my code run slowly?If Vtunecollectevent CPU_CLK_UNHALTED.CORE, my program will be interputed every clock? So, if I use Vtune collect event, my program be influenced?
Thanks a lot.
Hi Huang Zhe,
Since your application is single thread basis, HT doesn't help on the performance - except Compiler did auto-paralleling in your code, actually I don't know what Compiler and it options you used. You mayopen sampling result and select the process of your application then narrow down to threads report to know if your codewas parallelizedby Compiler.
CPI0.7 seems OK for Core 2 Duo processor, if you want to use SSE to do thingsbetter - you have to use VTune Analyzer to find hot functions, then check if you have opportunity to optimize them.