Community
cancel
Showing results for 
Search instead for 
Did you mean: 
zhi_c_
Beginner
80 Views

Understanding to CPU Time and Instructions retired

Hi All,

The following is the snapshot from VTune on my Haswell processor. However, I don't understand that why the CPU time and the number of instructions retired for the highlighted code (vpbroadcastq) are so significantly greater than the others in the same basic block. I thought the number of the retired instructions should be not too different, though there might be cache misses or TLB misses. Can someone explain some possible reasons for it? Thanks.

vtune.png

0 Kudos
2 Replies
Peter_W_Intel
Employee
80 Views

I think this topic is Hardware Event Skid relevant - please see this 

You may find performance data refelcting to source line.

zhi_c_
Beginner
80 Views

Thanks for your link, Peter. But the question is that the highlighted instruction is in the middle of the basic block. Why the recording of the other instructions before/after it is not affected if this is due to hardware event skid? 

Reply