IhadusedVTuneto understand performance of my OpenMP programs. The number of cores used is sixteen. After "Run Activity",Idouble-clickmyprocess,anddouble-clickone of threads,and then double-clickmy function module.Sohotspots shows. When I focus on one of my functions, I get "Clocks per Instructions Retired - CPI" 1.20 For another thread, I get "Clocks per Instructions Retired - CPI" 1.15
1)How can I get correct CPI of my function? Should I do average of CPI among sixteen cores?
2)When I click my function name in hotspots lists, I can see the code ofthe function and values of "CPU_CLK_UNHALTED.CORE" "INST_RETIRED.ANY".IcomputeCPI used the values,whichis differrent with "Clocks per Instructions Retired - CPI" above. I confuse about it.