Analyzers
Talk to fellow users of Intel Analyzer tools (Intel VTune™ Profiler, Intel Advisor)

OpenMP load imbalance

mriedmann
Beginner

In the past I used Oprofile to investigate load imbalance because it has a nice table output with one column per thread (see below). That makes it easy to localize load imbalance. Is there a way to get similarly formatted output from VTune?

oprof.png

1 Solution
yuzhang3_intel
Moderator

Set Grouping to 'Thread/Function/Call Stack', then press the '>_' button on the right side to get the command line that generates the report.

yuzhang3_intel_0-1737536542439.png

vtune -report exec-query -rep-knob row-by="/GenericThread/Function/ParentCallStack" -sort-desc "CPU Time:Self" -rep-knob column-by="ViewpointGUIandCLIColumns" -r <result_dir>
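To get closer to the Oprofile-style layout with one column per thread, the rows of such a Thread/Function report can be pivoted in a small script. The following is only a sketch in Python: it works on already-parsed (thread, function, CPU time) tuples, the helper names `pivot_by_thread` and `format_table` are hypothetical, and the sample values are made up. Parsing the actual VTune report output is left out because its exact column layout depends on the result and the VTune version.

```python
from collections import defaultdict

def pivot_by_thread(rows):
    """Turn (thread, function, seconds) rows into {function: {thread: seconds}}."""
    table = defaultdict(dict)
    for thread, function, seconds in rows:
        # Accumulate in case a function appears more than once per thread.
        table[function][thread] = table[function].get(thread, 0.0) + seconds
    return dict(table)

def format_table(table, threads):
    """Render one line per function with one CPU-time column per thread."""
    lines = [f"{'Function':<16}" + "".join(f"{t:>10}" for t in threads)]
    # Sort functions by total time across all threads, largest first.
    for func in sorted(table, key=lambda f: -sum(table[f].values())):
        cells = "".join(f"{table[func].get(t, 0.0):>10.2f}" for t in threads)
        lines.append(f"{func:<16}{cells}")
    return "\n".join(lines)

if __name__ == "__main__":
    # Made-up sample data, not taken from a real VTune result.
    sample = [
        ("Thread 0", "compute", 4.0),
        ("Thread 1", "compute", 1.5),
        ("Thread 0", "reduce", 0.5),
        ("Thread 1", "reduce", 0.25),
    ]
    threads = sorted({thread for thread, _, _ in sample})
    print(format_table(pivot_by_thread(sample), threads))
```

With per-thread columns side by side, a function whose CPU time differs strongly between threads stands out as a candidate for load imbalance.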

3 Replies
yuzhang3_intel
Moderator

VTune can produce a report in a similar format.

First run the analysis type you need, then get the report command line from the GUI and use it to generate the report.

The screenshot below shows a Hotspots result; press the >_ button to get the command line that generates the report.
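The two steps described here (collect first, then report) can be sketched as a small Python helper that only assembles the command lines. The application `./my_openmp_app` and the result directory `r000hs` are placeholders, the function names are hypothetical, and the plain `hotspots` report shown is the default accumulated one, not the exact command the >_ button produces.

```python
import shlex

def collect_cmd(result_dir, app_argv, analysis="hotspots"):
    """Command line that runs an analysis and stores the result in result_dir."""
    return ["vtune", "-collect", analysis, "-r", result_dir, "--", *app_argv]

def report_cmd(result_dir, report="hotspots"):
    """Command line that prints a report from an existing result directory."""
    return ["vtune", "-report", report, "-r", result_dir]

if __name__ == "__main__":
    # Placeholder names; substitute your own application and result directory.
    for cmd in (collect_cmd("r000hs", ["./my_openmp_app"]), report_cmd("r000hs")):
        print(shlex.join(cmd))
```

Printing the commands instead of running them keeps the sketch usable on machines without VTune installed; passing the same lists to `subprocess.run` would execute them.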

yuzhang3_intel_0-1737206423392.png
mriedmann
Beginner

Thanks, that helps a bit. However, my main question is how to get the top hotspots listed side by side for each thread, not accumulated across all threads.
