slojuggler
Beginner
60 Views

Instrumenting from multiple nodes

Can VTune 9.1 Update 2 be used to instrument code running concurrently on multiple nodes? If so, where in the documentation should I look to figure out how to install and configure things?

Or, is this a question best left to the Intel support team?

Thank you.

2 Replies
Peter_W_Intel
Employee

Intel VTune Performance Analyzer is not designed for cluster systems. However, you can simulate distributed computing on a single node; see http://software.intel.com/en-us/articles/performance-tools-for-software-developers-does-vtune-analyz...

Thanks, Peter
TimP
Black Belt

As Peter said, it is often sufficient to profile an entire MPI job running on a single node. The MPI functions are easier to see when MPI is statically linked, even though that may not be your normal running mode. Also, with MPI, OpenMP, or a hybrid of the two, it helps to raise the spin lock transition numbers so that all wait times are counted in the application.
One possible way to profile on multiple nodes is to use PTU, a relative of VTune (see the WhatIf forum), to generate an SEP batch command that can be run across a cluster under MPI, saving a tb5 file for each node.
As you're probably aware, specialized MPI profilers such as Jumpshot or Intel Trace Collector/Analyzer are best for seeing the messaging paths and latencies. If you have Intel Trace Collector installed (basically, a profiling MPI library) and Intel MPI dynamically linked, you can activate profiling simply by adding -trace to the mpiexec command.
For MPI/OpenMP hybrid codes, the Intel profiling OpenMP library is useful for profiling the OpenMP portion of each process.
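To make the -trace suggestion concrete, here is a minimal sketch of how such a launch line might be assembled from a script. Only mpiexec and -trace come from the post above; the -n rank-count option, the ./my_app executable name, and the build_trace_command helper are assumptions for illustration. The script only prints the command and does not run it, and tracing would still depend on Intel Trace Collector being installed and Intel MPI being dynamically linked, as noted above.

```python
import shlex


def build_trace_command(executable, num_ranks, extra_args=()):
    """Return an argv list that launches `executable` under mpiexec with -trace.

    Hypothetical helper: `-n` (number of ranks) and the executable name are
    illustrative assumptions; only `mpiexec` and `-trace` come from the thread.
    """
    if num_ranks < 1:
        raise ValueError("num_ranks must be at least 1")
    return ["mpiexec", "-trace", "-n", str(num_ranks), executable, *extra_args]


# Build (but do not execute) a 4-rank launch line for a placeholder binary.
cmd = build_trace_command("./my_app", 4)
print(shlex.join(cmd))
```

Keeping the command as a list makes it easy to pass to a job script or to subprocess later without re-quoting arguments.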