Yes, removing the verbose flag does not make any difference.
Unfortunately, I cannot send you the binary since we can only execute it with a valid (commercial) license.
Is there any other possibility to trace the MPI behaviour of the application under Windows?
We also have a Linux version of the binary. Under Linux, we could do the tracing just by specifying "mpiexec -trace [...]". In that case, the application did not corrupt the ITC internal data structures.
However, we still need the Windows traces as well.
Concerning multithreading, I am not totally sure. Under Linux, I see that the pthread library is linked to the application. However, the CPU utilization never gets above 100% during the computation. When I run Intel Amplifier (Concurrency Analysis) on the first minute of the application, I actually see 2 threads in total. However, thread 0 only seems to do anything at the beginning of the application, while thread 1 is waiting in the meantime. Afterwards, only thread 1 is running/computing (as far as I can see).
Sorry, I forgot that you are using iMPI 3.2.2. The Intel MPI Library supports statistics output in IPM format in version 4.0.3 and higher, so you cannot get it until you move to iMPI 4.0.3 or later.
It looks very strange that you got stats.ipm. Could you send me the MPI version? Just run a simple program with I_MPI_DEBUG=5.
BTW: you can run your application with I_MPI_DEBUG=10 (or even 100). If you run it with the debug version of the library (impid.dll) instead of impi.dll, you can get even more information.
Unfortunately, without knowing anything about your application, it is hardly possible to understand what is going wrong.
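
In case it helps, the I_MPI_DEBUG runs suggested above could be scripted roughly as follows. This is only a sketch: the program name `simple_test.exe`, the rank count of 2, and the helper names `debug_env` and `mpiexec_command` are placeholders made up for illustration, and it assumes `mpiexec` is on the PATH and picks up I_MPI_DEBUG from the environment.

```python
import os
import shutil
import subprocess


def debug_env(level, base=None):
    """Return a copy of the environment with I_MPI_DEBUG set to `level`."""
    env = dict(os.environ if base is None else base)
    env["I_MPI_DEBUG"] = str(level)
    return env


def mpiexec_command(app_args, nprocs=2):
    """Build an mpiexec command line that starts `nprocs` ranks."""
    return ["mpiexec", "-n", str(nprocs)] + list(app_args)


if __name__ == "__main__":
    # "simple_test.exe" is a hypothetical test program, not a real file name.
    cmd = mpiexec_command(["simple_test.exe"], nprocs=2)
    if shutil.which("mpiexec") is None:
        print("mpiexec not found on PATH; would run:", " ".join(cmd))
    else:
        # Level 5 as suggested above; change to 10 or 100 for more detail.
        result = subprocess.run(
            cmd, env=debug_env(5), capture_output=True, text=True
        )
        print(result.stdout)
        print(result.stderr)
```

Setting the variable through the environment keeps the command line itself unchanged, so the same call can be repeated with a different debug level just by passing another value to `debug_env`.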