- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
mpif90 test.o -L{VT_ROOT}/lib -lVT -ldwarf -lelf -lnsl -lm -lpthread -o ftest
Fatal error in MPI_Comm_dup: Invalid communicator, error stack:
MPI_Comm_dup(171): MPI_Comm_dup(comm=0x5b, new_comm=0xbfffc250) failed
MPI_Comm_dup(93): Invalid communicator
rank 0 in job 31 {host_name}_33927 caused collective abort of all ranks
exit status of rank 0: return code 13
Link Copied
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Paolo -
Are you running on an Itanium 2 system? If so, you need to add a "-lvtunwind" flag after the "-lVT" in the linking step. This is noted on page 7 (Chapter 3) of the User's Guide for the 5.0 version.
Otherwise, can you run the application by itself, without the Trace Collector, using MPICH 2 (or Intel MPI)? That is, is the problem only when you try to run using Trace Collector or is there something going wrong at a more basic level with MPI on your system?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Paolo -
I agree, there seems to be some problem with the installation of ITC. Are the libraries visible (or loaded) on each node of the cluster? If you run 'ldd' on the binary, where will the application be looking for the shared library objects?
Can you create a statically linked version of the app and run this on the cluster nodes? Will the program run if you restrict the processes to the node that you installed ITC on?
If none of the above works and you have the libraries available on the cluster nodes, you should report the error to the Intel Premier Support site.
--clay
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Clay
I've installed ITC on a NFS. If I do 'which VTserver' on the master or any of the nodes, Iget ~/libraries/itc/bin/VTserver.
If I run ldd on the app I have
libnsl.so.1 => /lib/libnsl.so.1 (0xb75c9000)
libimf.so => /home/paolo/intel/fc/9.0/lib/libimf.so (0xb73ed000)
libm.so.6 => /lib/tls/libm.so.6 (0xb73cb000)
libpthread.so.0 => /lib/tls/libpthread.so.0 (0xb73bb000)
librt.so.1 => /lib/tls/librt.so.1 (0xb73a7000)
libc.so.6 => /lib/tls/libc.so.6 (0xb7270000)
/lib/ld-linux.so.2 => /lib/ld-linux.so.2 (0xb75eb000)
I've statically linked version of the app but does not run on the nodes.
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page