Intel® MPI Library
Get help with building, analyzing, optimizing, and scaling high-performance computing (HPC) applications.

mpi_abort does not terminate all processes under torque

Bernie_B_
Beginner
549 Views

We are running the Nasa Overflow code on a large linux cluster and have found that if the code calls MPI_ABORT it does not terminate as

expected.  We are running version 4.1.027 of Intel MPI.  We running under the Torque resource manager.

Bernie

0 Kudos
2 Replies
Bernd_Doser
Beginner
549 Views

We have the same problem using IntelMPI 4.1.0.024. MPI_Abort hang until a newline is send.

Bernd

0 Kudos
Sebastian_G_
Beginner
549 Views

We have the same issue with PBS as job scheduler and mpi version 5.0.3.048.
So the code sends an MPI_ABORT and the processes are not killed correctly that the job hangs in the queue.

Is there a solution to this problem?

Sebastian

 

0 Kudos
Reply