Intel® oneAPI HPC Toolkit
Get help with building, analyzing, optimizing, and scaling high-performance computing (HPC) applications.

MPI_Abort does not terminate all processes under Torque

Bernie_B_

We are running the NASA OVERFLOW code on a large Linux cluster and have found that if the code calls MPI_ABORT, the job does not terminate as expected. We are running version 4.1.027 of Intel MPI under the Torque resource manager.

Bernie

2 Replies
Bernd_Doser

We have the same problem using Intel MPI 4.1.0.024. MPI_Abort hangs until a newline is sent.

Bernd

Sebastian_G_

We have the same issue with PBS as the job scheduler and MPI version 5.0.3.048.
The code calls MPI_ABORT, but the processes are not killed correctly, so the job hangs in the queue.

Is there a solution to this problem?

Sebastian

 
