- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Two issues with mpicleanup on Intel MPI 2019 U7.
1. The bin direcorty doesn't has mpicleanup script, since this is causing an error "mpirun: line 120: mpicleanup: command not found"
2. How mpicleanup is taken care in MPI 2019 U7?
I am not seeing any files created in /tmp/mpiexe<processid> file to track all the process id's.
Is there is new way of handling cleanup?
Link Copied
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi,
Thanks for reaching out to us.
The I_MPI_HYDRA_CLEANUP creates a file if Hydra ends incorrectly and the processes are still running.
This feature is not supported in IMPI 2019 since hydra should cleanup all the processes itself automatically.
If you find the processes are not being cleaned up automatically, please let us know.
Regards
Prasanth
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Thank you Prakash,
I am getting the below error occasionally while launching the MPI.
[30] [proxy:0:18@<Node>] HYD_spawn (../../../../../src/pm/i_hydra/libhydra/spawn/intel/hydra_spawn.c:128): execvp error on file <MyProcess> (Too many open files).
Is this side effect of not cleaning up correctly?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi,
You can use the top command to check if the processes are still running after they are finished/terminated.
And coming to the error it may be due to a limitation from the Linux/job scheduler.
Could you mention how many processes you were launching and your environment details (Job scheduler, interconnect, Provider) etc?
Please check if you have set any maximum number of processes limit in job scheduler.
Regards
Prasanth
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hello Prasanth,
You can use the top command to check if the processes are still running after they are finished/terminated.
Tamil >> I will check and update, if issue is reproduced.
And coming to the error it may be due to a limitation from the Linux/job scheduler.
Could you mention how many processes you were launching and your environment details (Job scheduler, interconnect, Provider) etc?
Tamil >> We are launching only 2 process per node, Not using job scheduler, Mellanox IB, OFI.
Please check if you have set any maximum number of processes limit in job scheduler.
Tamil >> We are not setting any max number of process limit.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi,
Were you able to reproduce the issue and got a chance to check whether the threads have been still running?
Please confirm so we can go-ahead
Regards
Prasanth
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi,
Please let us know if you face the issue again. We were not able to reproduce the issue in our environment.
Regards
Prasanth
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi,
We are closing this thread considering your issue has been resolved. Please raise a new thread for any further assistance from Intel.
Any further interaction in this thread will be considered community only
Regards
Prasanth

- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page