integration problem between Torque 4 and Intel(R) MPI Library for Linux* OS, Version 2019 Update 1
I have successfully compiled and linked a program with IntelMPI and if I run it interactively or in background it runs very fast and without any problems on our new server (ProLiant DL580 Gen10, 1 node with 4 processors with 18 cores each, total 72 cores, hyperthreading disabled). If I try to submit it by Torque (version 4) strange things happen, for example:
1) if I submit 2 jobs asking each 8 cores they are both fine
2) if I submit a third job (8 cores) it is 4 times slower becasue the 8 process runs on two cores!
3) if I submit a fourth job it runs properly, but if I qdel all the four jobs, all of them disappear from qstat -a but the fourth is keeping running!
From previous discussion I notice in this forum, I have the feeling it is an integration problem between intelmpi and torque, so I did the following: