Intel® DevCloud
Help for those needing help starting or connecting to the Intel® DevCloud
1205 Discussions

Number of Cores ssh-devcloud MPI

alrabrmh
Beginner
138 Views

I am connecting to  ssh devcloud to test MPI application developed using VS code.

However when trying to run the code using one core it is working fine. however when I try to increase the number of cores it is giving me the below error.

 

Abort(2663823) on node 0 (rank 0 in comm 0): Fatal error in PMPI_Init: Other MPI error, error stack:
MPIR_Init_thread(176).........:
MPID_Init(1525)...............:
MPIDI_SHMI_mpi_init_hook(27)..:
MPIDI_POSIX_mpi_init_hook(165):
MPIDI_POSIX_eager_init(2892)..:
MPIDU_Init_shm_alloc(151).....: unable to allocate shared memory

 

Used Command :

mpirun -np 2 ./app_name

mpirun -np 4 ./app_name

0 Kudos
3 Replies
JaideepK_Intel
Moderator
119 Views

Hi,

 

Thank you for posting in Intel communities.

Note:

>>In DevCloud, we can run a maximum of 2 nodes, and on each node we can run 2 processes (total of 4 processes).

 

Please follow the below steps:

Connect to the DevCloud via Cygwin using below command:

qsub -I -l nodes=<number_of_nodes>:<property>:ppn=2 -d .

example: qsub -I -l nodes=2:gpu:ppn=2 -d .

 

After logging into the compute node, we need to get the node numbers which we accessed. So run the below command.

echo $PBS_NODEFILE (example output looks like this: /var/spool/torque/aux//1955007.v-qsvr-1.aidevcloud)

 

We need to cat the output of $PBS_NODEFILE

example:

cat /var/spool/torque/aux//1955007.v-qsvr-1.aidevcloud
s001-n141
s001-n141
s001-n157
s001-n157

 

Copy the node numbers from above and paste them into the host file (I pasted the above node numbers into host.txt)

After pasting the node numbers into the host file, we can run the mpirun command. (Since I am running the mpi4py script, I gave the python command in the below command.)

mpirun -n 4 -hostfile host.txt python hello.py

JaideepK_Intel_0-1670504075054.png

 

If this resolves your issue, make sure to accept this as a solution. This would help others with similar issue. Have a great day ahead.

 

Regards,

Jaideep

 

JaideepK_Intel
Moderator
60 Views

Hi,


If this resolves your issue, make sure to accept this as a solution. This would help others with similar issue. Thank you!


Regards,

Jaideep


JaideepK_Intel
Moderator
27 Views

Hi,


We assume that your issue is resolved. If you need any additional information, please post a new question as this thread will no longer be monitored by Intel.


Thanks,

Jaideep


Reply