Intel® oneAPI HPC Toolkit
Get help with building, analyzing, optimizing, and scaling high-performance computing (HPC) applications.
1940 Discussions

Intel MPI mpigdb don't accept user input

Ben_M_
Beginner
365 Views

I've encountered some wired problem that might be mixing Intel MPI 2017, MKL 2017 and glibc 2.26. A fortran program using `mpiifort -mkl=sequential -g -traceback -lstdc++`. It can run when executed directly, but fails with SIGSEGV immediately with mass of Unknown source and line on backtrace, even with `-np 1`.

I guess attaching gdb can help, so I ran `mpirun -np 1 -gdb a.out`, but the mpigdb just become more wired: (bold for my input)

~:server> mpirun -np 1 -gdb sleep 5
mpigdb: np = 1
mpigdb: attaching to 15762 sleep w003
[0] (mpigdb) <b>r</b>
[0] Undefined command: "". Try "help".
[0] (mpigdb) <b>help</b>
[0] Undefined command: "". Try "help".
[0] (mpigdb) <b>aaa</b>
[0] Undefined command: "". Try "help".
[0] (mpigdb) 

It seems that mpigdb just ignores all my input. Tried with several Intel Parallel Studio versions, ranging from 2016 to 2020 update 1, all of which suffers the same problem.

Any idea on it? OS is openSUSE 15.2, command executed from ssh session, bash as shell.

 

0 Kudos
4 Replies
PrasanthD_intel
Moderator
337 Views

Hi Ben,


May I know the reason you are using angular brackets(tags) while giving gdb commands.

You were saying they are bold tags, have you inserted them manually?

Clearly, gdb is not recognising the tags and giving the error Undefined command.

Can you try without using them?


Thanks

Prasanth


Ben_M_
Beginner
309 Views

I've mentioned that bold fonts are to mark out user inputs. I don't know why the HTML tags don't take effect.

PrasanthD_intel
Moderator
296 Views

Hi Ben,

 

Sorry, I thought you are providing the tags as input to gdb.

I have tried with various MPI versions and gdb is working fine for me, but I haven't tested it on OpenSUSE.

I am transferring this query to internal team.

 

Regards

Prasanth

Ben_M_
Beginner
292 Views

I'm sorry that I should mention that I've tested on an CentOS 7 machine, which behaves as expected. However I have totally no idea how to debug it.

Reply