- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi,
I try to run mkl scalapack in fortran code on an infinity band network using I_MPI_FABRICS=shm:dapl, however mkl scalapack does not work correctly when running on several nodes, e.g. pzheev exits with error code 16. When switching to shm:tcp network fabrics it works. It also works with netlib scalapack reference implementation + mkl lapack/blas. I tried intel-2016 update 4 and intel-2017 update 4, both give the same errors.
Any idea on this error ?
best,
marius
Link Copied
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
this is an unknown problem with MKL 2017 u3. If pzheev works with Netlib's implementation and fails with MKL, then probably, this is the bug. Could you give us reproducer?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
thanks for the swift reply.
I did some more testing.I used to use -mkl=cluster for which I got the errors mentioned above, however when using the linker options supplied by intel mkl link advisor, i.e. " ${MKLROOT}/lib/intel64/libmkl_scalapack_lp64.a -Wl,--start-group ${MKLROOT}/lib/intel64/libmkl_intel_lp64.a ${MKLROOT}/lib/intel64/libmkl_sequential.a ${MKLROOT}/lib/intel64/libmkl_core.a ${MKLROOT}/lib/intel64/libmkl_blacs_intelmpi_lp64.a -Wl,--end-group -lpthread -lm -ldl", it works. Using -mkl=cluster together with TCP fabrics works, tough.
So is it in general not a good idea to use "-mkl=cluster" over giving the linker options explicitly?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
This is strange because of when we use -mkl=cluster compiler option, that's mean we link with sequential MKL cluster components and these components are used Intel MPI. Here is should be identical what you explicitly used.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
then, could you give us the reproducer?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
I will try to provide it.
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page