Intel® MPI Library
Get help with building, analyzing, optimizing, and scaling high-performance computing (HPC) applications.

Intel MPI crash on core i9

Frank_R_1
New Contributor I
589 Views

Dear support,

We have customers with 13th Gen Intel(R) Core(TM) i7-13850HX processors on Windows 10 64bit where we encounter random crashes

Please have a look at .\cpuinfo.exe output
######################################################################
Intel(R) processor family information utility, Version 2021.7 Build 20220909

Copyright (C) 2005-2022 Intel Corporation. All rights reserved.


===== Processor composition =====

Processor name : 13th Gen Intel(R) Core(TM) i7-13850HX

Packages(sockets) : 1

Cores : 20

Processors(CPUs) : 28

Cores per package : 20

Threads per core : 1


===== Processor identification =====

Processor Thread Id. Core Id. Package Id.

0 0 0 0

1 1 0 0

2 0 4 0

3 1 4 0

4 0 8 0

5 1 8 0

6 0 12 0

7 1 12 0

8 0 16 0

9 1 16 0

10 0 20 0

11 1 20 0

12 0 24 0

13 1 24 0

14 0 28 0

15 1 28 0

16 0 36 0

17 0 37 0

18 0 38 0

19 0 39 0

20 0 40 0

21 0 41 0

22 0 42 0

23 0 43 0

24 0 44 0

25 0 45 0

26 0 46 0

27 0 47 0

===== Placement on packages =====

Package Id. Core Id. Processors

0 0,4,8,12,16,20,24,28,36,37,38,39,40,41,42,43,44,45,46,47 (0,1)(2,3)(4,5)(6,7)(8,9)(10,11)(12,13)(14,15)16,17,18,19,20,21,22,23,24,25,26,27


==== Cache sharing =====

Cache Size Processors

L1 48 KB (0,1)(2,3)(4,5)(6,7)(8,9)(10,11)(12,13)(14,15)

L2 2 MB (0,1)(2,3)(4,5)(6,7)(8,9)(10,11)(12,13)(14,15)(16,17,18,19)(20,21,22,23)(24,25,26,27)

L3 30 MB (0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27)
######################################################################

We run our product with:

mpiexec.exe -np 4 #path_to_product

Unfortunately randomly the program crashes with:

Faulting application name: hydra_pmi_proxy.exe, version: 2021.7.0.0, time stamp: 0x631b56d2
Faulting module name: hydra_pmi_proxy.exe, version: 2021.7.0.0, time stamp: 0x631b56d2
Exception code: 0xc0000005
Fault offset: 0x000000000003fd11
Faulting process id: 0x0x106C
Faulting application start time: 0x0x1DA6314489698E1
Faulting application path: C:\MAGMASOFT\v6.0.0\WINDOWS64\impi\bin\hydra_pmi_proxy.exe
Faulting module path: C:\MAGMASOFT\v6.0.0\WINDOWS64\impi\bin\hydra_pmi_proxy.exe
Report Id: f41018a6-16aa-453c-9f52-8a563c1fc157
Faulting package full name:
Faulting package-relative application ID:

So apparently the hydra_pmi_proxy.exe crashes, not our product.
We have thousands of customers with Xeon processors were no crash occurs, it seems that this is due to performance and efficiency core mix on this type of processors.

We tried with
I_MPI_PIN_PROCESSOR_LIST=0,2,4,6
to prefer the performance cores but we still got random crashes

Do you have any idea what went wrong, what could be the workaround?

Best regards

Frank

0 Kudos
3 Replies
TobiasK
Moderator
562 Views

@Frank_R_1 
Please use the latest Intel MPI version available. Can you reproduce the issue with the IMB benchmarks? Otherwise we need a full reproducer to investigate.

0 Kudos
Frank_R_1
New Contributor I
549 Views

Hi,

 

As I mentioned,  not our product crashes it is the

hydra_pmi_proxy.exe (randomly crash)

This works on all Xeon and AMD processors at our customers.

It is only this type of processors with efficiency cores!

 

Only thing that I can try, is using Intel MPI 2021.11.

 

Best regards

Frank

0 Kudos
TobiasK
Moderator
518 Views

@Frank_R_1 the support of hybrid architectures will improve with the next release. However, such a fault is not known to us and I can only help if you provide a reproducer for the observed problem using the latest version available. E.g. you may try to run IMB benchmarks or MPI-Linpack.

Best
Tobias

0 Kudos
Reply