Intel® oneAPI Math Kernel Library
Ask questions and share information with other developers who use Intel® Math Kernel Library.
7060 Discussions

Faulting module name: mkl_core.dll, version: 11.0.4.1

kingsley_s_
Beginner
1,105 Views

Faulting module name: mkl_core.dll, version: 11.0.4.1, time stamp: 0x5195ead6

Exception code: 0xc0000005

Fault offset: 0x00829aa0

I'm hoping this is a known issue that has been fixed in a later updated. We've been running for several months performing a 2D FFT on 10,000's of images and this is the first time that the MKL library has failed. Not much to go on. Basically the program crashed and a couple of events in the system log.

Kingsley

0 Kudos
8 Replies
SergeyKostrov
Valued Contributor II
1,105 Views
What is bis-crash.evtx file? Please explain.
0 Kudos
kingsley_s_
Beginner
1,105 Views

evtx is a windows event log file. You should be able to just ckick on it and windows will display the two log entries that were created. Not much in it really, but that is the only info I have. Really all I get out of it is:

Faulting module name: mkl_core.dll, version: 11.0.4.1, time stamp: 0x5195ead6

Exception code: 0xc0000005

Fault offset: 0x00829aa0

0 Kudos
mecej4
Honored Contributor III
1,105 Views

Rather than  try to infer from event-logs what caused the C0000005 (access violation) exception, it would be more useful to isolate one (or more) input data to the FFT calculation that caused the exception. That data, and the portion of code that contains the FFT call, would be a lot easier to work with because that would be at the source code level. If you are calling from Fortran, you can enable the compiler's traceback option.

0 Kudos
kingsley_s_
Beginner
1,105 Views

This is a rather rare event. Maybe once every 10,000 frames. I already updated MKL to Update 5 (from Update 4) hoping that the problem has been fixed already. I ran two instruments over the weekend and did not have this problem again. Maybe collected 5,000 frames. Statistically, meaningless.

It is being called from C#.It is happening on the release code build as the debug build is too slow for actual instrument acquisition.

The fault offset will tell you which routine in the mkl_core.dll is crashing, but that is about all.

0 Kudos
SergeyKostrov
Valued Contributor II
1,105 Views
>>...That data, and the portion of code that contains the FFT call, would be a lot easier to work with because that would be >>at the source code level... I agree with that point and Windows Event logs are not helpful because they do not show how MKL API was actually used. kingsley, Why wouldn't you upload a test case that reproduces the problem?
0 Kudos
kingsley_s_
Beginner
1,105 Views

>>> Why wouldn't you upload a test case that reproduces the problem?

The program crashed and all information is thus lost. It may take another three months to reproduce this issue. The image is newly acquired, so I do not have a copy of that either.

0 Kudos
mecej4
Honored Contributor III
1,105 Views

kingsley s. wrote:
The fault offset will tell you which routine in the mkl_core.dll is crashing, but that is about all.
True, but not very useful to find the reason for the failure, particularly since users do not have  the MKL source code.

From your own experience, we know that most of the time no crash occurs at that location (or the location is not even encountered). Thus, the crash occurs only for some input argument values. Therefore, we have to find those values and investigate the crash using that set of values.

One way of capturing the faulty input set is to print an image number to a log file or the screen after each successful FFT call. Note the last number printed out before the program, and recreate the input data using that number. If this approach is too slow, use binary search.

0 Kudos
kingsley_s_
Beginner
1,105 Views

More information on this issue. I think it may be a threading issue. The exception is being thrown on the line:

int ret =DFTI.DftiCreateDescriptor(refdesc, precision, forward_domain, dimension, len);

The parameters are always the same. Precision is DFTI.DOUBLE. forward_domain is DFTI.REAL. Dimension is 2. Len is {1024,1024}. What I am thinking is that the previous image may still be processing its FFT. Are you limited to processing one FFT at a time? Thanks.

0 Kudos
Reply