- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi intel community!
I got Purple Screen on VMware ESXi 7u3, as I could see on the FDM (Fault Diagnosis Management) the root cause was due to DIMM faulty, but this kind of error is so common.
I have some servers with the same hardware configuration and many times they got purple screen on VMware, after replaced parts faulty server starts work as normal but after a couple of weeks purple screen cames back again.
To be honest I don´t know if there is some configuration missing from the BIOS or something like that.
I hope someone has an idea what could be the root cause for this situation. I´ll apreciate your help.
Server configuration:
4xCPU Intel(R) Xeon(R) Platinum 8376H CPU @ 2.60GHz
48 DIMM DDR 4 Samsung, 65536 MB, 3200 MT/s, 3200 MT/s, DDR4, 1200 mV, 2 rank
Logs error from FDM file:
[Hardware Error Log Number]: 1 Time: 2023-05-03 17:16:56(UTC+-5:00)
Collect: BIOS(SMI) SMI Serial Number: 138 Collect Integrity: Validate(0X00)
CPU: 2(Socket: CPU3) Core: uncore Module: MCA Bank8(M2M 1) MCA Mode: CDC
----------------------- MCA REPORTING ERROR -----------------------
Error Type:Corrected Error
MCACODE: 0X0092 (Last level memory controller error:{Memory-read-error}_CHANNEL{2}_ERR)
MSCODE: 0X0101 ( Error logged is an MC read data error. Error specifically on DDR4.)
Mci_Status: Bit(58)ADDRValid Bit(59)MISCValid
Mci_Address: 0X1D3F3C82B40[DIMM251]
M2M1 generated an eMCA event
Retry_Rd_Err_Address: [DIMM250](CE,CS0,C0,BankGroup 0,Bank 0,Column 0,Row 0,Device 0)
----------------------- MCA REGISTER DUMP -----------------------
MSR_MCG_CONTAIN: 0X0000000000000001
IA32_MCG_STATUS: 0X0000000000000000
IA32_MCG_CAP: 0X000000000F000C17
ERROR_CONTROL: 0X0000000000000000
IA32_MCi_CTL: 0X000000000000FFFD
IA32_MCi_CTL2: 0X0000000540000001
IA32_MCi_STATUS: 0X9C00004001010092
IA32_MCi_ADDR: 0X000001D3F3C82B40
IA32_MCi_MISC: 0X000800C030000886
CORE_SMI_ERR_SRC: 0X0000000000000000
UNCORE_SMI_ERR_SRC: 0X0000000000000100
IerrLoggingReg: 0X00000000
MCerrLoggingReg: 0X00000000
EMCA_CORE_CSMI_LOG: 0X00000000
EMCA_CORE_MSMI_LOG: 0X00000000
SMICtrl: 0X06000000
MCA_ERR_SRC_LOG: 0X00000000
retry_rd_err_log: 0X0000A000
retry_rd_err_log_parity: 0X00000000
retry_rd_err_log_misc: 0X00000000
retry_rd_err_log_address1: 0X00000000
retry_rd_err_log_address2: 0X00000000.
Link Copied
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hello Jair Garcia,
Greetings for the day!
Thank you for reaching out to Intel Support.
Here are the details of your support case:
Case No:06012821
Issue: Purple Screen on Vmware ESXi 7 u3
I hope this message finds you well.
We apologize for the delay that may have occurred. We understand the root cause of the purple screen is a faulty DIMM.
However we request to please help us with the server information and the screenshot of error message to assist you further.
Once you provide us with the requested details, our technical support team will analyze the situation and offer you comprehensive guidance to effectively troubleshoot and resolve the issue.
Thank you for choosing Intel products and services. We are committed to ensuring your satisfaction and resolving the issue promptly.
Best regards,
Dineshbabu.
Intel Customer support
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hello Jair Garcia,
Greetings for the day!
Thank you for posting on Intel community.
Please help us with the details requested on the previous post to assist you further.
Best Regards,
Megha k
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi Intel Dineshbabu,
Thanks for the reply and sorry for the delay,
Here is the file.doc where you can find two purple screenshots showing the errors.
I hope it would be helpful.
Thanks.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hello Jair Garcia,
I hope this message finds you well.
Kindly share us the below details
1. The Baseboard Management Controller (BMC) System Debug Log of the server in question. This log will contain detailed information about the server's management controller, which will enable us to identify any potential hardware-related issues. I would suggest referring to the article How to Collect the Baseboard Management Controller (BMC) System Debug... (intel.com) for a step-by-step guide on how to generate the BMC System Debug Log. The second part of the article describes how to generate the log locally.
2. Could please provide the system board details.
Best regards,
Syed Abdul Mazeed
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi Intel community,
I was reading the Integrated Management Controller Wen Console user guide, but We don´t have the RMM4 intel module installed in our baseboard servers, so we cannot load logs to Server Log Advisor because our logs are in different extension. I don´t know if there is other way to load or what is the correct file that Server Log Advisor needs.
I´ll appreciate your help.
Best regards.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi Dineshbabu,
Thanks for the reply.
I was trying to upload bios.bin.zip to intel server log advisor but it says there is no files, I don´t know what extension or files is correct to upload in this intel application.
Could you help me please?
Regards.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hello Jair Garcia,
I hope this message finds you well.
We would greatly appreciate it if you could kindly share the system board details with us.
Best regards,
Syed M
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi Jair Garcia,
Thank you for posting on Intel community.
Please help us with the system board details requested on the previous post to assist you further.
Best Regards,
Megha k
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi Jair Garcia,
Thank you for posting on Intel community.
We have not received a reply from you , and as such, we will be closing your chat.
If you want to continue support, please reply to this email and we will reopen your chat or create a new one so that we can continue to support you.
Best Regards,
Megha
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page