<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Random System Crashes M50CYP1UR204 in Intel® Xeon® Processor and Server Products</title>
    <link>https://community.intel.com/t5/Intel-Xeon-Processor-and-Server/Random-System-Crashes-M50CYP1UR204/m-p/1399197#M22264</link>
    <description>&lt;P&gt;Hey guys,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I am currently experiencing random system crashes on one of my servers (I have six in total, and only this one has the problem).&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;When a crash occurs, there is no video output at all. I can still reach remote management (BMC); however, it reports "Host Power Status: Host is currently OFF&lt;SPAN&gt;". The overall system health light is solid green.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;When I try to power the server on remotely via the BMC, it fails and the overall system health light changes to solid red/amber.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;In the BMC event log, this is logged at the time of the crash:&lt;/SPAN&gt;&lt;/P&gt;
&lt;TABLE class="listgrid" cellspacing="0" cellpadding="0"&gt;
&lt;TBODY&gt;
&lt;TR class="normal"&gt;
&lt;TD&gt;801&lt;/TD&gt;
&lt;TD&gt;Sun Jul 10 05:59:53 2022&lt;/TD&gt;
&lt;TD&gt;Pwr Unit Status&lt;/TD&gt;
&lt;TD&gt;BMC&lt;/TD&gt;
&lt;TD&gt;Informational&lt;/TD&gt;
&lt;TD&gt;Power Unit&lt;/TD&gt;
&lt;TD&gt;Power Off / Power Down - Asserted&lt;/TD&gt;
&lt;/TR&gt;
&lt;/TBODY&gt;
&lt;/TABLE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;When I try to restart it remotely, these events are logged:&lt;/P&gt;
&lt;TABLE class="listgrid" cellspacing="0" cellpadding="0"&gt;
&lt;TBODY&gt;
&lt;TR class="normal"&gt;
&lt;TD width="40px" height="47px"&gt;804&lt;/TD&gt;
&lt;TD width="144.225px" height="47px"&gt;Sun Jul 10 16:33:10 2022&lt;/TD&gt;
&lt;TD width="93.575px" height="47px"&gt;P2 Status&lt;/TD&gt;
&lt;TD width="44.4px" height="47px"&gt;BMC&lt;/TD&gt;
&lt;TD width="60.9875px" height="47px"&gt;Critical&lt;/TD&gt;
&lt;TD width="86.8625px" height="47px"&gt;Processor&lt;/TD&gt;
&lt;TD width="224.15px" height="47px"&gt;Thermal Trip - CPU boot FIVR fault - Asserted&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR class="normal"&gt;
&lt;TD width="40px" height="47px"&gt;803&lt;/TD&gt;
&lt;TD width="144.225px" height="47px"&gt;Sun Jul 10 16:33:10 2022&lt;/TD&gt;
&lt;TD width="93.575px" height="47px"&gt;P1 Status&lt;/TD&gt;
&lt;TD width="44.4px" height="47px"&gt;BMC&lt;/TD&gt;
&lt;TD width="60.9875px" height="47px"&gt;Critical&lt;/TD&gt;
&lt;TD width="86.8625px" height="47px"&gt;Processor&lt;/TD&gt;
&lt;TD width="224.15px" height="47px"&gt;Thermal Trip - CPU boot FIVR fault - Asserted&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR class="normal"&gt;
&lt;TD width="40px" height="55px"&gt;802&lt;/TD&gt;
&lt;TD width="144.225px" height="55px"&gt;Sun Jul 10 16:33:09 2022&lt;/TD&gt;
&lt;TD width="93.575px" height="55px"&gt;Pwr Unit Status&lt;/TD&gt;
&lt;TD width="44.4px" height="55px"&gt;BMC&lt;/TD&gt;
&lt;TD width="60.9875px" height="55px"&gt;Critical&lt;/TD&gt;
&lt;TD width="86.8625px" height="55px"&gt;Power Unit&lt;/TD&gt;
&lt;TD width="224.15px" height="55px"&gt;
&lt;P&gt;Soft Power Control Failure - Asserted&lt;/P&gt;
&lt;/TD&gt;
&lt;/TR&gt;
&lt;/TBODY&gt;
&lt;/TABLE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;In the Sensor Readings, I see this:&lt;/P&gt;
&lt;TABLE class="listgrid" cellspacing="0" cellpadding="0"&gt;
&lt;TBODY&gt;
&lt;TR class="normal"&gt;
&lt;TD&gt;Critical&lt;/TD&gt;
&lt;TD&gt;Pwr Unit Status&lt;/TD&gt;
&lt;TD&gt;Power Off / Power Down&lt;BR /&gt;Soft Power Control Failure&lt;/TD&gt;
&lt;TD&gt;0x8021&lt;/TD&gt;
&lt;/TR&gt;
&lt;/TBODY&gt;
&lt;/TABLE&gt;
&lt;TABLE class="listgrid" width="448px" cellspacing="0" cellpadding="0"&gt;
&lt;TBODY&gt;
&lt;TR class="normal"&gt;
&lt;TD width="60.9875px"&gt;Critical&lt;/TD&gt;
&lt;TD width="119.55px"&gt;P1 Status&lt;/TD&gt;
&lt;TD width="200.163px"&gt;Thermal Trip&lt;BR /&gt;Processor Presence detected&lt;/TD&gt;
&lt;TD width="66.3px"&gt;0x8082&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR class="normal"&gt;
&lt;TD width="60.9875px"&gt;Critical&lt;/TD&gt;
&lt;TD width="119.55px"&gt;P2 Status&lt;/TD&gt;
&lt;TD width="200.163px"&gt;Thermal Trip&lt;BR /&gt;Processor Presence detected&lt;/TD&gt;
&lt;TD width="66.3px"&gt;0x8082&lt;/TD&gt;
&lt;/TR&gt;
&lt;/TBODY&gt;
&lt;/TABLE&gt;
&lt;P&gt;The only way I can bring the server back up is a full power reset (unplugging both power cables). The server then stays online for anywhere from 2 to 24 hours before it crashes again.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I have tried updating to the latest BIOS,&amp;nbsp;&lt;SPAN&gt;01.01.0005, with no luck.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;The system is currently running 2x Xeon 4309Y CPUs, 64GB RAM (4x16GB), and 2x P4510 SSDs in RAID 0 via VROC.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;OS is Ubuntu 22.04 LTS&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Any help would be much appreciated.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks&lt;/P&gt;</description>
    <pubDate>Sun, 10 Jul 2022 05:50:24 GMT</pubDate>
    <dc:creator>BevanO</dc:creator>
    <dc:date>2022-07-10T05:50:24Z</dc:date>
    <item>
      <title>Random System Crashes M50CYP1UR204</title>
      <link>https://community.intel.com/t5/Intel-Xeon-Processor-and-Server/Random-System-Crashes-M50CYP1UR204/m-p/1399197#M22264</link>
      <description>&lt;P&gt;Hey guys,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I am currently experiencing random system crashes on one of my servers (I have six in total, and only this one has the problem).&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;When a crash occurs, there is no video output at all. I can still reach remote management (BMC); however, it reports "Host Power Status: Host is currently OFF&lt;SPAN&gt;". The overall system health light is solid green.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;When I try to power the server on remotely via the BMC, it fails and the overall system health light changes to solid red/amber.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;In the BMC event log, this is logged at the time of the crash:&lt;/SPAN&gt;&lt;/P&gt;
&lt;TABLE class="listgrid" cellspacing="0" cellpadding="0"&gt;
&lt;TBODY&gt;
&lt;TR class="normal"&gt;
&lt;TD&gt;801&lt;/TD&gt;
&lt;TD&gt;Sun Jul 10 05:59:53 2022&lt;/TD&gt;
&lt;TD&gt;Pwr Unit Status&lt;/TD&gt;
&lt;TD&gt;BMC&lt;/TD&gt;
&lt;TD&gt;Informational&lt;/TD&gt;
&lt;TD&gt;Power Unit&lt;/TD&gt;
&lt;TD&gt;Power Off / Power Down - Asserted&lt;/TD&gt;
&lt;/TR&gt;
&lt;/TBODY&gt;
&lt;/TABLE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;When I try to restart it remotely, these events are logged:&lt;/P&gt;
&lt;TABLE class="listgrid" cellspacing="0" cellpadding="0"&gt;
&lt;TBODY&gt;
&lt;TR class="normal"&gt;
&lt;TD width="40px" height="47px"&gt;804&lt;/TD&gt;
&lt;TD width="144.225px" height="47px"&gt;Sun Jul 10 16:33:10 2022&lt;/TD&gt;
&lt;TD width="93.575px" height="47px"&gt;P2 Status&lt;/TD&gt;
&lt;TD width="44.4px" height="47px"&gt;BMC&lt;/TD&gt;
&lt;TD width="60.9875px" height="47px"&gt;Critical&lt;/TD&gt;
&lt;TD width="86.8625px" height="47px"&gt;Processor&lt;/TD&gt;
&lt;TD width="224.15px" height="47px"&gt;Thermal Trip - CPU boot FIVR fault - Asserted&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR class="normal"&gt;
&lt;TD width="40px" height="47px"&gt;803&lt;/TD&gt;
&lt;TD width="144.225px" height="47px"&gt;Sun Jul 10 16:33:10 2022&lt;/TD&gt;
&lt;TD width="93.575px" height="47px"&gt;P1 Status&lt;/TD&gt;
&lt;TD width="44.4px" height="47px"&gt;BMC&lt;/TD&gt;
&lt;TD width="60.9875px" height="47px"&gt;Critical&lt;/TD&gt;
&lt;TD width="86.8625px" height="47px"&gt;Processor&lt;/TD&gt;
&lt;TD width="224.15px" height="47px"&gt;Thermal Trip - CPU boot FIVR fault - Asserted&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR class="normal"&gt;
&lt;TD width="40px" height="55px"&gt;802&lt;/TD&gt;
&lt;TD width="144.225px" height="55px"&gt;Sun Jul 10 16:33:09 2022&lt;/TD&gt;
&lt;TD width="93.575px" height="55px"&gt;Pwr Unit Status&lt;/TD&gt;
&lt;TD width="44.4px" height="55px"&gt;BMC&lt;/TD&gt;
&lt;TD width="60.9875px" height="55px"&gt;Critical&lt;/TD&gt;
&lt;TD width="86.8625px" height="55px"&gt;Power Unit&lt;/TD&gt;
&lt;TD width="224.15px" height="55px"&gt;
&lt;P&gt;Soft Power Control Failure - Asserted&lt;/P&gt;
&lt;/TD&gt;
&lt;/TR&gt;
&lt;/TBODY&gt;
&lt;/TABLE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;In the Sensor Readings, I see this:&lt;/P&gt;
&lt;TABLE class="listgrid" cellspacing="0" cellpadding="0"&gt;
&lt;TBODY&gt;
&lt;TR class="normal"&gt;
&lt;TD&gt;Critical&lt;/TD&gt;
&lt;TD&gt;Pwr Unit Status&lt;/TD&gt;
&lt;TD&gt;Power Off / Power Down&lt;BR /&gt;Soft Power Control Failure&lt;/TD&gt;
&lt;TD&gt;0x8021&lt;/TD&gt;
&lt;/TR&gt;
&lt;/TBODY&gt;
&lt;/TABLE&gt;
&lt;TABLE class="listgrid" width="448px" cellspacing="0" cellpadding="0"&gt;
&lt;TBODY&gt;
&lt;TR class="normal"&gt;
&lt;TD width="60.9875px"&gt;Critical&lt;/TD&gt;
&lt;TD width="119.55px"&gt;P1 Status&lt;/TD&gt;
&lt;TD width="200.163px"&gt;Thermal Trip&lt;BR /&gt;Processor Presence detected&lt;/TD&gt;
&lt;TD width="66.3px"&gt;0x8082&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR class="normal"&gt;
&lt;TD width="60.9875px"&gt;Critical&lt;/TD&gt;
&lt;TD width="119.55px"&gt;P2 Status&lt;/TD&gt;
&lt;TD width="200.163px"&gt;Thermal Trip&lt;BR /&gt;Processor Presence detected&lt;/TD&gt;
&lt;TD width="66.3px"&gt;0x8082&lt;/TD&gt;
&lt;/TR&gt;
&lt;/TBODY&gt;
&lt;/TABLE&gt;
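&lt;P&gt;For reference, the hex values in the Sensor Readings appear to be bitmasks of the asserted sensor states. Below is a minimal Python sketch of decoding them. It assumes the low bits follow the generic sensor-specific offsets from the IPMI 2.0 specification (Power Unit: offset 0 is Power Off / Power Down, offset 5 is Soft Power Control Failure; Processor: offset 1 is Thermal Trip, offset 7 is Processor Presence detected) and that bit 15 is a reserved bit that always reads as 1. The function and table names are hypothetical, and the BMC may format these values differently.&lt;/P&gt;

```python
# Sketch only: decode a discrete sensor state bitmask as shown by the BMC.
# The offset tables below are assumptions taken from the generic IPMI 2.0
# sensor type definitions, trimmed to a handful of states.

POWER_UNIT_STATES = {
    0: "Power Off / Power Down",
    1: "Power Cycle",
    4: "AC lost / Power input lost",
    5: "Soft Power Control Failure",
    6: "Power Unit Failure detected",
}

PROCESSOR_STATES = {
    0: "IERR",
    1: "Thermal Trip",
    5: "Configuration Error",
    7: "Processor Presence detected",
    8: "Processor disabled",
}

RESERVED_BIT = 15  # assumed to always read as 1, so it is ignored


def decode_states(raw, table):
    """Return the names of all asserted state offsets in raw."""
    names = []
    for offset in range(RESERVED_BIT):
        # Integer division and modulo isolate the bit at this offset.
        if (raw // 2 ** offset) % 2 == 1:
            names.append(table.get(offset, f"offset {offset} (not in table)"))
    return names


print(decode_states(0x8021, POWER_UNIT_STATES))
print(decode_states(0x8082, PROCESSOR_STATES))
```

&lt;P&gt;Under these assumptions, 0x8021 and 0x8082 decode to the same state names that the BMC lists in the tables above.&lt;/P&gt;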
&lt;P&gt;The only way I can bring the server back up is a full power reset (unplugging both power cables). The server then stays online for anywhere from 2 to 24 hours before it crashes again.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I have tried updating to the latest BIOS,&amp;nbsp;&lt;SPAN&gt;01.01.0005, with no luck.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;The system is currently running 2x Xeon 4309Y CPUs, 64GB RAM (4x16GB), and 2x P4510 SSDs in RAID 0 via VROC.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;OS is Ubuntu 22.04 LTS&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Any help would be much appreciated.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Sun, 10 Jul 2022 05:50:24 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Xeon-Processor-and-Server/Random-System-Crashes-M50CYP1UR204/m-p/1399197#M22264</guid>
      <dc:creator>BevanO</dc:creator>
      <dc:date>2022-07-10T05:50:24Z</dc:date>
    </item>
    <item>
      <title>Re:Random System Crashes M50CYP1UR204</title>
      <link>https://community.intel.com/t5/Intel-Xeon-Processor-and-Server/Random-System-Crashes-M50CYP1UR204/m-p/1399252#M22265</link>
      <description>&lt;P&gt;Hello BevanO,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Thank you for joining the Intel community.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Please check the following article and follow the suggested steps where possible. If the issue persists after that, we should be able to consider a CPU replacement under warranty.&lt;/P&gt;&lt;P&gt;&lt;A href="https://www.intel.com/content/www/us/en/support/articles/000089860/server-products/multi-node-servers.html" rel="noopener noreferrer" target="_blank"&gt;Error Message: IERR – Non-boot Core FIVR Fault – Asserted... (intel.com)&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;We look forward to your updates.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Regards&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Jose A.&lt;/P&gt;&lt;P&gt;Intel Customer Support Technician&lt;/P&gt;&lt;P&gt;&lt;EM&gt;For firmware updates and troubleshooting tips, visit:&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;&lt;A href="https://intel.com/support/serverbios" target="_blank"&gt;https://intel.com/support/serverbios&lt;/A&gt;&lt;/EM&gt;&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Mon, 11 Jul 2022 00:30:25 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Xeon-Processor-and-Server/Random-System-Crashes-M50CYP1UR204/m-p/1399252#M22265</guid>
      <dc:creator>JoseH_Intel</dc:creator>
      <dc:date>2022-07-11T00:30:25Z</dc:date>
    </item>
    <item>
      <title>Re:Random System Crashes M50CYP1UR204</title>
      <link>https://community.intel.com/t5/Intel-Xeon-Processor-and-Server/Random-System-Crashes-M50CYP1UR204/m-p/1402132#M22286</link>
      <description>&lt;P&gt;Hello BevanO,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;I am following up to check whether you found the provided information useful. If you have further questions, please don't hesitate to ask. If you consider the issue resolved, please let us know so we can mark this thread as resolved. I will try to reach you one last time next Monday, the 25th. After that, the thread will be automatically archived.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Regards&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Jose A.&lt;/P&gt;&lt;P&gt;Intel Customer Support Technician&lt;/P&gt;&lt;P&gt;&lt;EM&gt;For firmware updates and troubleshooting tips, visit:&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="https://intel.com/support/serverbios" rel="noopener noreferrer" target="_blank"&gt;&lt;EM&gt;https://intel.com/support/serverbios&lt;/EM&gt;&lt;/A&gt;&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Thu, 21 Jul 2022 05:59:22 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Xeon-Processor-and-Server/Random-System-Crashes-M50CYP1UR204/m-p/1402132#M22286</guid>
      <dc:creator>JoseH_Intel</dc:creator>
      <dc:date>2022-07-21T05:59:22Z</dc:date>
    </item>
    <item>
      <title>Re:Random System Crashes M50CYP1UR204</title>
      <link>https://community.intel.com/t5/Intel-Xeon-Processor-and-Server/Random-System-Crashes-M50CYP1UR204/m-p/1403375#M22303</link>
      <description>&lt;P&gt;Hello BevanO,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;We will proceed to mark this thread as resolved. If you have further issues or questions just go ahead and submit a new topic.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Regards&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Jose A.&lt;/P&gt;&lt;P&gt;Intel Customer Support Technician&lt;/P&gt;&lt;P&gt;&lt;EM&gt;For firmware updates and troubleshooting tips, visit:&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="https://intel.com/support/serverbios" rel="noopener noreferrer" target="_blank"&gt;&lt;EM&gt;https://intel.com/support/serverbios&lt;/EM&gt;&lt;/A&gt;&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Tue, 26 Jul 2022 06:03:19 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-Xeon-Processor-and-Server/Random-System-Crashes-M50CYP1UR204/m-p/1403375#M22303</guid>
      <dc:creator>JoseH_Intel</dc:creator>
      <dc:date>2022-07-26T06:03:19Z</dc:date>
    </item>
  </channel>
</rss>

