<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Thank you.  How would you in Software Tuning, Performance Optimization &amp; Platform Monitoring</title>
    <link>https://community.intel.com/t5/Software-Tuning-Performance/How-to-debug-ProcHot/m-p/1116802#M6124</link>
    <description>&lt;P&gt;Thank you. &amp;nbsp;How would you check the MSRs on a Windows system? &amp;nbsp;&lt;/P&gt;

&lt;P&gt;I saw this link. &amp;nbsp;http://faydoc.tripod.com/cpu/rdmsr.htm&lt;/P&gt;

&lt;P&gt;"&lt;SPAN style="color: rgb(0, 0, 0); font-family: &amp;quot;Times New Roman&amp;quot;; font-size: medium;"&gt;This instruction must be executed at privilege level 0 or in real-address mode"&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: &amp;quot;Times New Roman&amp;quot;; font-size: medium;"&gt;How do we get into either of those modes?&lt;/SPAN&gt;&lt;/P&gt;</description>
    <pubDate>Mon, 13 Mar 2017 18:53:00 GMT</pubDate>
    <dc:creator>Adrian_C_</dc:creator>
    <dc:date>2017-03-13T18:53:00Z</dc:date>
    <item>
      <title>How to debug ProcHot?</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/How-to-debug-ProcHot/m-p/1116800#M6122</link>
      <description>&lt;P&gt;Not sure if this is the right forum. &amp;nbsp;&lt;SPAN style="font-size: 1em;"&gt;We have a Xeon server platform that is encountering a ProcHot after running IO meter tests. &amp;nbsp;We can configure our BMC sensors to collect the temp. readings. &amp;nbsp;What other tools are available for debug?&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Sat, 11 Mar 2017 05:55:37 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/How-to-debug-ProcHot/m-p/1116800#M6122</guid>
      <dc:creator>Adrian_C_</dc:creator>
      <dc:date>2017-03-11T05:55:37Z</dc:date>
    </item>
    <item>
      <title>Not sure what you mean by</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/How-to-debug-ProcHot/m-p/1116801#M6123</link>
      <description>&lt;P&gt;Not sure what you mean by "debug" in this case....&amp;nbsp;&lt;/P&gt;

&lt;P&gt;The first thing to do is determine whether this is an "internal" or "external" PROCHOT.&amp;nbsp;&amp;nbsp;&lt;/P&gt;

&lt;UL&gt;
	&lt;LI&gt;Section 14.8 of Volume 3 of the Intel Architectures SW Developer's Manual discusses the IA32_PACKAGE_THERM_STATUS register (MSR 0x1b1).&amp;nbsp; Bits 0-1 tell you about the internal temperature sensors, and bits 2-3 tell you about the external PROCHOT signal.&lt;/LI&gt;
	&lt;LI&gt;Due to the power-throttling in recent processors, I typically monitor IA32_PACKAGE_THERM_STATUS (MSR 0x1b1) and sometimes monitor the core-specific values in each core's IA32_THERM_STATUS register (MSR 0x19C).
		&lt;UL&gt;
			&lt;LI&gt;Starting with the Haswell processors, additional information is available in the MSR_CORE_PERF_LIMIT_REASONS register (MSR 0x690).&lt;/LI&gt;
			&lt;LI&gt;While many of the bits in these MSRs clearly work as described, I am not able to test all of them, and some of the bits give suspicious results.&lt;/LI&gt;
		&lt;/UL&gt;
	&lt;/LI&gt;
&lt;/UL&gt;

&lt;P&gt;If the digital temperature sensors in the IA32_PACKAGE_THERM_STATUS register show that you are actually reaching the PROCHOT temperature, then the obvious checks are:&lt;/P&gt;

&lt;UL&gt;
	&lt;LI&gt;Check the airflow path for obstructions.&lt;/LI&gt;
	&lt;LI&gt;Monitor the fans to make sure they are spinning up properly.&lt;/LI&gt;
	&lt;LI&gt;Check the heat-sink installation to make sure they were installed properly &amp;amp; with proper thermal paste.&lt;/LI&gt;
&lt;/UL&gt;

&lt;P&gt;I had one system with a bad thermal solution that hit PROCHOT under fairly light loads.&amp;nbsp; I verified that the airflow path was OK, but did not get around to checking the fans or heat-sink installation -- I was having too much fun studying the behavior of the processor while it was throttled!&amp;nbsp; (This was one of several pre-production loaner systems from a vendor, so it was not critical to have it fully operational.)&lt;/P&gt;

&lt;P&gt;We have also seen cases where the external PROCHOT signal was applied even when the processor reported temperatures well below PROCHOT.&amp;nbsp; These were tracked down to bugs in the software on the system management controllers and were eventually fixed.&amp;nbsp; The good news is that these were temporary glitches, and when the external PROCHOT signal was dropped, the processors returned to normal behavior.&lt;/P&gt;

&lt;P&gt;In the Xeon E5 processors, it is possible that additional information can be obtained from the performance counters in the Power Control Unit of the Uncore.&amp;nbsp; These are described in the Uncore Performance Monitoring manuals for various processor generations (v1, v2, v3, v4).&amp;nbsp; (I have not tried this myself -- most of our systems are kept very cool, so we see power-throttling frequently, but not thermal throttling.)&lt;/P&gt;</description>
      <pubDate>Mon, 13 Mar 2017 14:25:47 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/How-to-debug-ProcHot/m-p/1116801#M6123</guid>
      <dc:creator>McCalpinJohn</dc:creator>
      <dc:date>2017-03-13T14:25:47Z</dc:date>
    </item>
    <item>
      <title>Thank you.  How would you</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/How-to-debug-ProcHot/m-p/1116802#M6124</link>
      <description>&lt;P&gt;Thank you. &amp;nbsp;How would you check the MSRs on a Windows system? &amp;nbsp;&lt;/P&gt;

&lt;P&gt;I saw this link. &amp;nbsp;http://faydoc.tripod.com/cpu/rdmsr.htm&lt;/P&gt;

&lt;P&gt;"&lt;SPAN style="color: rgb(0, 0, 0); font-family: &amp;quot;Times New Roman&amp;quot;; font-size: medium;"&gt;This instruction must be executed at privilege level 0 or in real-address mode"&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="color: rgb(0, 0, 0); font-family: &amp;quot;Times New Roman&amp;quot;; font-size: medium;"&gt;How do we get into either of those modes?&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 13 Mar 2017 18:53:00 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/How-to-debug-ProcHot/m-p/1116802#M6124</guid>
      <dc:creator>Adrian_C_</dc:creator>
      <dc:date>2017-03-13T18:53:00Z</dc:date>
    </item>
    <item>
      <title>I don't know what the magic</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/How-to-debug-ProcHot/m-p/1116803#M6125</link>
      <description>&lt;P&gt;I don't know what the magic infrastructure looks like on Windows.&amp;nbsp; You have to figure out how to run in the kernel -- the "real-address mode" comment is ancient history.&lt;/P&gt;

&lt;P&gt;Windows has some debug facilities to access MSRs -- e.g., &lt;A href="https://msdn.microsoft.com/en-us/library/windows/hardware/ff553516(v=vs.85).aspx" target="_blank"&gt;https://msdn.microsoft.com/en-us/library/windows/hardware/ff553516(v=vs.85).aspx&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;Intel has some code that allows their Performance Counter Monitor to access the MSRs on Windows systems -- have a look at &lt;A href="https://software.intel.com/en-us/articles/intel-performance-counter-monitor" target="_blank"&gt;https://software.intel.com/en-us/articles/intel-performance-counter-monitor&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;It looks like Intel has discontinued PCM in favor of an open source project at &lt;A href="https://github.com/opcm/pcm.&amp;nbsp;" target="_blank"&gt;https://github.com/opcm/pcm.&amp;nbsp;&lt;/A&gt;; This has some notes on Windows.&amp;nbsp; One of the directories in the githib site is &lt;A href="https://github.com/opcm/pcm/tree/master/PCM-Power_Win" target="_blank"&gt;https://github.com/opcm/pcm/tree/master/PCM-Power_Win&lt;/A&gt;, which might have what you need already....&lt;/P&gt;</description>
      <pubDate>Mon, 13 Mar 2017 20:07:13 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/How-to-debug-ProcHot/m-p/1116803#M6125</guid>
      <dc:creator>McCalpinJohn</dc:creator>
      <dc:date>2017-03-13T20:07:13Z</dc:date>
    </item>
    <item>
      <title>Hi,</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/How-to-debug-ProcHot/m-p/1116804#M6126</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;

&lt;P&gt;&lt;A href="https://software.intel.com/en-us/articles/intel-performance-counter-monitor/"&gt;PCM-power&lt;/A&gt; utility can show prochot metrics out-of-the-box. You can download it here:&amp;nbsp;https://github.com/opcm/pcm (and here are the binaries: &lt;A href="https://ci.appveyor.com/project/opcm/pcm/build/artifacts" target="_blank"&gt;https://ci.appveyor.com/project/opcm/pcm/build/artifacts&lt;/A&gt; )&lt;/P&gt;

&lt;P&gt;Thanks,&lt;/P&gt;

&lt;P&gt;Roman&lt;/P&gt;</description>
      <pubDate>Fri, 17 Mar 2017 09:42:54 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/How-to-debug-ProcHot/m-p/1116804#M6126</guid>
      <dc:creator>Roman_D_Intel</dc:creator>
      <dc:date>2017-03-17T09:42:54Z</dc:date>
    </item>
    <item>
      <title>Re: Not sure what you mean by</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/How-to-debug-ProcHot/m-p/1269563#M7851</link>
      <description>&lt;P&gt;Hi John,&lt;/P&gt;
&lt;P&gt;Sorry to wake-up this thread up but recently I have cleared those&amp;nbsp;IA32_THERM_STATUS |&amp;nbsp;IA32_PACKAGE_THERM_STATUS log bits on Coffee Lake processors which froze immediately.&lt;/P&gt;
&lt;P&gt;For exemple: " PROTCHOT # or FORCEPR# Log (R/WC0) "&lt;/P&gt;
&lt;P&gt;As mentioned by the SDM: "&amp;nbsp;Software may clear this bit by writing a zero. "&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The SDM specifies those bits as "R/WC0", but I don't find the meaning of this abbreviation and I believe it might make the difference when writing zero in those bits from my privileged kernel driver.&lt;/P&gt;
&lt;P&gt;Do you think I miss to check some CPUID or other MSR registers prior altering these log bits ?&lt;/P&gt;
&lt;P&gt;Or, am I facing undocumented exceptions to these IA registers ?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks for any help,&lt;/P&gt;
&lt;P&gt;Regards,&lt;/P&gt;
&lt;P&gt;Cyril&lt;/P&gt;</description>
      <pubDate>Wed, 31 Mar 2021 09:50:00 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/How-to-debug-ProcHot/m-p/1269563#M7851</guid>
      <dc:creator>CyrIng</dc:creator>
      <dc:date>2021-03-31T09:50:00Z</dc:date>
    </item>
    <item>
      <title>Re: Not sure what you mean by</title>
      <link>https://community.intel.com/t5/Software-Tuning-Performance/How-to-debug-ProcHot/m-p/1272540#M7863</link>
      <description>&lt;P&gt;It may sound obvious but don't clear&amp;nbsp;&lt;SPAN&gt;IA32_THERM_STATUS Log bits if&amp;nbsp;IA32_THERM_INTERRUPT High or Low&amp;nbsp;Temperature Interrupt bit is enabled.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;An thermal interrupt handler might have been installed by the Kernel or SMI and thus claim control over those registers.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Issue fixed in this &lt;A href="https://github.com/cyring/CoreFreq/issues/225" target="_self"&gt;thread&lt;/A&gt;.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Sat, 10 Apr 2021 08:42:57 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Tuning-Performance/How-to-debug-ProcHot/m-p/1272540#M7863</guid>
      <dc:creator>CyrIng</dc:creator>
      <dc:date>2021-04-10T08:42:57Z</dc:date>
    </item>
  </channel>
</rss>

