<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re:Intel Processor Checker in Intel® MPI Library</title>
    <link>https://community.intel.com/t5/Intel-MPI-Library/Intel-Processor-Checker/m-p/1210149#M7160</link>
    <description>&lt;P&gt;Hi Kihang,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Can you provide the CPU information of that processor?&lt;/P&gt;&lt;P&gt;&lt;I&gt;command: &amp;nbsp;&lt;/I&gt;&lt;B&gt;&lt;I&gt;cpuinfo&lt;/I&gt;&lt;/B&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Intel does provide tools to find problems in HPC applications:&lt;/P&gt;&lt;P&gt;&amp;nbsp;1)To check whether the problem is in cluster setup/hardware, you can check the status of hardware using cluster checker.&lt;/P&gt;&lt;P&gt;        &lt;B&gt;&lt;I&gt;source &amp;lt;install_dir&amp;gt;/clck/latest/env/vars.sh &lt;/I&gt;&lt;/B&gt;&lt;/P&gt;&lt;P&gt;&lt;B&gt;&lt;I&gt; clck -F&amp;lt;Framework&amp;gt; -f nodefile   &lt;/I&gt;&lt;/B&gt;&lt;/P&gt;&lt;P&gt;     The nodefile to be filled with hostnames, the framework can be cpu_info, hyper_threading etc based on your requirement. You can get the list of available frameworks using  &lt;B&gt;&lt;I&gt;clck -Xlist&lt;/I&gt;&lt;/B&gt;&lt;/P&gt;&lt;P&gt;For more info&amp;nbsp;please check &lt;A href="https://software.intel.com/content/www/us/en/develop/documentation/cluster-checker-user-guide/top/getting-started.html" rel="noopener noreferrer" target="_blank"&gt;https://software.intel.com/content/www/us/en/develop/documentation/cluster-checker-user-guide/top/getting-started.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;2)To check for problems in code,  you can analyze the application using ITAC.&lt;/P&gt;&lt;P&gt; &lt;B&gt;&lt;I&gt;source &amp;lt;itac_installdir&amp;gt;/bin/itacvars.sh&lt;/I&gt;&lt;/B&gt;&lt;/P&gt;&lt;P&gt; &lt;B&gt;&lt;I&gt;mpiicc -g&amp;nbsp;-trace &amp;lt;app_name&amp;gt;&amp;nbsp;&amp;lt;args..&amp;gt;&lt;/I&gt;&lt;/B&gt;&lt;/P&gt;&lt;P&gt; &lt;B&gt;&lt;I&gt;mpirun -genv VT_LOGFILE_FORMAT=SINGLESTF -trace -n 16 -ppn 2 -f hosts.txt ./&amp;lt;executable&amp;gt;&amp;nbsp;&lt;/I&gt;&lt;/B&gt;&lt;/P&gt;&lt;P&gt; &lt;B&gt;&lt;I&gt;traceanalyzer ./&amp;lt;application_name&amp;gt;.stf &amp;amp;&lt;/I&gt;&lt;/B&gt;&lt;/P&gt;&lt;P&gt;For more info please check &lt;A href="https://software.intel.com/content/www/us/en/develop/documentation/itac-vtune-mpi-openmp-tutorial-lin/top/identify-communication-issues-with-intel-trace-analyzer-and-collector.html" rel="noopener noreferrer" target="_blank"&gt;https://software.intel.com/content/www/us/en/develop/documentation/itac-vtune-mpi-openmp-tutorial-lin/top/identify-communication-issues-with-intel-trace-analyzer-and-collector.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Hope this information will help in debugging the problem.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Regards&lt;/P&gt;&lt;P&gt;Prasanth&lt;/P&gt;&lt;BR /&gt;</description>
    <pubDate>Thu, 17 Sep 2020 10:32:38 GMT</pubDate>
    <dc:creator>PrasanthD_intel</dc:creator>
    <dc:date>2020-09-17T10:32:38Z</dc:date>
    <item>
      <title>Intel Processor Checker</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Intel-Processor-Checker/m-p/1210042#M7158</link>
      <description>&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Hello everybody,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I am writing to &lt;SPAN class="tlid-translation translation"&gt;&lt;SPAN class="" title=""&gt;ask a question about the problem of randomly different results when calculating using a specific processor.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN class="tlid-translation translation"&gt;&lt;SPAN class="" title=""&gt;I have a forecast model where the values vary when repeatedly executed on a specific compute node.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN class="tlid-translation translation"&gt;&lt;SPAN class="" title=""&gt; Even when analyzing at the code level, the difference in values does not occur at the same source code location every time, nor does it occur at the same time.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN class="tlid-translation translation"&gt;&lt;SPAN class="" title=""&gt;&lt;SPAN title=""&gt;I have found that this problem occurs when using a specific processor through the following options.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN class="tlid-translation translation"&gt;&lt;SPAN class="" title=""&gt;I_MPI_PIN_PROCESSOR_LIST="0-6,8-47": No error&lt;BR /&gt;I_MPI_PIN_PROCESSOR_LIST="0-7,9-47": Error&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN class="tlid-translation translation"&gt;&lt;SPAN class="" title=""&gt;&lt;SPAN title=""&gt;Therefore, I am judging that there is a problem with the 8th core (processor).&lt;/SPAN&gt; &lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN class="tlid-translation translation"&gt;&lt;SPAN class="" title=""&gt;&lt;SPAN title=""&gt;In this case, the following three questions arise.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN title=""&gt;1) What factors (e.g. hardware, software) are in the core causing the result to be different?&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN class="tlid-translation translation"&gt;&lt;SPAN class="" title=""&gt;&lt;SPAN title=""&gt;2) Is this related to the source code (e.g. MKL, MPI)?&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN class="tlid-translation translation"&gt;&lt;SPAN class="" title=""&gt;&lt;SPAN title=""&gt;- Because I want to know if it affects other source code.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN class="tlid-translation translation"&gt;&lt;SPAN class="" title=""&gt;3) Is there a tool (like HPL) that quickly finds this problem?&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN class="tlid-translation translation"&gt;&lt;SPAN class="" title=""&gt;&lt;SPAN title=""&gt;- Because I have a lot of nodes I need to manage, so I hope there is a simple test tool.&lt;/SPAN&gt; It would be nice if it would be a tool that judges whether it gives a difference or not to the result value if it is executed.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN class="tlid-translation translation"&gt;&lt;SPAN class="" title=""&gt;Thank you in advance&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN class="tlid-translation translation"&gt;&lt;SPAN class="" title=""&gt;Kihang&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 17 Sep 2020 02:05:54 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Intel-Processor-Checker/m-p/1210042#M7158</guid>
      <dc:creator>youn__kihang</dc:creator>
      <dc:date>2020-09-17T02:05:54Z</dc:date>
    </item>
    <item>
      <title>Re:Intel Processor Checker</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Intel-Processor-Checker/m-p/1210149#M7160</link>
      <description>&lt;P&gt;Hi Kihang,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Can you provide the CPU information of that processor?&lt;/P&gt;&lt;P&gt;&lt;I&gt;command: &amp;nbsp;&lt;/I&gt;&lt;B&gt;&lt;I&gt;cpuinfo&lt;/I&gt;&lt;/B&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Intel does provide tools to find problems in HPC applications:&lt;/P&gt;&lt;P&gt;&amp;nbsp;1)To check whether the problem is in cluster setup/hardware, you can check the status of hardware using cluster checker.&lt;/P&gt;&lt;P&gt;        &lt;B&gt;&lt;I&gt;source &amp;lt;install_dir&amp;gt;/clck/latest/env/vars.sh &lt;/I&gt;&lt;/B&gt;&lt;/P&gt;&lt;P&gt;&lt;B&gt;&lt;I&gt; clck -F&amp;lt;Framework&amp;gt; -f nodefile   &lt;/I&gt;&lt;/B&gt;&lt;/P&gt;&lt;P&gt;     The nodefile to be filled with hostnames, the framework can be cpu_info, hyper_threading etc based on your requirement. You can get the list of available frameworks using  &lt;B&gt;&lt;I&gt;clck -Xlist&lt;/I&gt;&lt;/B&gt;&lt;/P&gt;&lt;P&gt;For more info&amp;nbsp;please check &lt;A href="https://software.intel.com/content/www/us/en/develop/documentation/cluster-checker-user-guide/top/getting-started.html" rel="noopener noreferrer" target="_blank"&gt;https://software.intel.com/content/www/us/en/develop/documentation/cluster-checker-user-guide/top/getting-started.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;2)To check for problems in code,  you can analyze the application using ITAC.&lt;/P&gt;&lt;P&gt; &lt;B&gt;&lt;I&gt;source &amp;lt;itac_installdir&amp;gt;/bin/itacvars.sh&lt;/I&gt;&lt;/B&gt;&lt;/P&gt;&lt;P&gt; &lt;B&gt;&lt;I&gt;mpiicc -g&amp;nbsp;-trace &amp;lt;app_name&amp;gt;&amp;nbsp;&amp;lt;args..&amp;gt;&lt;/I&gt;&lt;/B&gt;&lt;/P&gt;&lt;P&gt; &lt;B&gt;&lt;I&gt;mpirun -genv VT_LOGFILE_FORMAT=SINGLESTF -trace -n 16 -ppn 2 -f hosts.txt ./&amp;lt;executable&amp;gt;&amp;nbsp;&lt;/I&gt;&lt;/B&gt;&lt;/P&gt;&lt;P&gt; &lt;B&gt;&lt;I&gt;traceanalyzer ./&amp;lt;application_name&amp;gt;.stf &amp;amp;&lt;/I&gt;&lt;/B&gt;&lt;/P&gt;&lt;P&gt;For more info please check &lt;A href="https://software.intel.com/content/www/us/en/develop/documentation/itac-vtune-mpi-openmp-tutorial-lin/top/identify-communication-issues-with-intel-trace-analyzer-and-collector.html" rel="noopener noreferrer" target="_blank"&gt;https://software.intel.com/content/www/us/en/develop/documentation/itac-vtune-mpi-openmp-tutorial-lin/top/identify-communication-issues-with-intel-trace-analyzer-and-collector.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Hope this information will help in debugging the problem.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Regards&lt;/P&gt;&lt;P&gt;Prasanth&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Thu, 17 Sep 2020 10:32:38 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Intel-Processor-Checker/m-p/1210149#M7160</guid>
      <dc:creator>PrasanthD_intel</dc:creator>
      <dc:date>2020-09-17T10:32:38Z</dc:date>
    </item>
    <item>
      <title>Re:Intel Processor Checker</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Intel-Processor-Checker/m-p/1211818#M7180</link>
      <description>&lt;P&gt;Hi Kihang,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Instead of a specific CPU not working, we think these errors might be due to data race conditions in the code.&lt;/P&gt;&lt;P&gt;Have you run the test enough times and found similar behaviour when that CPU is involved?&lt;/P&gt;&lt;P&gt;Can you provide your command line? (how you were launching the MPI) and if possible, provide the code too.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;You can check for the correctness of the code using ITAC. (This will report if there any race conditions and other errors too).&lt;/P&gt;&lt;P&gt;&lt;B style="font-size: 14px;"&gt;&lt;I&gt;source &amp;lt;itac_installdir&amp;gt;/bin/itacvars.sh&lt;/I&gt;&lt;/B&gt;&lt;/P&gt;&lt;P&gt;&lt;B style="font-size: 14px;"&gt;&lt;I&gt;mpirun -np&amp;lt; &amp;gt; -check_mpi  ./&amp;lt;executable&amp;gt;&lt;/I&gt;&lt;/B&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;If your program involves OpenMP/TBB or any other threading use Intel Inspector to analyse the application.&lt;/P&gt;&lt;P&gt;You can see how to use Inspector for MPI here: &lt;A target="_blank" rel="noopener noreferrer" href="https://software.intel.com/content/www/us/en/develop/documentation/inspector-user-guide-linux/top/mpi-applications-support/collecting-mpi-performance-correctness-data.html"&gt;https://software.intel.com/content/www/us/en/develop/documentation/inspector-user-guide-linux/top/mpi-applications-support/collecting-mpi-performance-correctness-data.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Regards&lt;/P&gt;&lt;P&gt;Prasanth&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Wed, 23 Sep 2020 10:29:32 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Intel-Processor-Checker/m-p/1211818#M7180</guid>
      <dc:creator>PrasanthD_intel</dc:creator>
      <dc:date>2020-09-23T10:29:32Z</dc:date>
    </item>
    <item>
      <title>Re:Intel Processor Checker</title>
      <link>https://community.intel.com/t5/Intel-MPI-Library/Intel-Processor-Checker/m-p/1213758#M7204</link>
      <description>&lt;P&gt;Hi Kihang,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;We are closing this thread assuming your issue is resolved.&lt;/P&gt;&lt;P&gt;Please raise a new thread for any further questions. Any further interaction in this thread will be considered community only&lt;/P&gt;&lt;P&gt;Regards&lt;/P&gt;&lt;P&gt;Prasanth&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Wed, 30 Sep 2020 12:16:54 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-MPI-Library/Intel-Processor-Checker/m-p/1213758#M7204</guid>
      <dc:creator>PrasanthD_intel</dc:creator>
      <dc:date>2020-09-30T12:16:54Z</dc:date>
    </item>
  </channel>
</rss>

