<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Error mesage when running Intel® Optimized LINPACK Benchmark for Linux* OS on Intel Phi cards. in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Error-mesage-when-running-Intel-Optimized-LINPACK-Benchmark-for/m-p/950782#M15193</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;I am trying Intel® Optimized LINPACK Benchmark for Linux* OS on Multi-Intel Phi cards configuration.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;(&lt;A href="http://software.intel.com/sites/products/documentation/doclib/mkl_sa/11/mkl_userguide_lnx/GUID-D15B5C2F-07AC-4449-B148-6AF1DFDE674D.htm"&gt;http://software.intel.com/sites/products/documentation/doclib/mkl_sa/11/mkl_userguide_lnx/GUID-D15B5C2F-07AC-4449-B148-6AF1DFDE674D.htm&lt;/A&gt;).&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;My test environment :&lt;/P&gt;
&lt;OL&gt;
&lt;LI&gt;AIC Sandy Bridge EP-4S server system with Sandy Bridge EP-4S *4 + 98GB memory&lt;/LI&gt;
&lt;LI&gt;Intel Xeon Phi : 3 pcs of 3110 and 4 pcs of 3115&lt;/LI&gt;
&lt;LI&gt;OS: Redhat Enterprise Linux 6.2 x64&lt;/LI&gt;
&lt;LI&gt;Xeon Phi MPSS: KNC_gold_update_2-2.1.5889-16-rhel-6.2.tar&lt;/LI&gt;
&lt;LI&gt;Intel Composer XE : l_ccompxe_2013.3.163.tgz&lt;/LI&gt;
&lt;LI&gt;Intel MPI : l_mpi_p_4.1.0.024.tgz or l_mpi_p_4.1.0.030.tgz&lt;/LI&gt;
&lt;/OL&gt;
&lt;P&gt;After ran the runme_xeon64_ao script to enables acceleration by offloading computations to Intel Xeon Phi coprocessors available on the system, I found that when I increase the HPL problem size(Ns) to a arrange, Linpack process(xlinpack_xeon64) will run endlessly and can’t be finished and found some relevant error message in host system log . For example, at 7 pcs Phi configuration, I got this problem when I set HPL problem size(Ns) to 46000. It related to Phi card quantity. At 1 pcs Phi configuration, I can increase HPL problem size(Ns) to 100000 without problem.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The below is error message:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;__scif_fence_wait 3041 err -16&lt;/P&gt;
&lt;P&gt;dma_mark_wait 1080 TO chan 0x0&lt;/P&gt;
&lt;P&gt;drain_dma_intr 1151 err -16&lt;/P&gt;
&lt;P&gt;micscif_rma_destroy_temp_windows 2082 DMA channel 0 hung ep-&amp;gt;state 2 window-&amp;gt;dma_mark 0x1c0 channel_mark 0x1c2&lt;/P&gt;
&lt;P&gt;------------[ cut here ]------------&lt;/P&gt;
&lt;P&gt;WARNING: at /home/build/sandbox/mpss/MPSS_4982/k1om/rhel-6.2/mpss/.rpmbuild_4982/BUILD/intel-mic-kmod-2.1.4982/micscif_rma.c:2084 micscif_rma_destroy_temp_windows+0x314/0x540 [mic]() (Not tainted)&lt;/P&gt;
&lt;P&gt;Hardware name: SB301-TO&lt;/P&gt;
&lt;P&gt;Modules linked in: ipmi_devintf ipmi_si ipmi_msghandler autofs4 sunrpc cpufreq_ondemand acpi_cpufreq freq_table mperf ipv6 mic(U) microcode sg ixgbe dca mdio sb_edac edac_core iTCO_wdt iTCO_vendor_support shpchp e1000e i2c_i801 i2c_core ext4 mbcache jbd2 sr_mod cdrom usb_storage sd_mod crc_t10dif ahci isci libsas scsi_transport_sas dm_mirror dm_region_hash dm_log dm_mod [last unloaded: scsi_wait_scan]&lt;/P&gt;
&lt;P&gt;Pid: 2812, comm: SCIF_MISC Not tainted 2.6.32-220.el6.x86_64 #1&lt;/P&gt;
&lt;P&gt;Call Trace:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffff81069b77&amp;gt;] ? warn_slowpath_common+0x87/0xc0&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffff81069bca&amp;gt;] ? warn_slowpath_null+0x1a/0x20&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffffa0235664&amp;gt;] ? micscif_rma_destroy_temp_windows+0x314/0x540 [mic]&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffffa02321b5&amp;gt;] ? micscif_rma_handle_remote_fences+0x155/0x380 [mic]&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffff814eca40&amp;gt;] ? thread_return+0x4e/0x77e&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffff8100bc0e&amp;gt;] ? apic_timer_interrupt+0xe/0x20&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffffa022a0f0&amp;gt;] ? micscif_misc_handler+0x0/0xc0 [mic]&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffffa022a10a&amp;gt;] ? micscif_misc_handler+0x1a/0xc0 [mic]&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffffa022a0f0&amp;gt;] ? micscif_misc_handler+0x0/0xc0 [mic]&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffff8108b2b0&amp;gt;] ? worker_thread+0x170/0x2a0&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffff81090bf0&amp;gt;] ? autoremove_wake_function+0x0/0x40&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffff8108b140&amp;gt;] ? worker_thread+0x0/0x2a0&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffff81090886&amp;gt;] ? kthread+0x96/0xa0&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffff8100c14a&amp;gt;] ? child_rip+0xa/0x20&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffff810907f0&amp;gt;] ? kthread+0x0/0xa0&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffff8100c140&amp;gt;] ? child_rip+0x0/0x20&lt;/P&gt;
&lt;P&gt;---[ end trace e0d2c31584645743 ]---&lt;/P&gt;
&lt;P&gt;&lt;A href="http://software.intel.com/en-us/forums/topic/392243/feed"&gt;RSS&lt;/A&gt; &lt;A href="http://software.intel.com/en-us/forums/topic/392243#"&gt;Top&lt;/A&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;A href="http://software.intel.com/en-us/flag/flag/node_spam/392243?destination=node/392243&amp;amp;token=5HtWu2StfhfdX12GBDC-oalC6nghQqvH14bmGptqDSQ"&gt;Flag as spam&lt;/A&gt;&amp;nbsp;&lt;/LI&gt;
&lt;LI&gt;&lt;A href="http://software.intel.com/en-us/flag/flag/node_inappropriate/392243?destination=node/392243&amp;amp;token=5HtWu2StfhfdX12GBDC-oalC6nghQqvH14bmGptqDSQ"&gt;Flag as inappropriate&lt;/A&gt;&amp;nbsp;&lt;/LI&gt;
&lt;/UL&gt;</description>
    <pubDate>Tue, 07 May 2013 03:00:32 GMT</pubDate>
    <dc:creator>Tinway_Chen</dc:creator>
    <dc:date>2013-05-07T03:00:32Z</dc:date>
    <item>
      <title>Error mesage when running Intel® Optimized LINPACK Benchmark for Linux* OS on Intel Phi cards.</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Error-mesage-when-running-Intel-Optimized-LINPACK-Benchmark-for/m-p/950782#M15193</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;I am trying Intel® Optimized LINPACK Benchmark for Linux* OS on Multi-Intel Phi cards configuration.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;(&lt;A href="http://software.intel.com/sites/products/documentation/doclib/mkl_sa/11/mkl_userguide_lnx/GUID-D15B5C2F-07AC-4449-B148-6AF1DFDE674D.htm"&gt;http://software.intel.com/sites/products/documentation/doclib/mkl_sa/11/mkl_userguide_lnx/GUID-D15B5C2F-07AC-4449-B148-6AF1DFDE674D.htm&lt;/A&gt;).&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;My test environment :&lt;/P&gt;
&lt;OL&gt;
&lt;LI&gt;AIC Sandy Bridge EP-4S server system with Sandy Bridge EP-4S *4 + 98GB memory&lt;/LI&gt;
&lt;LI&gt;Intel Xeon Phi : 3 pcs of 3110 and 4 pcs of 3115&lt;/LI&gt;
&lt;LI&gt;OS: Redhat Enterprise Linux 6.2 x64&lt;/LI&gt;
&lt;LI&gt;Xeon Phi MPSS: KNC_gold_update_2-2.1.5889-16-rhel-6.2.tar&lt;/LI&gt;
&lt;LI&gt;Intel Composer XE : l_ccompxe_2013.3.163.tgz&lt;/LI&gt;
&lt;LI&gt;Intel MPI : l_mpi_p_4.1.0.024.tgz or l_mpi_p_4.1.0.030.tgz&lt;/LI&gt;
&lt;/OL&gt;
&lt;P&gt;After ran the runme_xeon64_ao script to enables acceleration by offloading computations to Intel Xeon Phi coprocessors available on the system, I found that when I increase the HPL problem size(Ns) to a arrange, Linpack process(xlinpack_xeon64) will run endlessly and can’t be finished and found some relevant error message in host system log . For example, at 7 pcs Phi configuration, I got this problem when I set HPL problem size(Ns) to 46000. It related to Phi card quantity. At 1 pcs Phi configuration, I can increase HPL problem size(Ns) to 100000 without problem.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The below is error message:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;__scif_fence_wait 3041 err -16&lt;/P&gt;
&lt;P&gt;dma_mark_wait 1080 TO chan 0x0&lt;/P&gt;
&lt;P&gt;drain_dma_intr 1151 err -16&lt;/P&gt;
&lt;P&gt;micscif_rma_destroy_temp_windows 2082 DMA channel 0 hung ep-&amp;gt;state 2 window-&amp;gt;dma_mark 0x1c0 channel_mark 0x1c2&lt;/P&gt;
&lt;P&gt;------------[ cut here ]------------&lt;/P&gt;
&lt;P&gt;WARNING: at /home/build/sandbox/mpss/MPSS_4982/k1om/rhel-6.2/mpss/.rpmbuild_4982/BUILD/intel-mic-kmod-2.1.4982/micscif_rma.c:2084 micscif_rma_destroy_temp_windows+0x314/0x540 [mic]() (Not tainted)&lt;/P&gt;
&lt;P&gt;Hardware name: SB301-TO&lt;/P&gt;
&lt;P&gt;Modules linked in: ipmi_devintf ipmi_si ipmi_msghandler autofs4 sunrpc cpufreq_ondemand acpi_cpufreq freq_table mperf ipv6 mic(U) microcode sg ixgbe dca mdio sb_edac edac_core iTCO_wdt iTCO_vendor_support shpchp e1000e i2c_i801 i2c_core ext4 mbcache jbd2 sr_mod cdrom usb_storage sd_mod crc_t10dif ahci isci libsas scsi_transport_sas dm_mirror dm_region_hash dm_log dm_mod [last unloaded: scsi_wait_scan]&lt;/P&gt;
&lt;P&gt;Pid: 2812, comm: SCIF_MISC Not tainted 2.6.32-220.el6.x86_64 #1&lt;/P&gt;
&lt;P&gt;Call Trace:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffff81069b77&amp;gt;] ? warn_slowpath_common+0x87/0xc0&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffff81069bca&amp;gt;] ? warn_slowpath_null+0x1a/0x20&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffffa0235664&amp;gt;] ? micscif_rma_destroy_temp_windows+0x314/0x540 [mic]&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffffa02321b5&amp;gt;] ? micscif_rma_handle_remote_fences+0x155/0x380 [mic]&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffff814eca40&amp;gt;] ? thread_return+0x4e/0x77e&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffff8100bc0e&amp;gt;] ? apic_timer_interrupt+0xe/0x20&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffffa022a0f0&amp;gt;] ? micscif_misc_handler+0x0/0xc0 [mic]&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffffa022a10a&amp;gt;] ? micscif_misc_handler+0x1a/0xc0 [mic]&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffffa022a0f0&amp;gt;] ? micscif_misc_handler+0x0/0xc0 [mic]&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffff8108b2b0&amp;gt;] ? worker_thread+0x170/0x2a0&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffff81090bf0&amp;gt;] ? autoremove_wake_function+0x0/0x40&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffff8108b140&amp;gt;] ? worker_thread+0x0/0x2a0&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffff81090886&amp;gt;] ? kthread+0x96/0xa0&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffff8100c14a&amp;gt;] ? child_rip+0xa/0x20&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffff810907f0&amp;gt;] ? kthread+0x0/0xa0&lt;/P&gt;
&lt;P&gt;&amp;nbsp;[&amp;lt;ffffffff8100c140&amp;gt;] ? child_rip+0x0/0x20&lt;/P&gt;
&lt;P&gt;---[ end trace e0d2c31584645743 ]---&lt;/P&gt;
&lt;P&gt;&lt;A href="http://software.intel.com/en-us/forums/topic/392243/feed"&gt;RSS&lt;/A&gt; &lt;A href="http://software.intel.com/en-us/forums/topic/392243#"&gt;Top&lt;/A&gt;&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;A href="http://software.intel.com/en-us/flag/flag/node_spam/392243?destination=node/392243&amp;amp;token=5HtWu2StfhfdX12GBDC-oalC6nghQqvH14bmGptqDSQ"&gt;Flag as spam&lt;/A&gt;&amp;nbsp;&lt;/LI&gt;
&lt;LI&gt;&lt;A href="http://software.intel.com/en-us/flag/flag/node_inappropriate/392243?destination=node/392243&amp;amp;token=5HtWu2StfhfdX12GBDC-oalC6nghQqvH14bmGptqDSQ"&gt;Flag as inappropriate&lt;/A&gt;&amp;nbsp;&lt;/LI&gt;
&lt;/UL&gt;</description>
      <pubDate>Tue, 07 May 2013 03:00:32 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Error-mesage-when-running-Intel-Optimized-LINPACK-Benchmark-for/m-p/950782#M15193</guid>
      <dc:creator>Tinway_Chen</dc:creator>
      <dc:date>2013-05-07T03:00:32Z</dc:date>
    </item>
    <item>
      <title>   Could you please indicate</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Error-mesage-when-running-Intel-Optimized-LINPACK-Benchmark-for/m-p/950783#M15194</link>
      <description>&lt;P&gt;&amp;nbsp;&amp;nbsp; Could you please indicate which binary you are running?&amp;nbsp; There are multiple binaries for different sorts of configurations, all of which are LINPACK in one form or another.&lt;/P&gt;</description>
      <pubDate>Fri, 14 Jun 2013 22:15:06 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Error-mesage-when-running-Intel-Optimized-LINPACK-Benchmark-for/m-p/950783#M15194</guid>
      <dc:creator>Gregory_H_Intel</dc:creator>
      <dc:date>2013-06-14T22:15:06Z</dc:date>
    </item>
  </channel>
</rss>

