<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Linpack Benchmark Only Running On 2 of 4 cores in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Linpack-Benchmark-Only-Running-On-2-of-4-cores/m-p/897015#M10946</link>
    <description>Sorry if this is rather is rather trivial but I'm having some difficulties getting the benchmark to run across all four cores of a twin socket dual-core machine.&lt;BR /&gt;&lt;BR /&gt;I am attempting to benchmark a Dell Poweredge SC1425 it has a couple of dual core Xeons (I have posted the output from /proc/cpuinfo at the bottom) and am running the file:&lt;BR /&gt;&lt;BR /&gt;/linpack_10.2.0/benchmarks/linpack/lininput_xeon32&lt;BR /&gt;&lt;BR /&gt;The file is as it came, i.e. unedited (yes, you are dealing with someone that ignorant). Top is showing:&lt;BR /&gt;&lt;BR /&gt;top - 14:44:10 up 23:56, 3 users, load average: 1.41, 0.99, 1.06&lt;BR /&gt;Tasks: 79 total, 3 running, 76 sleeping, 0 stopped, 0 zombie&lt;BR /&gt;Cpu0 :100.0%us, 0.0%sy, 0.0%ni, 0.0%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st&lt;BR /&gt;Cpu1 : 0.0%us, 0.0%sy, 0.0%ni,100.0%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st&lt;BR /&gt;Cpu2 :100.0%us, 0.0%sy, 0.0%ni, 0.0%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st&lt;BR /&gt;Cpu3 : 0.3%us, 0.3%sy, 0.0%ni, 99.3%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st&lt;BR /&gt;Mem: 2076120k total, 1903244k used, 172876k free, 23536k buffers&lt;BR /&gt;Swap: 4008120k total, 0k used, 4008120k free, 87384k cached&lt;BR /&gt;&lt;BR /&gt; PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND&lt;BR /&gt;16148 root 20 0 1727m 1.7g 656 R 100 84.9 0:50.63 xlinpack_xeon32&lt;BR /&gt;16220 root 20 0 1727m 1.7g 656 R 100 84.9 0:27.24 xlinpack_xeon32&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;and ps:&lt;BR /&gt;&lt;BR /&gt;root 3086 0.0 0.1 8160 2708 ? Ss 13:32 0:02 \_ sshd: root@pts/0&lt;BR /&gt;root 3094 0.0 0.1 5132 2688 pts/0 Ss 13:32 0:01 | \_ -bash&lt;BR /&gt;root 14143 0.1 0.0 3848 1172 pts/0 S+ 14:32 0:00 | \_ /bin/bash ./runme_xeon32&lt;BR /&gt;root 14146 99.8 84.9 1769204 1763468 pts/0 R+ 14:32 1:28 | \_ ./xlinpack_xeon32 lininput_xeon32&lt;BR /&gt;root 14217 0.0 84.9 1769204 1763468 pts/0 S+ 14:32 0:00 | \_ ./xlinpack_xeon32 lininput_xeon32&lt;BR /&gt;root 14218 0.0 84.9 1769204 1763468 pts/0 S+ 14:32 0:00 | \_ ./xlinpack_xeon32 lininput_xeon32&lt;BR /&gt;root 14219 100 84.9 1769204 1763468 pts/0 R+ 14:32 1:05 | \_ ./xlinpack_xeon32 lininput_xeon32&lt;BR /&gt;&lt;BR /&gt;However, the top of the output file reads:&lt;BR /&gt;&lt;BR /&gt;Wed Aug 12 14:32:24 UTC 2009&lt;BR /&gt;Intel LINPACK data&lt;BR /&gt;&lt;BR /&gt;Current date/time: Wed Aug 12 14:32:24 2009&lt;BR /&gt;&lt;BR /&gt;CPU frequency: 2.800 GHz&lt;BR /&gt;Number of CPUs: 4&lt;BR /&gt;Number of threads: 4&lt;BR /&gt;&lt;BR /&gt;Which looks like it got the number or cores correct but as far as I can tell it is only executing on two of them in two threads. I have tried setting the OMP_NUM_THREADS to 4 with no effect.&lt;BR /&gt;&lt;BR /&gt;I would really apprecaite any help you could provide. And apologies in advance if I've been really dense.&lt;BR /&gt;&lt;BR /&gt;Thanks&lt;BR /&gt;Elliot&lt;BR /&gt;&lt;BR /&gt;CPU info follows:&lt;BR /&gt;&lt;BR /&gt;processor : 0 &lt;BR /&gt;vendor_id : GenuineIntel &lt;BR /&gt;cpu family : 15 &lt;BR /&gt;model : 4 &lt;BR /&gt;model name : Intel Xeon CPU 2.80GHz &lt;BR /&gt;stepping : 3 &lt;BR /&gt;cpu MHz : 2800.328 &lt;BR /&gt;cache size : 2048 KB &lt;BR /&gt;physical id : 0 &lt;BR /&gt;siblings : 2 &lt;BR /&gt;core id : 0 &lt;BR /&gt;cpu cores : 1 &lt;BR /&gt;apicid : 0 &lt;BR /&gt;initial apicid : 0 &lt;BR /&gt;fdiv_bug : no &lt;BR /&gt;hlt_bug : no &lt;BR /&gt;f00f_bug : no &lt;BR /&gt;coma_bug : no &lt;BR /&gt;fpu : yes &lt;BR /&gt;fpu_exception : yes &lt;BR /&gt;cpuid level : 5 &lt;BR /&gt;wp : yes &lt;BR /&gt;flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe nx lm constant_tsc pebs bts pni monitor ds_cpl cid cx16 xtpr &lt;BR /&gt;bogomips : 5605.43 &lt;BR /&gt;clflush size : 64 &lt;BR /&gt;power management: &lt;BR /&gt;&lt;BR /&gt;processor : 1&lt;BR /&gt;vendor_id : GenuineIntel&lt;BR /&gt;cpu family : 15 &lt;BR /&gt;model : 4 &lt;BR /&gt;model name : Intel Xeon CPU 2.80GHz&lt;BR /&gt;stepping : 3 &lt;BR /&gt;cpu MHz : 2800.328 &lt;BR /&gt;cache size : 2048 KB &lt;BR /&gt;physical id : 3 &lt;BR /&gt;siblings : 2 &lt;BR /&gt;core id : 0 &lt;BR /&gt;cpu cores : 1 &lt;BR /&gt;apicid : 6 &lt;BR /&gt;initial apicid : 6 &lt;BR /&gt;fdiv_bug : no &lt;BR /&gt;hlt_bug : no &lt;BR /&gt;f00f_bug : no &lt;BR /&gt;coma_bug : no &lt;BR /&gt;fpu : yes &lt;BR /&gt;fpu_exception : yes &lt;BR /&gt;cpuid level : 5 &lt;BR /&gt;wp : yes &lt;BR /&gt;flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe nx lm constant_tsc pebs bts pni monitor ds_cpl cid cx16 xtpr &lt;BR /&gt;bogomips : 5600.29 &lt;BR /&gt;clflush size : 64 &lt;BR /&gt;power management: &lt;BR /&gt;&lt;BR /&gt;processor : 2&lt;BR /&gt;vendor_id : GenuineIntel&lt;BR /&gt;cpu family : 15 &lt;BR /&gt;model : 4 &lt;BR /&gt;model name : Intel Xeon CPU 2.80GHz&lt;BR /&gt;stepping : 3 &lt;BR /&gt;cpu MHz : 2800.328 &lt;BR /&gt;cache size : 2048 KB &lt;BR /&gt;physical id : 0 &lt;BR /&gt;siblings : 2 &lt;BR /&gt;core id : 0 &lt;BR /&gt;cpu cores : 1 &lt;BR /&gt;apicid : 1 &lt;BR /&gt;initial apicid : 1 &lt;BR /&gt;fdiv_bug : no &lt;BR /&gt;hlt_bug : no &lt;BR /&gt;f00f_bug : no &lt;BR /&gt;coma_bug : no &lt;BR /&gt;fpu : yes&lt;BR /&gt;fpu_exception : yes&lt;BR /&gt;cpuid level : 5&lt;BR /&gt;wp : yes&lt;BR /&gt;flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe nx lm constant_tsc pebs bts pni monitor ds_cpl cid cx16 xtpr&lt;BR /&gt;bogomips : 5600.24&lt;BR /&gt;clflush size : 64&lt;BR /&gt;power management:&lt;BR /&gt;&lt;BR /&gt;processor : 3&lt;BR /&gt;vendor_id : GenuineIntel&lt;BR /&gt;cpu family : 15&lt;BR /&gt;model : 4&lt;BR /&gt;model name : Intel Xeon CPU 2.80GHz&lt;BR /&gt;stepping : 3&lt;BR /&gt;cpu MHz : 2800.328&lt;BR /&gt;cache size : 2048 KB&lt;BR /&gt;physical id : 3&lt;BR /&gt;siblings : 2&lt;BR /&gt;core id : 0&lt;BR /&gt;cpu cores : 1&lt;BR /&gt;apicid : 7&lt;BR /&gt;initial apicid : 7&lt;BR /&gt;fdiv_bug : no&lt;BR /&gt;hlt_bug : no&lt;BR /&gt;f00f_bug : no&lt;BR /&gt;coma_bug : no&lt;BR /&gt;fpu : yes&lt;BR /&gt;fpu_exception : yes&lt;BR /&gt;cpuid level : 5&lt;BR /&gt;wp : yes&lt;BR /&gt;flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe nx lm constant_tsc pebs bts pni monitor ds_cpl cid cx16 xtpr&lt;BR /&gt;bogomips : 5600.25&lt;BR /&gt;clflush size : 64&lt;BR /&gt;power management:&lt;BR /&gt;</description>
    <pubDate>Wed, 12 Aug 2009 14:46:25 GMT</pubDate>
    <dc:creator>elliotc</dc:creator>
    <dc:date>2009-08-12T14:46:25Z</dc:date>
    <item>
      <title>Linpack Benchmark Only Running On 2 of 4 cores</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Linpack-Benchmark-Only-Running-On-2-of-4-cores/m-p/897015#M10946</link>
      <description>Sorry if this is rather is rather trivial but I'm having some difficulties getting the benchmark to run across all four cores of a twin socket dual-core machine.&lt;BR /&gt;&lt;BR /&gt;I am attempting to benchmark a Dell Poweredge SC1425 it has a couple of dual core Xeons (I have posted the output from /proc/cpuinfo at the bottom) and am running the file:&lt;BR /&gt;&lt;BR /&gt;/linpack_10.2.0/benchmarks/linpack/lininput_xeon32&lt;BR /&gt;&lt;BR /&gt;The file is as it came, i.e. unedited (yes, you are dealing with someone that ignorant). Top is showing:&lt;BR /&gt;&lt;BR /&gt;top - 14:44:10 up 23:56, 3 users, load average: 1.41, 0.99, 1.06&lt;BR /&gt;Tasks: 79 total, 3 running, 76 sleeping, 0 stopped, 0 zombie&lt;BR /&gt;Cpu0 :100.0%us, 0.0%sy, 0.0%ni, 0.0%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st&lt;BR /&gt;Cpu1 : 0.0%us, 0.0%sy, 0.0%ni,100.0%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st&lt;BR /&gt;Cpu2 :100.0%us, 0.0%sy, 0.0%ni, 0.0%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st&lt;BR /&gt;Cpu3 : 0.3%us, 0.3%sy, 0.0%ni, 99.3%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st&lt;BR /&gt;Mem: 2076120k total, 1903244k used, 172876k free, 23536k buffers&lt;BR /&gt;Swap: 4008120k total, 0k used, 4008120k free, 87384k cached&lt;BR /&gt;&lt;BR /&gt; PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND&lt;BR /&gt;16148 root 20 0 1727m 1.7g 656 R 100 84.9 0:50.63 xlinpack_xeon32&lt;BR /&gt;16220 root 20 0 1727m 1.7g 656 R 100 84.9 0:27.24 xlinpack_xeon32&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;and ps:&lt;BR /&gt;&lt;BR /&gt;root 3086 0.0 0.1 8160 2708 ? Ss 13:32 0:02 \_ sshd: root@pts/0&lt;BR /&gt;root 3094 0.0 0.1 5132 2688 pts/0 Ss 13:32 0:01 | \_ -bash&lt;BR /&gt;root 14143 0.1 0.0 3848 1172 pts/0 S+ 14:32 0:00 | \_ /bin/bash ./runme_xeon32&lt;BR /&gt;root 14146 99.8 84.9 1769204 1763468 pts/0 R+ 14:32 1:28 | \_ ./xlinpack_xeon32 lininput_xeon32&lt;BR /&gt;root 14217 0.0 84.9 1769204 1763468 pts/0 S+ 14:32 0:00 | \_ ./xlinpack_xeon32 lininput_xeon32&lt;BR /&gt;root 14218 0.0 84.9 1769204 1763468 pts/0 S+ 14:32 0:00 | \_ ./xlinpack_xeon32 lininput_xeon32&lt;BR /&gt;root 14219 100 84.9 1769204 1763468 pts/0 R+ 14:32 1:05 | \_ ./xlinpack_xeon32 lininput_xeon32&lt;BR /&gt;&lt;BR /&gt;However, the top of the output file reads:&lt;BR /&gt;&lt;BR /&gt;Wed Aug 12 14:32:24 UTC 2009&lt;BR /&gt;Intel LINPACK data&lt;BR /&gt;&lt;BR /&gt;Current date/time: Wed Aug 12 14:32:24 2009&lt;BR /&gt;&lt;BR /&gt;CPU frequency: 2.800 GHz&lt;BR /&gt;Number of CPUs: 4&lt;BR /&gt;Number of threads: 4&lt;BR /&gt;&lt;BR /&gt;Which looks like it got the number or cores correct but as far as I can tell it is only executing on two of them in two threads. I have tried setting the OMP_NUM_THREADS to 4 with no effect.&lt;BR /&gt;&lt;BR /&gt;I would really apprecaite any help you could provide. And apologies in advance if I've been really dense.&lt;BR /&gt;&lt;BR /&gt;Thanks&lt;BR /&gt;Elliot&lt;BR /&gt;&lt;BR /&gt;CPU info follows:&lt;BR /&gt;&lt;BR /&gt;processor : 0 &lt;BR /&gt;vendor_id : GenuineIntel &lt;BR /&gt;cpu family : 15 &lt;BR /&gt;model : 4 &lt;BR /&gt;model name : Intel Xeon CPU 2.80GHz &lt;BR /&gt;stepping : 3 &lt;BR /&gt;cpu MHz : 2800.328 &lt;BR /&gt;cache size : 2048 KB &lt;BR /&gt;physical id : 0 &lt;BR /&gt;siblings : 2 &lt;BR /&gt;core id : 0 &lt;BR /&gt;cpu cores : 1 &lt;BR /&gt;apicid : 0 &lt;BR /&gt;initial apicid : 0 &lt;BR /&gt;fdiv_bug : no &lt;BR /&gt;hlt_bug : no &lt;BR /&gt;f00f_bug : no &lt;BR /&gt;coma_bug : no &lt;BR /&gt;fpu : yes &lt;BR /&gt;fpu_exception : yes &lt;BR /&gt;cpuid level : 5 &lt;BR /&gt;wp : yes &lt;BR /&gt;flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe nx lm constant_tsc pebs bts pni monitor ds_cpl cid cx16 xtpr &lt;BR /&gt;bogomips : 5605.43 &lt;BR /&gt;clflush size : 64 &lt;BR /&gt;power management: &lt;BR /&gt;&lt;BR /&gt;processor : 1&lt;BR /&gt;vendor_id : GenuineIntel&lt;BR /&gt;cpu family : 15 &lt;BR /&gt;model : 4 &lt;BR /&gt;model name : Intel Xeon CPU 2.80GHz&lt;BR /&gt;stepping : 3 &lt;BR /&gt;cpu MHz : 2800.328 &lt;BR /&gt;cache size : 2048 KB &lt;BR /&gt;physical id : 3 &lt;BR /&gt;siblings : 2 &lt;BR /&gt;core id : 0 &lt;BR /&gt;cpu cores : 1 &lt;BR /&gt;apicid : 6 &lt;BR /&gt;initial apicid : 6 &lt;BR /&gt;fdiv_bug : no &lt;BR /&gt;hlt_bug : no &lt;BR /&gt;f00f_bug : no &lt;BR /&gt;coma_bug : no &lt;BR /&gt;fpu : yes &lt;BR /&gt;fpu_exception : yes &lt;BR /&gt;cpuid level : 5 &lt;BR /&gt;wp : yes &lt;BR /&gt;flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe nx lm constant_tsc pebs bts pni monitor ds_cpl cid cx16 xtpr &lt;BR /&gt;bogomips : 5600.29 &lt;BR /&gt;clflush size : 64 &lt;BR /&gt;power management: &lt;BR /&gt;&lt;BR /&gt;processor : 2&lt;BR /&gt;vendor_id : GenuineIntel&lt;BR /&gt;cpu family : 15 &lt;BR /&gt;model : 4 &lt;BR /&gt;model name : Intel Xeon CPU 2.80GHz&lt;BR /&gt;stepping : 3 &lt;BR /&gt;cpu MHz : 2800.328 &lt;BR /&gt;cache size : 2048 KB &lt;BR /&gt;physical id : 0 &lt;BR /&gt;siblings : 2 &lt;BR /&gt;core id : 0 &lt;BR /&gt;cpu cores : 1 &lt;BR /&gt;apicid : 1 &lt;BR /&gt;initial apicid : 1 &lt;BR /&gt;fdiv_bug : no &lt;BR /&gt;hlt_bug : no &lt;BR /&gt;f00f_bug : no &lt;BR /&gt;coma_bug : no &lt;BR /&gt;fpu : yes&lt;BR /&gt;fpu_exception : yes&lt;BR /&gt;cpuid level : 5&lt;BR /&gt;wp : yes&lt;BR /&gt;flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe nx lm constant_tsc pebs bts pni monitor ds_cpl cid cx16 xtpr&lt;BR /&gt;bogomips : 5600.24&lt;BR /&gt;clflush size : 64&lt;BR /&gt;power management:&lt;BR /&gt;&lt;BR /&gt;processor : 3&lt;BR /&gt;vendor_id : GenuineIntel&lt;BR /&gt;cpu family : 15&lt;BR /&gt;model : 4&lt;BR /&gt;model name : Intel Xeon CPU 2.80GHz&lt;BR /&gt;stepping : 3&lt;BR /&gt;cpu MHz : 2800.328&lt;BR /&gt;cache size : 2048 KB&lt;BR /&gt;physical id : 3&lt;BR /&gt;siblings : 2&lt;BR /&gt;core id : 0&lt;BR /&gt;cpu cores : 1&lt;BR /&gt;apicid : 7&lt;BR /&gt;initial apicid : 7&lt;BR /&gt;fdiv_bug : no&lt;BR /&gt;hlt_bug : no&lt;BR /&gt;f00f_bug : no&lt;BR /&gt;coma_bug : no&lt;BR /&gt;fpu : yes&lt;BR /&gt;fpu_exception : yes&lt;BR /&gt;cpuid level : 5&lt;BR /&gt;wp : yes&lt;BR /&gt;flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe nx lm constant_tsc pebs bts pni monitor ds_cpl cid cx16 xtpr&lt;BR /&gt;bogomips : 5600.25&lt;BR /&gt;clflush size : 64&lt;BR /&gt;power management:&lt;BR /&gt;</description>
      <pubDate>Wed, 12 Aug 2009 14:46:25 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Linpack-Benchmark-Only-Running-On-2-of-4-cores/m-p/897015#M10946</guid>
      <dc:creator>elliotc</dc:creator>
      <dc:date>2009-08-12T14:46:25Z</dc:date>
    </item>
    <item>
      <title>Re: Linpack Benchmark Only Running On 2 of 4 cores</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Linpack-Benchmark-Only-Running-On-2-of-4-cores/m-p/897016#M10947</link>
      <description>&lt;BR /&gt;It may be a same issue discribledas &lt;A href="http://software.intel.com/en-us/forums/showthread.php?t=67622"&gt;Intel MKL threading behavior on Hyper-Threading systems&lt;/A&gt;.&lt;BR /&gt;&lt;BR /&gt;Regards,&lt;BR /&gt;Ying</description>
      <pubDate>Fri, 14 Aug 2009 05:56:00 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Linpack-Benchmark-Only-Running-On-2-of-4-cores/m-p/897016#M10947</guid>
      <dc:creator>Ying_H_Intel</dc:creator>
      <dc:date>2009-08-14T05:56:00Z</dc:date>
    </item>
  </channel>
</rss>

