<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Hybrid mode faster than just mpi in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1446411#M34160</link>
    <description>&lt;P&gt;Hi Laurence,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Ok, I think then the processor differs in our case and hence the difference in results.&lt;/P&gt;
&lt;P&gt;Here is the output of lscpu&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;Architecture:    x86_64
CPU op-mode(s):   32-bit, 64-bit
Byte Order:     Little Endian
CPU(s):       144
On-line CPU(s) list: 0-143
Thread(s) per core: 2
Core(s) per socket: 36
Socket(s):      2
NUMA node(s):    2
Vendor ID:      GenuineIntel
CPU family:     6
Model:        106
Model name:     Intel(R) Xeon(R) Platinum 8360Y CPU @ 2.40GHz
Stepping:      6
CPU MHz:       2400.000
CPU max MHz:     2401.0000
CPU min MHz:     800.0000
BogoMIPS:      4800.00
L1d cache:      48K
L1i cache:      32K
L2 cache:      1280K
L3 cache:      55296K
NUMA node0 CPU(s):  0-35,72-107
NUMA node1 CPU(s):  36-71,108-143
Flags:        fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb cat_l3 invpcid_single ssbd mba ibrs ibpb stibp ibrs_enhanced fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid cqm rdt_a avx512f avx512dq rdseed adx avx512ifma clflushopt clwb intel_pt avx512cd sha_ni avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local split_lock_detect wbnoinvd dtherm ida arat pln pts hwp hwp_act_window hwp_epp hwp_pkg_req avx512vbmi umip pku ospke avx512_vbmi2 gfni vaes vpclmulqdq avx512_vnni avx512_bitalg tme avx512_vpopcntdq la57 rdpid fsrm md_clear pconfig flush_l1d arch_capabilities&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Please let me know if you think the issue is specific to only the Intel Xeon gold CPU.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;&amp;gt;&amp;gt;Do you have a (very) slow NFS mounted disk&amp;nbsp;...&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;Also could you please let me know if there are any metrics/parameters to know if it is slow?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Regards,&lt;/P&gt;
&lt;P&gt;Vidya.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Fri, 13 Jan 2023 10:15:22 GMT</pubDate>
    <dc:creator>VidyalathaB_Intel</dc:creator>
    <dc:date>2023-01-13T10:15:22Z</dc:date>
    <item>
      <title>Hybrid mode faster than just mpi</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1430942#M33879</link>
      <description>&lt;P&gt;I am using a code (Wien2k) which extensively exploits lapavck/scalapack via the mkl library, and can also work in hybrid mode with openmp+mpi. In my prior experience, and that of others, the hybrid mode with 2 openmp threads was slightly slower, perhaps 10%.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;With a 64 core Gold 6338 it is very different, with 2 openmp &amp;amp; the rest mpi ~1.6 times faster! I cannot explain this, and I am wondering whether this somehow relates to the architecture or is a bug with using all 64 mpi.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;For reference I am using 2021.1.1 versions of mkl/compiler/impi as later ones don't work for reasons I have not been able to determine (large program for matrix eigensolving hangs).&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I can provide a way to reproduce this, but it would involve transferring a large code &amp;amp; some control files.&lt;/P&gt;</description>
      <pubDate>Thu, 17 Nov 2022 14:21:06 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1430942#M33879</guid>
      <dc:creator>L__D__Marks</dc:creator>
      <dc:date>2022-11-17T14:21:06Z</dc:date>
    </item>
    <item>
      <title>Re:Hybrid mode faster than just mpi</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1431249#M33885</link>
      <description>&lt;P&gt;Hi Laurence,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Thanks for reaching out to us.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;&amp;gt;&amp;gt;I can provide a way to reproduce this..&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;It would be a great help if you can provide us with a sample reproducer code and steps to reproduce it so that we can check it from our end as well.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Vidya.&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Fri, 18 Nov 2022 11:02:21 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1431249#M33885</guid>
      <dc:creator>VidyalathaB_Intel</dc:creator>
      <dc:date>2022-11-18T11:02:21Z</dc:date>
    </item>
    <item>
      <title>Re: Hybrid mode faster than just mpi</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1431330#M33890</link>
      <description>&lt;P&gt;The attached can hopefully be used to reproduce the issue. Use tar -xj (bzip2) to decompress, then look at the README file inside. Contact me if it is not clear or does not want to compile/run.&lt;/P&gt;</description>
      <pubDate>Fri, 18 Nov 2022 15:59:02 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1431330#M33890</guid>
      <dc:creator>L__D__Marks</dc:creator>
      <dc:date>2022-11-18T15:59:02Z</dc:date>
    </item>
    <item>
      <title>Re: Hybrid mode faster than just mpi</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1432426#M33906</link>
      <description>&lt;P&gt;The Intel version is 2021.1.1 . Later versions have worse problems, hanging with no information.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;$ uname -a&lt;/P&gt;
&lt;P&gt;Linux qnode1058 3.10.0-1160.71.1.el7.x86_64 #1 SMP Wed Jun 15 08:55:08 UTC 2022 x86_64 x86_64 x86_64&lt;/P&gt;
&lt;P&gt;&amp;nbsp;GNU/Linux&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;[lma712@qnode1058 ~]$ head -20 /proc/cpuinfo&lt;BR /&gt;processor : 0&lt;BR /&gt;vendor_id : GenuineIntel&lt;BR /&gt;cpu family : 6&lt;BR /&gt;model : 106&lt;BR /&gt;model name : Intel(R) Xeon(R) Gold 6338 CPU @ 2.00GHz&lt;BR /&gt;stepping : 6&lt;BR /&gt;microcode : 0xd000363&lt;BR /&gt;cpu MHz : 2000.000&lt;BR /&gt;cache size : 49152 KB&lt;BR /&gt;physical id : 0&lt;BR /&gt;siblings : 32&lt;BR /&gt;core id : 0&lt;BR /&gt;cpu cores : 32&lt;BR /&gt;apicid : 0&lt;BR /&gt;initial apicid : 0&lt;BR /&gt;fpu : yes&lt;BR /&gt;fpu_exception : yes&lt;BR /&gt;cpuid level : 27&lt;BR /&gt;wp : yes&lt;/P&gt;</description>
      <pubDate>Wed, 23 Nov 2022 07:55:26 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1432426#M33906</guid>
      <dc:creator>L__D__Marks</dc:creator>
      <dc:date>2022-11-23T07:55:26Z</dc:date>
    </item>
    <item>
      <title>Re:Hybrid mode faster than just mpi</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1433545#M33915</link>
      <description>&lt;P&gt;Hi Laurence,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Thanks for providing us with the details.&lt;/P&gt;&lt;P&gt;Could you please let us know if there are any additional dependencies needed to be installed here in this case?&lt;/P&gt;&lt;P&gt;If yes, please provide us with the required information so that we can proceed further in this case.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Vidya.&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Mon, 28 Nov 2022 08:19:59 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1433545#M33915</guid>
      <dc:creator>VidyalathaB_Intel</dc:creator>
      <dc:date>2022-11-28T08:19:59Z</dc:date>
    </item>
    <item>
      <title>Re: Hybrid mode faster than just mpi</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1433585#M33919</link>
      <description>&lt;P&gt;Nothing beyond ifort/icc/impi, tcsh &amp;amp; standard Linux.&lt;/P&gt;</description>
      <pubDate>Mon, 28 Nov 2022 11:29:11 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1433585#M33919</guid>
      <dc:creator>L__D__Marks</dc:creator>
      <dc:date>2022-11-28T11:29:11Z</dc:date>
    </item>
    <item>
      <title>Re: Hybrid mode faster than just mpi</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1433625#M33922</link>
      <description>&lt;P&gt;For reference, I am seeing the 6338 as about 1.5 times&amp;nbsp;&lt;EM&gt;slower&lt;/EM&gt; (normalized to the number of cores) than a 6130 in pure mpi, with a speedup of about 1.75 using hybrid.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I spoke to a colleague in Cambridge, UK and they have seen something similar, in fact&amp;nbsp;&lt;EM&gt;far&lt;/EM&gt; worse -- it can be a factor of 10. You are probably going to hear multiple grumbles from major supercomputer users around the world on similar issues.&lt;/P&gt;</description>
      <pubDate>Mon, 28 Nov 2022 15:34:44 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1433625#M33922</guid>
      <dc:creator>L__D__Marks</dc:creator>
      <dc:date>2022-11-28T15:34:44Z</dc:date>
    </item>
    <item>
      <title>Re: Hybrid mode faster than just mpi</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1435170#M33943</link>
      <description>&lt;P&gt;Hi Laurence,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I tried following the steps provided in README and this is the error I'm getting when trying to run this step&amp;nbsp;&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;x lapw1 -p -orb -up&lt;/LI-CODE&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="VidyalathaB_Intel_0-1669976929335.png" style="width: 400px;"&gt;&lt;img src="https://community.intel.com/t5/image/serverpage/image-id/35722i5E0E03A339038CC4/image-size/medium?v=v2&amp;amp;px=400&amp;amp;whitelist-exif-data=Orientation%2CResolution%2COriginalDefaultFinalSize%2CCopyright" role="button" title="VidyalathaB_Intel_0-1669976929335.png" alt="VidyalathaB_Intel_0-1669976929335.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;Could you please let me know what am i missing here and help me to resolve this issue?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Regards,&lt;/P&gt;
&lt;P&gt;Vidya.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 02 Dec 2022 10:30:36 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1435170#M33943</guid>
      <dc:creator>VidyalathaB_Intel</dc:creator>
      <dc:date>2022-12-02T10:30:36Z</dc:date>
    </item>
    <item>
      <title>Re: Hybrid mode faster than just mpi</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1435202#M33946</link>
      <description>The message "mpirun not found" means that you do not have your PATH setup right. You need to source the oneapi setvars.sh. It will be wise to do "ldd lapw1_mpi" to check you have the right libraries.</description>
      <pubDate>Fri, 02 Dec 2022 13:51:55 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1435202#M33946</guid>
      <dc:creator>L__D__Marks</dc:creator>
      <dc:date>2022-12-02T13:51:55Z</dc:date>
    </item>
    <item>
      <title>Re:Hybrid mode faster than just mpi</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1436395#M33970</link>
      <description>&lt;P&gt;Hi Laurence,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Yes, I've already sourced oneAPI setvars.sh script still I'm getting the mpirun command not found error (from the screenshot in my previous post you can see mpirun --version is working fine as I've already set up oneAPI environment).&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Do you have any idea about this error like is there any script that effects oneAPI environment setup? &lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Please do let me know so that i can proceed further.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Vidya.&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Wed, 07 Dec 2022 07:32:27 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1436395#M33970</guid>
      <dc:creator>VidyalathaB_Intel</dc:creator>
      <dc:date>2022-12-07T07:32:27Z</dc:date>
    </item>
    <item>
      <title>Re: Re:Hybrid mode faster than just mpi</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1436400#M33971</link>
      <description>I do not think that anything should be overwritten in the PATH, but I am sitting in an airport waiting for a flight, so I cannot fully check. I suggest two things:&lt;BR /&gt;1) Before doing "x lapw1 -up -p -orb" do "which mpirun". If mpirun is not found then, you have an incomplete oneapi. You will need to add the impi package.&lt;BR /&gt;2) If you find mpirun, edit lapw1para and after:&lt;BR /&gt;&lt;BR /&gt;#which def-file are we using?&lt;BR /&gt; &lt;BR /&gt;if ($#argv &amp;lt; 1) then&lt;BR /&gt;        echo usage: $0 deffile&lt;BR /&gt;        exit&lt;BR /&gt;endif&lt;BR /&gt;&lt;BR /&gt;Add (line 134) "which mpirun".&lt;BR /&gt;&lt;BR /&gt;Let me know if the first works but the second fails. If that is the case I will try and replicate it myself.</description>
      <pubDate>Wed, 07 Dec 2022 07:57:17 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1436400#M33971</guid>
      <dc:creator>L__D__Marks</dc:creator>
      <dc:date>2022-12-07T07:57:17Z</dc:date>
    </item>
    <item>
      <title>Re:Hybrid mode faster than just mpi</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1438562#M34005</link>
      <description>&lt;P&gt;Hi Laurence,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Could you please let us know how much time it would take approximately to finish this step x lapw1 -up -p -orb? &lt;/P&gt;&lt;P&gt;I tried running it for about 2.5 hrs but still, it kept running.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Vidya.&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Wed, 14 Dec 2022 17:51:59 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1438562#M34005</guid>
      <dc:creator>VidyalathaB_Intel</dc:creator>
      <dc:date>2022-12-14T17:51:59Z</dc:date>
    </item>
    <item>
      <title>Re: Re:Hybrid mode faster than just mpi</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1438637#M34006</link>
      <description>&lt;P&gt;If you have done cp M2 .machines, where M2 is (with nodes edited)&lt;/P&gt;
&lt;P&gt;granularity:1&lt;BR /&gt;omp_lapw1:2&lt;BR /&gt;1:node01:32 node02:32 node03:32&lt;BR /&gt;extrafine&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;That should take 27 minutes. M1 shoul take about 45 minutes. If you used instead&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;granularity:1&lt;BR /&gt;omp_lapw1:1&lt;BR /&gt;1:node01:64&lt;/P&gt;
&lt;P&gt;extrafine&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;In principle it should take about 90 minutes. It will either crash somewhere in impi, or run forever for reasons I do not undestand.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Please ensure that you did not oversubscribed the number of mpi processes, as then they compete/conflict and it may never stop.&lt;/P&gt;</description>
      <pubDate>Wed, 14 Dec 2022 21:30:13 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1438637#M34006</guid>
      <dc:creator>L__D__Marks</dc:creator>
      <dc:date>2022-12-14T21:30:13Z</dc:date>
    </item>
    <item>
      <title>Re:Hybrid mode faster than just mpi</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1440951#M34047</link>
      <description>&lt;P&gt;Hi Laurence,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;If possible, could you please try to isolate the issue that you are facing in the form of a sample reproducer code to reproduce the performance issue that you are observing with hybrid mode so that it would be easier to address the issue quickly?&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Vidya.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Thu, 22 Dec 2022 18:23:48 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1440951#M34047</guid>
      <dc:creator>VidyalathaB_Intel</dc:creator>
      <dc:date>2022-12-22T18:23:48Z</dc:date>
    </item>
    <item>
      <title>Re: Hybrid mode faster than just mpi</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1440963#M34048</link>
      <description>I provided you with a "real" reproducer, that represents hard-core scientific computing with multicore parallel programming. It does require some setup of a proper Linux system. That is life.&lt;BR /&gt;&lt;BR /&gt;It is not appropriate to make a toy version, it will not be representative.&lt;BR /&gt;&lt;BR /&gt;I assume that you still cannot get it to run. What is your . machines file? What CPU are you using? What is your network?</description>
      <pubDate>Thu, 22 Dec 2022 19:09:30 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1440963#M34048</guid>
      <dc:creator>L__D__Marks</dc:creator>
      <dc:date>2022-12-22T19:09:30Z</dc:date>
    </item>
    <item>
      <title>Re: Hybrid mode faster than just mpi</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1441903#M34065</link>
      <description>&lt;P&gt;Hi Laurence,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Could you please let me know if I'm on right track in replicating the issue that you are observing?&lt;/P&gt;
&lt;P&gt;Here is the screenshot of the output of x lapw1 -p -orb -up command (5th step in Readme)&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="VidyalathaB_Intel_0-1672138234923.png" style="width: 400px;"&gt;&lt;img src="https://community.intel.com/t5/image/serverpage/image-id/36534iD36BB764B07AB9A5/image-size/medium?v=v2&amp;amp;px=400&amp;amp;whitelist-exif-data=Orientation%2CResolution%2COriginalDefaultFinalSize%2CCopyright" role="button" title="VidyalathaB_Intel_0-1672138234923.png" alt="VidyalathaB_Intel_0-1672138234923.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Here is the screenshot of the output of x lapw1 -p -orb -up command (8th step in Readme)&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="VidyalathaB_Intel_1-1672138313880.png" style="width: 400px;"&gt;&lt;img src="https://community.intel.com/t5/image/serverpage/image-id/36535iD6A45E8CAD99083F/image-size/medium?v=v2&amp;amp;px=400&amp;amp;whitelist-exif-data=Orientation%2CResolution%2COriginalDefaultFinalSize%2CCopyright" role="button" title="VidyalathaB_Intel_1-1672138313880.png" alt="VidyalathaB_Intel_1-1672138313880.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Regards,&lt;/P&gt;
&lt;P&gt;Vidya.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 27 Dec 2022 10:52:08 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1441903#M34065</guid>
      <dc:creator>VidyalathaB_Intel</dc:creator>
      <dc:date>2022-12-27T10:52:08Z</dc:date>
    </item>
    <item>
      <title>Re: Hybrid mode faster than just mpi</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1441908#M34066</link>
      <description>I think you have replicated the issue. After each of the two steps please do "tail *.output1up_1" (see below), which will give a more readable output in the last two lines.&lt;BR /&gt;&lt;BR /&gt;lma712@quser21 PtF]$ tail *output1up_1   0.6516724    0.6517651    0.6518586    0.6521098    0.6522035&lt;BR /&gt;   0.6522145    0.6523540    0.6524470    0.6525784    0.6528393&lt;BR /&gt;         0 EIGENVALUES BELOW THE ENERGY  -11.00000&lt;BR /&gt;    ********************************************************&lt;BR /&gt; &lt;BR /&gt;    NUMBER OF K-POINTS:           2&lt;BR /&gt;===&amp;gt; TOTAL CPU       TIME:   1979.8 (INIT =     16.8 + K-POINTS =   1963.0)&lt;BR /&gt;&amp;gt; SUM OF WALL CLOCK TIMES:   1020.1 (INIT =     17.1 + K-POINTS =   1003.0)&lt;BR /&gt;   Maximum WALL clock time:    2230.08410000001&lt;BR /&gt;   Maximum CPU time:           4212.38736000000&lt;BR /&gt;</description>
      <pubDate>Tue, 27 Dec 2022 11:07:39 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1441908#M34066</guid>
      <dc:creator>L__D__Marks</dc:creator>
      <dc:date>2022-12-27T11:07:39Z</dc:date>
    </item>
    <item>
      <title>Re: Hybrid mode faster than just mpi</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1441949#M34068</link>
      <description>&lt;P&gt;For reference, with just mpi the last two lines I have are&lt;/P&gt;
&lt;P&gt;Maximum WALL clock time: 3520.51289999999&lt;BR /&gt;Maximum CPU time: 3510.76150400000&lt;BR /&gt;&lt;BR /&gt;And combining omp &amp;amp; mpi&lt;BR /&gt;Maximum WALL clock time: 2228.30399999999&lt;BR /&gt;Maximum CPU time: 4237.83427500000&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;WALL is the expired time (in seconds), which is what matters. I expect your numbers to be similar.&lt;/P&gt;</description>
      <pubDate>Tue, 27 Dec 2022 15:53:26 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1441949#M34068</guid>
      <dc:creator>L__D__Marks</dc:creator>
      <dc:date>2022-12-27T15:53:26Z</dc:date>
    </item>
    <item>
      <title>Re: Hybrid mode faster than just mpi</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1442282#M34086</link>
      <description>&lt;P&gt;Hi Laurence,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Could you please check the below screenshot and let me know if this is the expected result (i guess there is some difference)?&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="VidyalathaB_Intel_0-1672247566991.png" style="width: 400px;"&gt;&lt;img src="https://community.intel.com/t5/image/serverpage/image-id/36600i9EF61D6C75384D14/image-size/medium?v=v2&amp;amp;px=400&amp;amp;whitelist-exif-data=Orientation%2CResolution%2COriginalDefaultFinalSize%2CCopyright" role="button" title="VidyalathaB_Intel_0-1672247566991.png" alt="VidyalathaB_Intel_0-1672247566991.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Regards,&lt;/P&gt;
&lt;P&gt;Vidya.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 28 Dec 2022 17:12:54 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1442282#M34086</guid>
      <dc:creator>VidyalathaB_Intel</dc:creator>
      <dc:date>2022-12-28T17:12:54Z</dc:date>
    </item>
    <item>
      <title>Re: Hybrid mode faster than just mpi</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1442452#M34090</link>
      <description>&lt;P&gt;Something is very odd with your numbers as WALL should be only slight larger than CPU with pure MPI, and about 1/2 in hybrid. Thoughts:&lt;/P&gt;
&lt;P&gt;1) Do you have a fast infiniband or or a slow Ethernet interconnect?&lt;/P&gt;
&lt;P&gt;2) Is hyperthreading on (it can get in the way)?&lt;/P&gt;
&lt;P&gt;3) Was anyone else running on those nodes, beyond standard OS?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Please do "grep -ie time Up_1 Up_2" and find a way to get me the output. It might give me a clue.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Currently confused!&lt;/P&gt;</description>
      <pubDate>Thu, 29 Dec 2022 08:11:32 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Hybrid-mode-faster-than-just-mpi/m-p/1442452#M34090</guid>
      <dc:creator>L__D__Marks</dc:creator>
      <dc:date>2022-12-29T08:11:32Z</dc:date>
    </item>
  </channel>
</rss>

