<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic What is the easiest way to in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/OpenMP-very-slow-when-run-outside-of-Visual-Studio/m-p/984055#M17614</link>
    <description>&lt;P&gt;What is the easiest way to handle heap fragmentation in C++?&lt;/P&gt;</description>
    <pubDate>Sat, 13 May 2017 01:34:56 GMT</pubDate>
    <dc:creator>amr_o_1</dc:creator>
    <dc:date>2017-05-13T01:34:56Z</dc:date>
    <item>
      <title>OpenMP very slow when run outside of Visual Studio</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/OpenMP-very-slow-when-run-outside-of-Visual-Studio/m-p/984046#M17605</link>
      <description>&lt;P&gt;Because we use the Intel MKL library, we have to load Intel's OpenMP library (libiomp5md.dll) at run time and exclude vcomp.lib at link time, but we still have to compile and link with VC++. When I run my 64-bit release build directly, part of my code does not fully utilize the cores I specified and runs very slowly. It appears to use multiple cores but may be even slower than it would be on one core. If I attach the same release build to the Visual Studio debugger, without doing anything else, it fully utilizes the cores I specified. Does anybody have any ideas?&lt;/P&gt;
&lt;P&gt;We are using Visual Studio 2010 on Windows 7 Professional. libiomp5md.dll shows a file version of 5.0.2012.803.&lt;/P&gt;</description>
      <pubDate>Thu, 06 Jun 2013 21:50:57 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/OpenMP-very-slow-when-run-outside-of-Visual-Studio/m-p/984046#M17605</guid>
      <dc:creator>Isaac_Liu</dc:creator>
      <dc:date>2013-06-06T21:50:57Z</dc:date>
    </item>
    <item>
      <title>Hi Isaac Liu,</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/OpenMP-very-slow-when-run-outside-of-Visual-Studio/m-p/984047#M17606</link>
      <description>&lt;P&gt;Hi Isaac Liu,&lt;/P&gt;
&lt;P&gt;Does the app contain any #pragma omp directives? If so, does it call MKL from inside OpenMP parallel regions?&lt;/P&gt;
&lt;P&gt;Thanks,&lt;/P&gt;
&lt;P&gt;Evgueni.&lt;/P&gt;</description>
      <pubDate>Fri, 07 Jun 2013 02:21:00 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/OpenMP-very-slow-when-run-outside-of-Visual-Studio/m-p/984047#M17606</guid>
      <dc:creator>Evgueni_P_Intel</dc:creator>
      <dc:date>2013-06-07T02:21:00Z</dc:date>
    </item>
    <item>
      <title>Hi Evgueni,</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/OpenMP-very-slow-when-run-outside-of-Visual-Studio/m-p/984048#M17607</link>
      <description>&lt;P&gt;Hi Evgueni,&lt;/P&gt;
&lt;P&gt;This is a very big application. The part with the issue uses OpenMP but not MKL. Other parts of the application use MKL. My code uses a lot of OpenMP. Most of it works great, and the code in trouble is actually very similar to the other parts.&lt;/P&gt;
&lt;P&gt;Thanks,&lt;/P&gt;
&lt;P&gt;Isaac&lt;/P&gt;</description>
      <pubDate>Fri, 07 Jun 2013 02:59:57 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/OpenMP-very-slow-when-run-outside-of-Visual-Studio/m-p/984048#M17607</guid>
      <dc:creator>Isaac_Liu</dc:creator>
      <dc:date>2013-06-07T02:59:57Z</dc:date>
    </item>
    <item>
      <title>The following links may be of</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/OpenMP-very-slow-when-run-outside-of-Visual-Studio/m-p/984049#M17608</link>
      <description>&lt;P&gt;The following links may be of use in your case, since your app mixes two OpenMP runtime libraries.&lt;/P&gt;
&lt;P&gt;&lt;A href="http://software.intel.com/en-us/forums/topic/293731"&gt;http://software.intel.com/en-us/forums/topic/293731&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&lt;A href="http://software.intel.com/en-us/articles/how-to-use-intelr-compiler-openmp-compatibility-libraries-on-windows/"&gt;http://software.intel.com/en-us/articles/how-to-use-intelr-compiler-openmp-compatibility-libraries-on-windows/&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 07 Jun 2013 03:25:33 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/OpenMP-very-slow-when-run-outside-of-Visual-Studio/m-p/984049#M17608</guid>
      <dc:creator>Evgueni_P_Intel</dc:creator>
      <dc:date>2013-06-07T03:25:33Z</dc:date>
    </item>
    <item>
      <title>As I read the original post,</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/OpenMP-very-slow-when-run-outside-of-Visual-Studio/m-p/984050#M17609</link>
      <description>&lt;P&gt;As I read the original post, the poster already recognizes that vcomp.lib has to be excluded so that only a single OpenMP runtime, the Intel one, is active; that runtime also supports the vcomp calls.&lt;/P&gt;
&lt;P&gt;This raises the possibility of working with KMP_AFFINITY and the number of threads to improve the distribution of work across cores.&lt;/P&gt;
&lt;P&gt;If Intel® Hyper-Threading is active, MKL will use a single thread per core, but you will need to set OMP_NUM_THREADS and KMP_AFFINITY to get a similar effect from the C++ parallel regions, for example:&lt;/P&gt;
&lt;P&gt;KMP_AFFINITY=compact,1,1&lt;/P&gt;
&lt;P&gt;to spread threads out 1 per core.&lt;/P&gt;
&lt;P&gt;I don't know what effects might be produced by transitioning from 1 thread per core in MKL to something different in the C++ code.&lt;/P&gt;
&lt;P&gt;On a two-socket platform, affinity will be particularly important.&lt;/P&gt;</description>
      <pubDate>Fri, 07 Jun 2013 14:42:49 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/OpenMP-very-slow-when-run-outside-of-Visual-Studio/m-p/984050#M17609</guid>
      <dc:creator>TimP</dc:creator>
      <dc:date>2013-06-07T14:42:49Z</dc:date>
    </item>
    <item>
      <title>It is hard to guess what may</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/OpenMP-very-slow-when-run-outside-of-Visual-Studio/m-p/984051#M17610</link>
      <description>&lt;P&gt;It is hard to guess what may be happening without knowing the details of the application. Does the application create threads of its own (I mean non-OpenMP threads)? If it does, resource oversubscription is possible. Some applications gain from setting the environment variable KMP_BLOCKTIME=0, especially in the case of oversubscription, when idle-spinning OpenMP worker threads slow down active OpenMP threads.&lt;/P&gt;
&lt;P&gt;If the problem is different, you can try to create a small reproducer and submit a support request.&lt;/P&gt;
&lt;P&gt;- Andrey&lt;/P&gt;</description>
      <pubDate>Fri, 07 Jun 2013 14:53:10 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/OpenMP-very-slow-when-run-outside-of-Visual-Studio/m-p/984051#M17610</guid>
      <dc:creator>Andrey_C_Intel1</dc:creator>
      <dc:date>2013-06-07T14:53:10Z</dc:date>
    </item>
    <item>
      <title>After some trial and error</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/OpenMP-very-slow-when-run-outside-of-Visual-Studio/m-p/984052#M17611</link>
      <description>&lt;P&gt;After some trial and error, the issue is resolved. Part of my code is called repeatedly, millions of times, and it uses a few local std::vector objects of a data type whose size is in the hundreds of bytes. The memory management should be very simple compared to the complexity of the computations involved, but somehow it brings down the whole process.&lt;/P&gt;</description>
      <pubDate>Tue, 18 Jun 2013 13:09:30 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/OpenMP-very-slow-when-run-outside-of-Visual-Studio/m-p/984052#M17611</guid>
      <dc:creator>Isaac_Liu</dc:creator>
      <dc:date>2013-06-18T13:09:30Z</dc:date>
    </item>
    <item>
      <title>well. thanks for letting us</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/OpenMP-very-slow-when-run-outside-of-Visual-Studio/m-p/984053#M17612</link>
      <description>&lt;P&gt;Well, thanks for letting us know the cause.&lt;/P&gt;</description>
      <pubDate>Thu, 20 Jun 2013 07:18:47 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/OpenMP-very-slow-when-run-outside-of-Visual-Studio/m-p/984053#M17612</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2013-06-20T07:18:47Z</dc:date>
    </item>
    <item>
      <title>&gt;&gt;...Part of my code is</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/OpenMP-very-slow-when-run-outside-of-Visual-Studio/m-p/984054#M17613</link>
      <description>&amp;gt;&amp;gt;...Part of my code is called repeatedly, millions of times, and it uses a few local std::vector objects of a data type
&amp;gt;&amp;gt;whose size is in the hundreds of bytes. The memory management should be very simple compared to the complexity of
&amp;gt;&amp;gt;the computations involved, but somehow it brings down the whole process...

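<![CDATA[The thread does not say what change resolved the slowdown, so the following is only a sketch of one common way to take the allocator out of a hot path like the one quoted above: give each OpenMP thread a single scratch std::vector and reuse it across calls, instead of constructing new vectors on every call. The names process_item and run_all, the buffer size of 16, and the arithmetic are hypothetical placeholders, not the original code.

```cpp
#include <cassert>
#include <vector>

// Hypothetical stand-in for the function that is called millions of times.
// The caller owns the scratch buffer, so once its capacity is large enough
// no heap allocation happens inside this function.
double process_item(int item, std::vector<double>& scratch) {
    scratch.clear();  // resets the size but keeps the allocated capacity
    for (int k = 0; k < 16; ++k) {
        scratch.push_back(static_cast<double>(item + k));
    }
    double sum = 0.0;
    for (double v : scratch) {
        sum += v;
    }
    return sum;
}

// Processes items 0 .. n_items-1 in parallel and returns the combined result.
// If the compiler is run without OpenMP support, the pragmas are ignored and
// the loop simply runs serially with one buffer.
double run_all(int n_items) {
    assert(n_items >= 0);
    double total = 0.0;
    #pragma omp parallel reduction(+ : total)
    {
        // One buffer per thread, allocated once before the loop starts.
        std::vector<double> scratch;
        scratch.reserve(16);

        #pragma omp for
        for (int i = 0; i < n_items; ++i) {
            total += process_item(i, scratch);
        }
    }
    return total;
}
```

With this layout the heap is touched once per thread rather than once per call. Whether that removes the slowdown in a given application is something to confirm by measurement.
]]>
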
It is hard to say exactly what is wrong, but I would assume that there is a problem with heap fragmentation.</description>
      <pubDate>Sat, 22 Jun 2013 23:24:47 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/OpenMP-very-slow-when-run-outside-of-Visual-Studio/m-p/984054#M17613</guid>
      <dc:creator>SergeyKostrov</dc:creator>
      <dc:date>2013-06-22T23:24:47Z</dc:date>
    </item>
    <item>
      <title>What is the easiest way to</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/OpenMP-very-slow-when-run-outside-of-Visual-Studio/m-p/984055#M17614</link>
      <description>&lt;P&gt;What is the easiest way to handle heap fragmentation in C++?&lt;/P&gt;</description>
      <pubDate>Sat, 13 May 2017 01:34:56 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/OpenMP-very-slow-when-run-outside-of-Visual-Studio/m-p/984055#M17614</guid>
      <dc:creator>amr_o_1</dc:creator>
      <dc:date>2017-05-13T01:34:56Z</dc:date>
    </item>
    <item>
      <title>&gt;&gt;...What is the easiest way</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/OpenMP-very-slow-when-run-outside-of-Visual-Studio/m-p/984056#M17615</link>
      <description>&amp;gt;&amp;gt;...What is the easiest way to handle heap fragmentation in C++...

Don't use the STL if you can avoid it.</description>
      <pubDate>Mon, 15 May 2017 18:48:34 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/OpenMP-very-slow-when-run-outside-of-Visual-Studio/m-p/984056#M17615</guid>
      <dc:creator>SergeyKostrov</dc:creator>
      <dc:date>2017-05-15T18:48:34Z</dc:date>
    </item>
  </channel>
</rss>

