<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Hi Gennedy, in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-with-TBB-threading/m-p/1097734#M23667</link>
    <description>&lt;P&gt;&lt;SPAN style="font-size: 12px; line-height: 18px;"&gt;Hi Gennedy,&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 12px; line-height: 18px;"&gt;I tested it on another computer, &amp;nbsp;crash every time when using tbb.&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 12px; line-height: 18px;"&gt;windows 10, visual studio 2012/2015 update 1, &amp;nbsp;i7 2600 and i7 4770. &amp;nbsp; mkl 11.3.2.1&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 12px; line-height: 18px;"&gt;Ill stay with OMP for now...&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 12px; line-height: 18px;"&gt;tnx&lt;/SPAN&gt;&lt;/P&gt;</description>
    <pubDate>Mon, 04 Apr 2016 17:30:13 GMT</pubDate>
    <dc:creator>Daniel_S_2</dc:creator>
    <dc:date>2016-04-04T17:30:13Z</dc:date>
    <item>
      <title>Pardiso with TBB threading</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-with-TBB-threading/m-p/1097722#M23655</link>
      <description>&lt;P&gt;I'm tryng to use the&amp;nbsp;Pardiso &amp;nbsp;solver with the TBB treading layer.&lt;/P&gt;

&lt;P&gt;It seems that&amp;nbsp;&lt;SPAN style="font-size: 13.008px; line-height: 17.7382px;"&gt;Pardiso &amp;nbsp;&lt;/SPAN&gt;got alot of idle time with OMP in my kind of problems&lt;/P&gt;

&lt;P&gt;this page say that&amp;nbsp;&lt;SPAN style="font-size: 13.008px; line-height: 17.7382px;"&gt;Pardiso &amp;nbsp;supports TBB&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;A href="https://software.intel.com/en-us/articles/using-intel-mkl-and-intel-tbb-in-the-same-application" target="_blank"&gt;https://software.intel.com/en-us/articles/using-intel-mkl-and-intel-tbb-in-the-same-application&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;so I gave it a try&lt;/P&gt;

&lt;P&gt;I'm linking with&lt;/P&gt;

&lt;P&gt;mkl_intel_lp64_dll.lib mkl_core_dll.lib mkl_tbb_thread_dll.lib tbb.lib&lt;/P&gt;

&lt;P&gt;and get single threaded execution. (same result with the static libs)&lt;/P&gt;

&lt;P&gt;I'm using MVSV 2015. &amp;nbsp;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;what am I missing ?&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;tnx&lt;/P&gt;

&lt;P&gt;D&lt;/P&gt;</description>
      <pubDate>Thu, 03 Mar 2016 19:11:48 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-with-TBB-threading/m-p/1097722#M23655</guid>
      <dc:creator>Daniel_S_2</dc:creator>
      <dc:date>2016-03-03T19:11:48Z</dc:date>
    </item>
    <item>
      <title>&gt;&gt; It seems that Pardiso  got</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-with-TBB-threading/m-p/1097723#M23656</link>
      <description>&lt;P&gt;&amp;gt;&amp;gt;&amp;nbsp;&lt;SPAN style="font-size: 12px; line-height: 16.3636px;"&gt;It seems that&amp;nbsp;Pardiso &amp;nbsp;got alot of idle time with OMP in my kind of problems.&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 12px; line-height: 16.3636px;"&gt;&amp;lt;&amp;lt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN style="font-size: 1em; line-height: 1.5;"&gt;what is the problem size? and could you try to take the openmp threaded version and compare the perf results?&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 04 Mar 2016 10:57:46 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-with-TBB-threading/m-p/1097723#M23656</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2016-03-04T10:57:46Z</dc:date>
    </item>
    <item>
      <title>The reference about using MKL</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-with-TBB-threading/m-p/1097724#M23657</link>
      <description>&lt;P&gt;The reference about using MKL with TBB appears to say that certain MKL functions are available in a TBB version, and gives a specific link command for that purpose,different from what you show here.&amp;nbsp; If you use both OpenMP and TBB threading, you will expect that idle OpenMP threads persist for KMP_BLOCKTIME before a TBB thread can run on the same hardware thread.&lt;/P&gt;

&lt;P&gt;If you are following the suggestion about tbb:affinity_partitioner and still using OpenMP as well, you might try some scheme such as limiting TBB threads to 1 per core (if you have enabled HyperThreading), taking advantage of the Intel OpenMP default limit of 1 per core, or specifically pinning OpenMP and TBB threads to different cores.&lt;/P&gt;</description>
      <pubDate>Fri, 04 Mar 2016 13:32:56 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-with-TBB-threading/m-p/1097724#M23657</guid>
      <dc:creator>TimP</dc:creator>
      <dc:date>2016-03-04T13:32:56Z</dc:date>
    </item>
    <item>
      <title>I think I have a direction. </title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-with-TBB-threading/m-p/1097725#M23658</link>
      <description>&lt;P&gt;I think I have a direction.&amp;nbsp;&lt;/P&gt;

&lt;P&gt;for some reason&amp;nbsp;mkl_sequential is loaded even when&amp;nbsp;mkl_tbb_thread_dll.lib is linked.&lt;/P&gt;

&lt;P&gt;so this is the reason for the single threaded times.&lt;/P&gt;

&lt;P&gt;any idea? &amp;nbsp;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;as for the MOP performance,&amp;nbsp;here some info:&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 13.008px; line-height: 17.7382px;"&gt;threads &amp;nbsp;time&lt;/SPAN&gt;&lt;BR style="font-size: 13.008px; line-height: 17.7382px;" /&gt;
	&lt;SPAN style="font-size: 13.008px; line-height: 17.7382px;"&gt;1 &amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;1000&lt;/SPAN&gt;&lt;BR style="font-size: 13.008px; line-height: 17.7382px;" /&gt;
	&lt;SPAN style="font-size: 13.008px; line-height: 17.7382px;"&gt;2 &amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;630&lt;/SPAN&gt;&lt;BR style="font-size: 13.008px; line-height: 17.7382px;" /&gt;
	&lt;SPAN style="font-size: 13.008px; line-height: 17.7382px;"&gt;4 &amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;400&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;problem information:&lt;/P&gt;

&lt;P&gt;0-based array is turned ON&lt;BR /&gt;
	PARDISO double precision computation is turned ON&lt;BR /&gt;
	Parallel METIS algorithm at reorder step is turned ON&lt;BR /&gt;
	Scaling is turned ON&lt;/P&gt;

&lt;P&gt;&lt;BR /&gt;
	Summary: ( reordering phase )&lt;BR /&gt;
	================&lt;/P&gt;

&lt;P&gt;Times:&lt;BR /&gt;
	======&lt;BR /&gt;
	Time spent in calculations of symmetric matrix portrait (fulladj): 0.022640 s&lt;BR /&gt;
	Time spent in reordering of the initial matrix (reorder) &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; : 0.398392 s&lt;BR /&gt;
	Time spent in symbolic factorization (symbfct) &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; : 0.244236 s&lt;BR /&gt;
	Time spent in data preparations for factorization (parlist) &amp;nbsp; &amp;nbsp; &amp;nbsp;: 0.005588 s&lt;BR /&gt;
	Time spent in allocation of internal data structures (malloc) &amp;nbsp; &amp;nbsp;: 0.014131 s&lt;BR /&gt;
	Time spent in additional calculations &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;: 0.170513 s&lt;BR /&gt;
	Total time spent &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; : 0.855501 s&lt;/P&gt;

&lt;P&gt;Statistics:&lt;BR /&gt;
	===========&lt;BR /&gt;
	Parallel Direct Factorization is running on 1 OpenMP&lt;/P&gt;

&lt;P&gt;&amp;lt; Linear system Ax = b &amp;gt;&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;number of equations: &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 258687&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;number of non-zeros in A: &amp;nbsp; &amp;nbsp; &amp;nbsp;2821302&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;number of non-zeros in A (%): 0.004216&lt;/P&gt;

&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;number of right-hand sides: &amp;nbsp; &amp;nbsp;1&lt;/P&gt;

&lt;P&gt;&amp;lt; Factors L and U &amp;gt;&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;number of columns for each panel: 128&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;number of independent subgraphs: &amp;nbsp;0&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;number of supernodes: &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;50167&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;size of largest supernode: &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1041&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;number of non-zeros in L: &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;32337906&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;number of non-zeros in U: &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;1&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;number of non-zeros in L+U: &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;32337907&lt;/P&gt;

&lt;P&gt;&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&lt;BR /&gt;
	&lt;BR /&gt;
	&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 04 Mar 2016 16:26:21 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-with-TBB-threading/m-p/1097725#M23658</guid>
      <dc:creator>Daniel_S_2</dc:creator>
      <dc:date>2016-03-04T16:26:21Z</dc:date>
    </item>
    <item>
      <title>UPDATE:</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-with-TBB-threading/m-p/1097726#M23659</link>
      <description>&lt;P&gt;UPDATE:&lt;/P&gt;

&lt;P&gt;after stripping the project and converting to Intel compiler, &amp;nbsp;&lt;SPAN style="font-size: 12px; line-height: 16.3636px;"&gt;mkl_tbb_thread_dll is loaded but crashes :(&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 12px; line-height: 16.3636px;"&gt;here is the call stack:&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&amp;gt;&amp;nbsp;&amp;nbsp; &amp;nbsp;mkl_tbb_thread.dll!00007ffa1095a067()&amp;nbsp;&amp;nbsp; &amp;nbsp;Unknown&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;tbb.dll!tbb::internal::custom_scheduler&amp;lt;tbb::internal::IntelSchedulerTraits&amp;gt;::local_wait_for_all(tbb::task &amp;amp; parent, tbb::task * child) Line 467&amp;nbsp;&amp;nbsp; &amp;nbsp;C++&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;tbb.dll!tbb::internal::arena::process(tbb::internal::generic_scheduler &amp;amp; s) Line 147&amp;nbsp;&amp;nbsp; &amp;nbsp;C++&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;tbb.dll!tbb::internal::market::process(rml::job &amp;amp; j) Line 677&amp;nbsp;&amp;nbsp; &amp;nbsp;C++&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;tbb.dll!tbb::internal::rml::private_worker::run() Line 276&amp;nbsp;&amp;nbsp; &amp;nbsp;C++&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;tbb.dll!tbb::internal::rml::private_worker::thread_routine(void * arg) Line 229&amp;nbsp;&amp;nbsp; &amp;nbsp;C++&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;ucrtbase.dll!00007ffa3dc982dd()&amp;nbsp;&amp;nbsp; &amp;nbsp;Unknown&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;probably some runtime version incompatibility.&lt;/P&gt;

&lt;P&gt;the tbb runtime is&amp;nbsp;&lt;/P&gt;

&lt;P&gt;compilers_and_libraries_2016.2.180\windows\redist\intel64_win\tbb\vc14\tbb.dll&lt;/P&gt;

&lt;P&gt;tested with vc_mt\tbb.dll&lt;/P&gt;

&lt;P&gt;a simple tbb for loop works fine in the same project.&lt;/P&gt;

&lt;P&gt;it seems that&amp;nbsp;&lt;SPAN style="font-size: 12px; line-height: 16.3636px;"&gt;mkl_tbb_thread_dll&amp;nbsp;&lt;/SPAN&gt;&amp;nbsp;gor for ABI compatibility&amp;nbsp;issues with the tbb runtime&lt;/P&gt;

&lt;P&gt;any idea?&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;tnx&amp;nbsp;&lt;/P&gt;

&lt;P&gt;D&lt;/P&gt;</description>
      <pubDate>Mon, 07 Mar 2016 08:19:10 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-with-TBB-threading/m-p/1097726#M23659</guid>
      <dc:creator>Daniel_S_2</dc:creator>
      <dc:date>2016-03-07T08:19:10Z</dc:date>
    </item>
    <item>
      <title>D.S! How could we reproduce</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-with-TBB-threading/m-p/1097727#M23660</link>
      <description>&lt;P&gt;D.S! How could we reproduce the problem? I checked with some of Pardiso's example and linked with vc14 tbb's dll. no issues were detected.&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 17 Mar 2016 10:43:42 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-with-TBB-threading/m-p/1097727#M23660</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2016-03-17T10:43:42Z</dc:date>
    </item>
    <item>
      <title>Hello Gennady,</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-with-TBB-threading/m-p/1097728#M23661</link>
      <description>&lt;P&gt;Hello&amp;nbsp;Gennady,&lt;/P&gt;

&lt;P&gt;Just got to test it again. &amp;nbsp;the crash is data dependent. &amp;nbsp;tbb work fine with a few test matrices I tried , but crash in phase 11 with some of my data sets.&lt;/P&gt;

&lt;P&gt;for the a diagonal marix with 100000, and a few fandom OD elements tbb was actually a little slower. &amp;nbsp; and phase 11 seems not threaded at all.&lt;/P&gt;

&lt;P&gt;I can send you the data with a simple code that loads it if you need it.&lt;/P&gt;

&lt;P&gt;Daniel&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 21 Mar 2016 14:39:24 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-with-TBB-threading/m-p/1097728#M23661</guid>
      <dc:creator>Daniel_S_2</dc:creator>
      <dc:date>2016-03-21T14:39:24Z</dc:date>
    </item>
    <item>
      <title>Daniel, pls try to set iparm</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-with-TBB-threading/m-p/1097729#M23662</link>
      <description>&lt;P&gt;Daniel, pls try to set iparm[1]=0 instead of iparm[1]=2 (which is default) and check how it will work on your side.&lt;/P&gt;</description>
      <pubDate>Mon, 21 Mar 2016 16:36:06 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-with-TBB-threading/m-p/1097729#M23662</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2016-03-21T16:36:06Z</dc:date>
    </item>
    <item>
      <title>still the same crash... with </title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-with-TBB-threading/m-p/1097730#M23663</link>
      <description>&lt;P&gt;still the same crash... with&amp;nbsp;&lt;SPAN style="font-size: 12px; line-height: 18px;"&gt;&amp;nbsp;iparm[1] = 0,2,3&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 12px; line-height: 18px;"&gt;i'm using 3, that make phase 11 about x2 faster with openmp.&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 12px; line-height: 18px;"&gt;it seems that the data that don't crash is not using tbb threads at all&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 12px; line-height: 18px;"&gt;here is that crash:&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;Exception thrown at 0x00007FFBF459A067 (mkl_tbb_thread.dll) in MklTester.exe: 0xC0000005: Access violation writing location 0x00000025485EC000.&lt;/P&gt;

&lt;P&gt;some time at the main thread some time on &amp;nbsp;a worker thread,&lt;/P&gt;

&lt;P&gt;stack:&lt;/P&gt;

&lt;P&gt;&amp;gt;&amp;nbsp;&amp;nbsp; &amp;nbsp;mkl_tbb_thread.dll!00007ffbf459a067()&amp;nbsp;&amp;nbsp; &amp;nbsp;Unknown&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;tbb.dll!tbb::internal::custom_scheduler&amp;lt;tbb::internal::IntelSchedulerTraits&amp;gt;::local_wait_for_all(tbb::task &amp;amp; parent, tbb::task * child) Line 467&amp;nbsp;&amp;nbsp; &amp;nbsp;C++&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;tbb.dll!tbb::internal::arena::process(tbb::internal::generic_scheduler &amp;amp; s) Line 147&amp;nbsp;&amp;nbsp; &amp;nbsp;C++&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;tbb.dll!tbb::internal::market::process(rml::job &amp;amp; j) Line 677&amp;nbsp;&amp;nbsp; &amp;nbsp;C++&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;tbb.dll!tbb::internal::rml::private_worker::run() Line 276&amp;nbsp;&amp;nbsp; &amp;nbsp;C++&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;tbb.dll!tbb::internal::rml::private_worker::thread_routine(void * arg) Line 229&amp;nbsp;&amp;nbsp; &amp;nbsp;C++&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;ucrtbase.dll!00007ffc1e9482dd()&amp;nbsp;&amp;nbsp; &amp;nbsp;Unknown&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 21 Mar 2016 18:11:06 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-with-TBB-threading/m-p/1097730#M23663</guid>
      <dc:creator>Daniel_S_2</dc:creator>
      <dc:date>2016-03-21T18:11:06Z</dc:date>
    </item>
    <item>
      <title>&gt;&gt; I can send you the data</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-with-TBB-threading/m-p/1097731#M23664</link>
      <description>&lt;P&gt;&lt;SPAN style="font-size: 12px; line-height: 18px;"&gt;&amp;gt;&amp;gt; I can send you the data with a simple code that loads it if you need it.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 1em; line-height: 1.5;"&gt;Daniel, we still don't see the problem on our side with the latest version. &amp;nbsp;&lt;/SPAN&gt;&lt;SPAN style="font-size: 1em; line-height: 1.5;"&gt;Could you please send us these data and the code for reproducing the problem on our side. &lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 1em; line-height: 1.5;"&gt;Thanks, Gennady&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 30 Mar 2016 08:04:55 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-with-TBB-threading/m-p/1097731#M23664</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2016-03-30T08:04:55Z</dc:date>
    </item>
    <item>
      <title>Hi Gennedy,</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-with-TBB-threading/m-p/1097732#M23665</link>
      <description>&lt;P&gt;Hi Gennedy,&lt;/P&gt;

&lt;P&gt;here is the Visual studio 2015 project with data, &amp;nbsp;just unzip, open sln and run.&lt;/P&gt;

&lt;P&gt;the matrix values are empty to make the zip smaller, but same result with true values. &amp;nbsp; runs ok with openmp.&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 30 Mar 2016 09:52:14 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-with-TBB-threading/m-p/1097732#M23665</guid>
      <dc:creator>Daniel_S_2</dc:creator>
      <dc:date>2016-03-30T09:52:14Z</dc:date>
    </item>
    <item>
      <title>Daniel, with regard to</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-with-TBB-threading/m-p/1097733#M23666</link>
      <description>&lt;P&gt;Daniel, with regard to exception with TBB threading. I checked your example with mkl 11.3 u2, and linked with TBB universal vc_mt.dll and with vc12 and vc14. &amp;nbsp;&lt;/P&gt;

&lt;P&gt;I have used the test you provided ( slightly modified by added the&amp;nbsp;mkl_get_version(&amp;amp;Version); &amp;nbsp;function&amp;nbsp;) and &amp;nbsp;compiling launching from command line because MVSC 2015 is not available on my system.&amp;nbsp;&lt;/P&gt;

&lt;P&gt;all cases work fine. Below the output when vc14\tbb.dll is used:&lt;/P&gt;

&lt;P&gt;&lt;BR /&gt;
	..\_Forums\u611238_pardiso_tbb&amp;gt;_5tbb.exe&lt;BR /&gt;
	file mkl-860663123-00z.bin&lt;BR /&gt;
	matrix dim 258687&lt;BR /&gt;
	matrix nnz/2 2821302&lt;BR /&gt;
	64 bits&lt;BR /&gt;
	Major version: &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 11&lt;BR /&gt;
	Minor version: &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 3&lt;BR /&gt;
	Update version: &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;2&lt;BR /&gt;
	Product status: &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Product&lt;BR /&gt;
	Build: &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 20160120&lt;BR /&gt;
	Platform: &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;Intel(R) 64 architecture&lt;BR /&gt;
	Processor optimization: &amp;nbsp;Intel(R) Advanced Vector Extensions 2 (Intel(R) AVX2) enabled processors&lt;BR /&gt;
	================================================================&lt;/P&gt;

&lt;P&gt;num threads 2&lt;/P&gt;

&lt;P&gt;=== PARDISO: solving a symmetric positive definite system ===&lt;BR /&gt;
	1-based array indexing is turned ON&lt;BR /&gt;
	PARDISO double precision computation is turned ON&lt;BR /&gt;
	Parallel METIS algorithm at reorder step is turned ON&lt;BR /&gt;
	Scaling is turned ON&lt;/P&gt;

&lt;P&gt;Summary: ( reordering phase )&lt;BR /&gt;
	================&lt;/P&gt;

&lt;P&gt;Times:&lt;BR /&gt;
	======&lt;BR /&gt;
	Time spent in calculations of symmetric matrix portrait (fulladj): 0.026380 s&lt;BR /&gt;
	Time spent in reordering of the initial matrix (reorder) &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; : 0.522636 s&lt;BR /&gt;
	Time spent in symbolic factorization (symbfct) &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; : 0.264818 s&lt;BR /&gt;
	Time spent in data preparations for factorization (parlist) &amp;nbsp; &amp;nbsp; &amp;nbsp;: 0.006496 s&lt;BR /&gt;
	Time spent in allocation of internal data structures (malloc) &amp;nbsp; &amp;nbsp;: 0.031902 s&lt;BR /&gt;
	Time spent in additional calculations &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;: 0.198542 s&lt;BR /&gt;
	Total time spent &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; : 1.050775 s&lt;/P&gt;

&lt;P&gt;Statistics:&lt;BR /&gt;
	===========&lt;BR /&gt;
	Parallel Direct Factorization is running on 1 OpenMP&lt;/P&gt;

&lt;P&gt;&amp;lt; Linear system Ax = b &amp;gt;&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;number of equations: &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 258687&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;number of non-zeros in A: &amp;nbsp; &amp;nbsp; &amp;nbsp;2821302&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;number of non-zeros in A (%): 0.004216&lt;/P&gt;

&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;number of right-hand sides: &amp;nbsp; &amp;nbsp;1&lt;/P&gt;

&lt;P&gt;&amp;lt; Factors L and U &amp;gt;&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;number of columns for each panel: 128&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;number of independent subgraphs: &amp;nbsp;0&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;number of supernodes: &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;50167&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;size of largest supernode: &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1041&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;number of non-zeros in L: &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;32337906&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;number of non-zeros in U: &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;1&lt;BR /&gt;
	&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;number of non-zeros in L+U: &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;32337907&lt;/P&gt;

&lt;P&gt;&lt;BR /&gt;
	symolic &amp;nbsp; factorization time is &amp;nbsp;1379 ms&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 01 Apr 2016 12:42:00 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-with-TBB-threading/m-p/1097733#M23666</guid>
      <dc:creator>Gennady_F_Intel</dc:creator>
      <dc:date>2016-04-01T12:42:00Z</dc:date>
    </item>
    <item>
      <title>Hi Gennedy,</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-with-TBB-threading/m-p/1097734#M23667</link>
      <description>&lt;P&gt;&lt;SPAN style="font-size: 12px; line-height: 18px;"&gt;Hi Gennedy,&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 12px; line-height: 18px;"&gt;I tested it on another computer, &amp;nbsp;crash every time when using tbb.&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 12px; line-height: 18px;"&gt;windows 10, visual studio 2012/2015 update 1, &amp;nbsp;i7 2600 and i7 4770. &amp;nbsp; mkl 11.3.2.1&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 12px; line-height: 18px;"&gt;Ill stay with OMP for now...&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 12px; line-height: 18px;"&gt;tnx&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 04 Apr 2016 17:30:13 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/Pardiso-with-TBB-threading/m-p/1097734#M23667</guid>
      <dc:creator>Daniel_S_2</dc:creator>
      <dc:date>2016-04-04T17:30:13Z</dc:date>
    </item>
  </channel>
</rss>

