<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic How to use multiprocessors? in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/How-to-use-multiprocessors/m-p/766447#M281</link>
    <description>Here is the link line:&lt;BR /&gt;xeons03 270&amp;gt; make sointel64 interface=ilp64 compiler=gnu function=pardiso_sym_c&lt;BR /&gt;&lt;BR /&gt;----- Compiling gnu_ilp64_parallel_intel64_so ----- pardiso_sym_c&lt;BR /&gt;gcc -m64 -w -DMKL_ILP64 -I"/opt/intel/Compiler/11.1/046/mkl/include" \&lt;BR /&gt; ./source/pardiso_sym_c.c \&lt;BR /&gt; -L"/opt/intel/Compiler/11.1/046/mkl/lib/intel64" -lmkl_intel_ilp64 \&lt;BR /&gt; -lmkl_intel_thread \&lt;BR /&gt; -lmkl_core \&lt;BR /&gt; -L"/opt/intel/Compiler/11.1/046/mkl/../lib/intel64" -liomp5 -lpthread -lm -o _results/gnu_ilp64_parallel_intel64_so/pardiso_sym_c.out&lt;BR /&gt;----- Execution gnu_ilp64_parallel_intel64_so ----- pardiso_sym_c&lt;BR /&gt;export LD_LIBRARY_PATH="/opt/intel/Compiler/11.1/046/mkl/lib/intel64":/opt/intel/Compiler/11.1/046/mkl/lib/em64t:/opt/intel/Compiler/11.1/046/mkl/../lib/intel64; \&lt;BR /&gt; _results/gnu_ilp64_parallel_intel64_so/pardiso_sym_c.out &amp;gt; _results/gnu_ilp64_parallel_intel64_so/pardiso_sym_c.res&lt;BR /&gt;&lt;BR /&gt;Is it because my problem size is too small?</description>
    <pubDate>Mon, 13 Dec 2010 21:41:24 GMT</pubDate>
    <dc:creator>xian-zhong_guous_cd-</dc:creator>
    <dc:date>2010-12-13T21:41:24Z</dc:date>
    <item>
      <title>How to use multiprocessors?</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/How-to-use-multiprocessors/m-p/766445#M279</link>
      <description>I am testing the MKL sparse solver PARDISO on Linux with 8 processors. I've set the environment variable MKL_NUM_THREADS=8. However, the output keeps telling me it uses only 1 processor (see the output enclosed below). Any idea what I did wrong?&lt;BR /&gt;Here is my make command:&lt;BR /&gt;make sointel64 interface=lp64 compiler=gnu function=pardiso_sym_c&lt;BR /&gt;Here is the output:&lt;BR /&gt;

&lt;PRE&gt;=== PARDISO is running in In-Core mode, because iparam(60)=0 ===


================  PARDISO: solving a  symmetric indef. system  ================


Summary PARDISO: ( reorder to reorder )
================

Times:
======
      Time fulladj: 0.000005 s
      Time reorder: 0.000250 s
      Time symbfct: 0.005507 s
      Time malloc : 0.000074 s
      Time total  : 0.006585 s total - sum: 0.000748 s

Statistics:
===========
&lt;A name="__DdeLink__0_1421249491"&gt;&lt;/A&gt; &amp;lt; Parallel Direct Factorization with #processors: &amp;gt;         1
 &amp;lt; Numerical Factorization with Level-3 BLAS performance &amp;gt;

 &amp;lt; Linear system Ax = b&amp;gt; &lt;TRANSPOSE&gt; 
             #equations:                                     9
             #non-zeros in A:                                19
             non-zeros in A (%):                            23.456790

             #right-hand sides:                              1

 &amp;lt; Factors L and U &amp;gt; 
             #columns for each panel:                        128
             #independent subgraphs:                         0
 &amp;lt; Preprocessing with state of the art partitioning metis&amp;gt;
             #supernodes:                                    6
             size of largest supernode:                      4
             number of nonzeros in L                       29
             number of nonzeros in U                       1
             number of nonzeros in L+U                     30

Reordering completed ... 
Number of nonzeros in factors = 30
Number of factorization MFLOPS = 0 Percentage of computed non-zeros for LL^T factorization
 0 %  3 %  13 %  24 %  31 %  44 %  100 %  
 
================  PARDISO: solving a  symmetric indef. system  ================


Summary PARDISO: ( factorize to factorize )
================

Times:
======
      Time A to LU: 0.000000 s
      Time numfct : 0.001327 s
      Time malloc : 0.000012 s
      Time total  : 0.001343 s total - sum: 0.000004 s

Statistics:
===========
 &amp;lt; Parallel Direct Factorization with #processors: &amp;gt;         1
 &amp;lt; Numerical Factorization with Level-3 BLAS performance &amp;gt;

 &amp;lt; Linear system Ax = b&amp;gt; &lt;TRANSPOSE&gt; 
             #equations:                                     9
             #non-zeros in A:                                19
             non-zeros in A (%):                            23.456790

             #right-hand sides:                              1

 &amp;lt; Factors L and U &amp;gt; 
             #columns for each panel:                        128
             #independent subgraphs:                         0
 &amp;lt; Preprocessing with state of the art partitioning metis&amp;gt;
             #supernodes:                                    6
             size of largest supernode:                      4
             number of nonzeros in L                       29
             number of nonzeros in U                       1
             number of nonzeros in L+U                     30
             gflop   for the numerical factorization:        0.000000

             gflop/s for the numerical factorization:        0.000047


Factorization completed ... 
================  PARDISO: solving a  symmetric indef. system  ================


Summary PARDISO: ( solve to solve )
================

Times:
======
      Time solve  : 0.000047 s
      Time total  : 0.000066 s total - sum: 0.000019 s

Statistics:
===========
 &amp;lt; Parallel Direct Factorization with #processors: &amp;gt;         1
 &amp;lt; Numerical Factorization with Level-3 BLAS performance &amp;gt;

 &amp;lt; Linear system Ax = b&amp;gt; &lt;TRANSPOSE&gt; 
             #equations:                                     9
             #non-zeros in A:                                19
             non-zeros in A (%):                            23.456790

             #right-hand sides:                              1

 &amp;lt; Factors L and U &amp;gt; 
             #columns for each panel:                        128
             #independent subgraphs:                         0
 &amp;lt; Preprocessing with state of the art partitioning metis&amp;gt;
             #supernodes:                                    6
             size of largest supernode:                      4
             number of nonzeros in L                       29
             number of nonzeros in U                       1
             number of nonzeros in L+U                     30
             gflop   for the numerical factorization:        0.000000

             gflop/s for the numerical factorization:        0.000047


Solve completed ... 
The solution of the system is: 
 x [0] = -0.041860
 x [1] = -0.003413
 x [2] =  0.117250
 x [3] = -0.112640
 x [4] =  0.024172
 x [5] = -0.107633
 x [6] =  0.198720
 x [7] =  0.190383
 x [8] =  1.000000&lt;/TRANSPOSE&gt;&lt;/TRANSPOSE&gt;&lt;/TRANSPOSE&gt;&lt;/PRE&gt;&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Fri, 10 Dec 2010 23:14:16 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/How-to-use-multiprocessors/m-p/766445#M279</guid>
      <dc:creator>xian-zhong_guous_cd-</dc:creator>
      <dc:date>2010-12-10T23:14:16Z</dc:date>
    </item>
    <item>
      <title>How to use multiprocessors?</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/How-to-use-multiprocessors/m-p/766446#M280</link>
      <description>Hi,&lt;DIV&gt;Could you provide the link line of your example? It seems that you are linking with the sequential library. To check your link line, please use this article: &lt;A href="http://software.intel.com/en-us/articles/intel-mkl-link-line-advisor/"&gt;http://software.intel.com/en-us/articles/intel-mkl-link-line-advisor/&lt;/A&gt;&lt;/DIV&gt;&lt;DIV&gt;With best regards,&lt;/DIV&gt;&lt;DIV&gt;Alexander Kalinkin&lt;/DIV&gt;</description>
      <pubDate>Sat, 11 Dec 2010 08:04:23 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/How-to-use-multiprocessors/m-p/766446#M280</guid>
      <dc:creator>Alexander_K_Intel2</dc:creator>
      <dc:date>2010-12-11T08:04:23Z</dc:date>
    </item>
    <item>
      <title>How to use multiprocessors?</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/How-to-use-multiprocessors/m-p/766447#M281</link>
      <description>Here is the link line:&lt;BR /&gt;xeons03 270&amp;gt; make sointel64 interface=ilp64 compiler=gnu function=pardiso_sym_c&lt;BR /&gt;&lt;BR /&gt;----- Compiling gnu_ilp64_parallel_intel64_so ----- pardiso_sym_c&lt;BR /&gt;gcc -m64 -w -DMKL_ILP64 -I"/opt/intel/Compiler/11.1/046/mkl/include" \&lt;BR /&gt; ./source/pardiso_sym_c.c \&lt;BR /&gt; -L"/opt/intel/Compiler/11.1/046/mkl/lib/intel64" -lmkl_intel_ilp64 \&lt;BR /&gt; -lmkl_intel_thread \&lt;BR /&gt; -lmkl_core \&lt;BR /&gt; -L"/opt/intel/Compiler/11.1/046/mkl/../lib/intel64" -liomp5 -lpthread -lm -o _results/gnu_ilp64_parallel_intel64_so/pardiso_sym_c.out&lt;BR /&gt;----- Execution gnu_ilp64_parallel_intel64_so ----- pardiso_sym_c&lt;BR /&gt;export LD_LIBRARY_PATH="/opt/intel/Compiler/11.1/046/mkl/lib/intel64":/opt/intel/Compiler/11.1/046/mkl/lib/em64t:/opt/intel/Compiler/11.1/046/mkl/../lib/intel64; \&lt;BR /&gt; _results/gnu_ilp64_parallel_intel64_so/pardiso_sym_c.out &amp;gt; _results/gnu_ilp64_parallel_intel64_so/pardiso_sym_c.res&lt;BR /&gt;&lt;BR /&gt;Is it because my problem size is too small?</description>
      <pubDate>Mon, 13 Dec 2010 21:41:24 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/How-to-use-multiprocessors/m-p/766447#M281</guid>
      <dc:creator>xian-zhong_guous_cd-</dc:creator>
      <dc:date>2010-12-13T21:41:24Z</dc:date>
    </item>
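The link line posted above already names the threaded layer (-lmkl_intel_thread together with -liomp5), so the sequential library is probably not the cause. A minimal sketch of that check follows. The function name threading_layer and the short list of layer names are illustrative assumptions, not part of MKL or of the link-line advisor.

```python
import shlex

# Threading-layer library names to look for (an illustrative, incomplete list).
THREADED_LAYERS = ("mkl_intel_thread", "mkl_gnu_thread")
SEQUENTIAL_LAYER = "mkl_sequential"


def threading_layer(link_line: str) -> str:
    """Classify a compiler link line as 'sequential', 'threaded' or 'unknown'."""
    # Collect the names given with -l (lower case); -L search paths are skipped.
    libs = [tok[2:] for tok in shlex.split(link_line) if tok.startswith("-l")]
    if SEQUENTIAL_LAYER in libs:
        return "sequential"
    if any(lib in THREADED_LAYERS for lib in libs):
        return "threaded"
    return "unknown"


if __name__ == "__main__":
    # Library flags taken from the link line in the post above.
    posted = "gcc -m64 -w -DMKL_ILP64 -lmkl_intel_ilp64 -lmkl_intel_thread -lmkl_core -liomp5 -lpthread -lm"
    print(threading_layer(posted))  # prints: threaded
```

A result of "threaded" only says the parallel layer was linked; it does not say how many threads PARDISO will choose at run time.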
    <item>
      <title>How to use multiprocessors?</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/How-to-use-multiprocessors/m-p/766448#M282</link>
      <description>Can you please check the value of the MKL_NUM_THREADS environment variable?&lt;BR /&gt;&lt;BR /&gt;More details on controlling the number of threads using environment variables can be found in the MKL user manual:&lt;BR /&gt;&lt;BR /&gt;&lt;A href="http://software.intel.com/sites/products/documentation/hpc/composerxe/en-us/mklxe/mkl_userguide_lnx/MKL_UG_managing_performance/MKL_Environment_Variables_For_Threading_Control.htm" target="_blank"&gt;http://software.intel.com/sites/products/documentation/hpc/composerxe/en-us/mklxe/mkl_userguide_lnx/MKL_UG_managing_performance/MKL_Environment_Variables_For_Threading_Control.htm&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt; --Vipin</description>
      <pubDate>Tue, 14 Dec 2010 07:14:16 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/How-to-use-multiprocessors/m-p/766448#M282</guid>
      <dc:creator>VipinKumar_E_Intel</dc:creator>
      <dc:date>2010-12-14T07:14:16Z</dc:date>
    </item>
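One way to check the value suggested above is to print the threading-related variables from a process started in the same shell that runs the example. The sketch below is only an illustration: the helper names thread_settings and requested_threads are made up here, and it assumes the documented behaviour that MKL_NUM_THREADS takes precedence over OMP_NUM_THREADS for MKL routines.

```python
import os

# Variables that influence how many threads MKL may use.
THREAD_VARS = ("MKL_NUM_THREADS", "OMP_NUM_THREADS", "MKL_DYNAMIC")


def thread_settings(env=None):
    """Return the threading variables as a dict; unset ones map to None."""
    env = os.environ if env is None else env
    return {name: env.get(name) for name in THREAD_VARS}


def requested_threads(env=None):
    """Thread count being requested, preferring MKL_NUM_THREADS."""
    settings = thread_settings(env)
    for name in ("MKL_NUM_THREADS", "OMP_NUM_THREADS"):
        value = settings[name]
        if value is not None and value.strip().isdigit():
            return int(value)
    return None  # nothing usable is set, so the library picks its own default


if __name__ == "__main__":
    for name, value in thread_settings().items():
        print(f"{name} = {value if value is not None else '(unset)'}")
```

A variable that was set in the shell but not exported shows up as unset here, and the same would hold for the PARDISO executable. Even with a valid count, MKL may use fewer threads than requested while MKL_DYNAMIC is left at its default.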
    <item>
      <title>How to use multiprocessors?</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/How-to-use-multiprocessors/m-p/766449#M283</link>
      <description>Hi,&lt;DIV&gt;&lt;SPAN style="line-height: 16px;"&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN style="line-height: 16px;"&gt;You are correct: the task is too small, and PARDISO decided to solve it using only 1 thread because there is no benefit in multithreading for such small matrices.&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN style="line-height: 16px;"&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN style="line-height: 16px;"&gt;Most likely, threading is set up correctly in your program. Please try to solve a matrix with a substantially larger number of equations (say, more than one thousand).&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN style="line-height: 16px;"&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN style="line-height: 16px;"&gt;Best regards,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN style="line-height: 16px;"&gt;Konstantin&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Tue, 14 Dec 2010 08:28:21 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/How-to-use-multiprocessors/m-p/766449#M283</guid>
      <dc:creator>Konstantin_A_Intel</dc:creator>
      <dc:date>2010-12-14T08:28:21Z</dc:date>
    </item>
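To try the suggestion above, a larger symmetric test matrix is needed. The sketch below builds one possible candidate, the 5-point Laplacian on an m-by-m grid, stored the way PARDISO expects a symmetric matrix by default: upper triangle only, CSR layout, one-based indices, ascending columns in each row. The function name and the choice of matrix are assumptions made for illustration; the thread does not say which matrix to use, and the arrays would still have to be copied into the C example.

```python
def laplacian_2d_upper_csr(m):
    """Upper triangle of the 5-point Laplacian on an m-by-m grid.

    Returns (ia, ja, a) in CSR form with one-based row pointers and
    column indices. The matrix has m*m equations.
    """
    n = m * m
    ia, ja, a = [1], [], []
    for k in range(n):
        row, col = divmod(k, m)
        # Diagonal entry.
        ja.append(k + 1)
        a.append(4.0)
        # Coupling to the right-hand neighbour in the same grid row.
        if col + 1 != m:
            ja.append(k + 2)
            a.append(-1.0)
        # Coupling to the neighbour in the next grid row.
        if row + 1 != m:
            ja.append(k + m + 1)
            a.append(-1.0)
        # One-based pointer to the start of the next row.
        ia.append(len(ja) + 1)
    return ia, ja, a


if __name__ == "__main__":
    ia, ja, a = laplacian_2d_upper_csr(50)
    print("equations:", len(ia) - 1)   # 2500
    print("stored non-zeros:", len(a)) # 7400
```

With m = 50 this gives 2500 equations, above the "more than one thousand" mentioned in the reply. The matrix is symmetric positive definite, which should also be acceptable input for the symmetric indefinite solver used in the example, though that is not confirmed anywhere in the thread.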
  </channel>
</rss>

