<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic MKL PARDISO statistics in Intel® oneAPI Math Kernel Library</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-PARDISO-statistics/m-p/817199#M4374</link>
    <description>From the following PARDISO statistics, it seems solving for rhs (solve to solve) is domainant. This does not seem to consistent with my expection: I expect factorization (factorize to factorize) is domainant.&lt;BR /&gt;&lt;BR /&gt;PARDISO stattistics starts here:&lt;BR /&gt;&lt;META http-equiv="CONTENT-TYPE" content="text/html; charset=utf-8" /&gt;
	&lt;TITLE&gt;&lt;/TITLE&gt;
	&lt;META name="GENERATOR" content="OpenOffice.org 3.2  (Unix)" /&gt;
	&lt;STYLE type="text/css"&gt;&amp;lt;!--
		@page { margin: 0.79in }
		P { margin-bottom: 0.08in }
--&amp;gt;&lt;/STYLE&gt;

&lt;P&gt;ooc_max_core_size        got by Env = 
20000&lt;/P&gt;
&lt;P&gt;The file ./pardiso_ooc.cfg was not
opened&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;=== PARDISO is running in Out-Of-Core
mode, because iparam(60)=1 and there is not enough RAM for In-Core
===&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;================  PARDISO: solving a
symm. posit. def. system  ================&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Summary PARDISO: ( reorder to reorder )&lt;/P&gt;
&lt;P&gt;================&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Times:&lt;/P&gt;
&lt;P&gt;======&lt;/P&gt;
&lt;P&gt;      Time fulladj: 0.684705 s&lt;/P&gt;
&lt;P&gt;      Time reorder: 89.446126 s&lt;/P&gt;
&lt;P&gt;      Time symbfct: 9.093952 s&lt;/P&gt;
&lt;P&gt;      Time parlist: 23.178850 s&lt;/P&gt;
&lt;P&gt;      Time malloc : 1.119903 s&lt;/P&gt;
&lt;P&gt;      Time total  : 155.063945 s total
- sum: 31.540409 s&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Statistics:&lt;/P&gt;
&lt;P&gt;===========&lt;/P&gt;
&lt;P&gt; &amp;lt; Parallel Direct Factorization
with #processors: &amp;gt;         8&lt;/P&gt;
&lt;P&gt; &amp;lt; Numerical Factorization with
BLAS3 and O(n) synchronization &amp;gt;&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt; &amp;lt; Linear system Ax = b&amp;gt;
&lt;TRANSPOSE&gt; 
&lt;/TRANSPOSE&gt;&lt;/P&gt;
&lt;P&gt;             #equations:               
                     5389726&lt;/P&gt;
&lt;P&gt;             #non-zeros in A:          
                     42250735&lt;/P&gt;
&lt;P&gt;             non-zeros in A (%):       
                    0.000145&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;             #right-hand sides:        
                     1&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt; &amp;lt; Factors L and U &amp;gt; 
&lt;/P&gt;
&lt;P&gt;             #columns for each panel:  
                     96&lt;/P&gt;
&lt;P&gt;             #independent subgraphs:   
                     0&lt;/P&gt;
&lt;P&gt; &amp;lt; Preprocessing with state of the
art partitioning metis&amp;gt;&lt;/P&gt;
&lt;P&gt;             #supernodes:              
                     2241484&lt;/P&gt;
&lt;P&gt;             size of largest supernode:
                     9951&lt;/P&gt;
&lt;P&gt;             number of nonzeros in L   
                   3046299967&lt;/P&gt;
&lt;P&gt;             number of nonzeros in U   
                   1&lt;/P&gt;
&lt;P&gt;             number of nonzeros in L+U 
                   3046299968&lt;/P&gt;
&lt;P&gt; Percentage of computed non-zeros for
LL^T factorization&lt;/P&gt;
&lt;P&gt; 0 %  1 %  2 %  3 %  4 %  5 %  6 %  7 %
 8 %  9 %  10 %  11 %  12 %  13 %  14 %  15 %  16 %  17 %  18 %  19 %
 20 %  21 %  22 %  23 %  24 %  25 %  26 %  27 %  28 %  29 %  30 %  31
%  32 %  33 %  34 %  35 %  36 %  37 %  38 %  39 %  40 %  41 %  42 % 
43 %  44 %  45 %  46 %  47 %  48 %  49 %  50 %  51 %  52 %  53 %  54
%  55 %  56 %  57 %  58 %  59 %  60 %  61 %  62 %  63 %  64 %  65 % 
66 %  67 %  68 %  69 %  70 %  71 %  72 %  73 %  74 %  75 %  76 %  77
%  78 %  79 %  80 %  81 %  82 %  83 %  84 %  85 %  86 %  87 %  88 % 
89 %  90 %  91 %  92 %  93 %  94 %  95 %  96 %  97 %  98 %  100 %  
&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;================  PARDISO: solving a
symm. posit. def. system  ================&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Summary PARDISO: ( factorize to
factorize )&lt;/P&gt;
&lt;P&gt;================&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Times:&lt;/P&gt;
&lt;P&gt;======&lt;/P&gt;
&lt;P&gt;      Time A to LU: 0.000000 s&lt;/P&gt;
&lt;P&gt;      Factorization:        Time for
writing to files : 0.000000&lt;/P&gt;
&lt;P&gt;      Factorization:        Time for
reading from files : 0.000000&lt;/P&gt;
&lt;P&gt;      Time numfct : 663.914768 s&lt;/P&gt;
&lt;P&gt;      Time malloc : 0.000054 s&lt;/P&gt;
&lt;P&gt;      Time total  : 663.954133 s total
- sum: 0.039310 s&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Statistics:&lt;/P&gt;
&lt;P&gt;===========&lt;/P&gt;
&lt;P&gt; &amp;lt; Parallel Direct Factorization
with #processors: &amp;gt;         8&lt;/P&gt;
&lt;P&gt; &amp;lt; Numerical Factorization with
BLAS3 and O(n) synchronization &amp;gt;&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt; &amp;lt; Linear system Ax = b&amp;gt;
&lt;TRANSPOSE&gt; 
&lt;/TRANSPOSE&gt;&lt;/P&gt;
&lt;P&gt;             #equations:               
                     5389726&lt;/P&gt;
&lt;P&gt;             #non-zeros in A:          
                     42250735&lt;/P&gt;
&lt;P&gt;             non-zeros in A (%):       
                    0.000145&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;             #right-hand sides:        
                     1&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt; &amp;lt; Factors L and U &amp;gt; 
&lt;/P&gt;
&lt;P&gt;             #columns for each panel:  
                     96&lt;/P&gt;
&lt;P&gt;             #independent subgraphs:   
                     0&lt;/P&gt;
&lt;P&gt; &amp;lt; Preprocessing with state of the
art partitioning metis&amp;gt;&lt;/P&gt;
&lt;P&gt;             #supernodes:              
                     2241484&lt;/P&gt;
&lt;P&gt;             size of largest supernode:
                     9951&lt;/P&gt;
&lt;P&gt;             number of nonzeros in L   
                   3046299967&lt;/P&gt;
&lt;P&gt;             number of nonzeros in U   
                   1&lt;/P&gt;
&lt;P&gt;             number of nonzeros in L+U 
                   3046299968&lt;/P&gt;
&lt;P&gt;             gflop   for the numerical
factorization:        11175.376693&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;             gflop/s for the numerical
factorization:        16.832547&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;================  PARDISO: solving a
symm. posit. def. system  ================&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Summary PARDISO: ( solve to solve )&lt;/P&gt;
&lt;P&gt;================&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Times:&lt;/P&gt;
&lt;P&gt;======&lt;/P&gt;
&lt;P&gt;      Solution:        Time for reading
from files : 0.000000&lt;/P&gt;
&lt;P&gt;      Time solve  : 716.844178 s&lt;/P&gt;
&lt;P&gt;      Time total  : 1867.447714 s total
- sum: 1150.603536 s&lt;/P&gt;
&lt;P&gt;
==============================================================&lt;/P&gt;
&lt;P&gt; ----------- Out of core time (in
percent (%)) --------------&lt;/P&gt;
&lt;P&gt;  Factorization step (100 (%)):&lt;/P&gt;
&lt;P&gt;      write to files : 0 
&lt;/P&gt;
&lt;P&gt;      read from files: 0 
&lt;/P&gt;
&lt;P&gt;      factorization - write&amp;amp;read :
100 
&lt;/P&gt;
&lt;P&gt;  Solution step (100 (%)):&lt;/P&gt;
&lt;P&gt;      read from files: 0 
&lt;/P&gt;
&lt;P&gt;      solve - write&amp;amp;read: 100 
&lt;/P&gt;
&lt;P&gt;  Total time (100 (%)):&lt;/P&gt;
&lt;P&gt;      read from files: 0 
&lt;/P&gt;
&lt;P&gt;      total - write&amp;amp;read: 100 
&lt;/P&gt;
&lt;P&gt; ----------- Out of core Mb
--------------&lt;/P&gt;
&lt;P&gt;  Factorization step:&lt;/P&gt;
&lt;P&gt;      write to files :      0.000 Mb 
&lt;/P&gt;
&lt;P&gt;      read from files:      0.000 Mb 
&lt;/P&gt;
&lt;P&gt;  Solution step:&lt;/P&gt;
&lt;P&gt;      read from files:      0.000 Mb 
&lt;/P&gt;
&lt;P&gt;  Total size of data transferred :&lt;/P&gt;
&lt;P&gt;      write&amp;amp;read        :     
0.000 Mb 
&lt;/P&gt;
&lt;P&gt;
==============================================================&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Statistics:&lt;/P&gt;
&lt;P&gt;===========&lt;/P&gt;
&lt;P&gt; &amp;lt; Parallel Direct Factorization
with #processors: &amp;gt;         8&lt;/P&gt;
&lt;P&gt; &amp;lt; Numerical Factorization with
BLAS3 and O(n) synchronization &amp;gt;&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt; &amp;lt; Linear system Ax = b&amp;gt;
&lt;TRANSPOSE&gt; 
&lt;/TRANSPOSE&gt;&lt;/P&gt;
&lt;P&gt;             #equations:               
                     5389726&lt;/P&gt;
&lt;P&gt;             #non-zeros in A:          
                     42250735&lt;/P&gt;
&lt;P&gt;             non-zeros in A (%):       
                    0.000145&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;             #right-hand sides:        
                     1&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt; &amp;lt; Factors L and U &amp;gt; 
&lt;/P&gt;
&lt;P&gt;             #columns for each panel:  
                     96&lt;/P&gt;
&lt;P&gt;             #independent subgraphs:   
                     0&lt;/P&gt;
&lt;P&gt; &amp;lt; Preprocessing with state of the
art partitioning metis&amp;gt;&lt;/P&gt;
&lt;P&gt;             #supernodes:              
                     2241484&lt;/P&gt;
&lt;P&gt;             size of largest supernode:
                     9951&lt;/P&gt;
&lt;P&gt;             number of nonzeros in L   
                   3046299967&lt;/P&gt;
&lt;P&gt;             number of nonzeros in U   
                   1&lt;/P&gt;
&lt;P&gt;             number of nonzeros in L+U 
                   3046299968&lt;/P&gt;
&lt;P&gt;             gflop   for the numerical
factorization:        11175.376693&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;             gflop/s for the numerical
factorization:        16.832547&lt;/P&gt;</description>
    <pubDate>Mon, 24 Jan 2011 22:52:06 GMT</pubDate>
    <dc:creator>xian-zhong_guous_cd-</dc:creator>
    <dc:date>2011-01-24T22:52:06Z</dc:date>
    <item>
      <title>MKL PARDISO statistics</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-PARDISO-statistics/m-p/817199#M4374</link>
      <description>From the following PARDISO statistics, it seems solving for rhs (solve to solve) is domainant. This does not seem to consistent with my expection: I expect factorization (factorize to factorize) is domainant.&lt;BR /&gt;&lt;BR /&gt;PARDISO stattistics starts here:&lt;BR /&gt;&lt;META http-equiv="CONTENT-TYPE" content="text/html; charset=utf-8" /&gt;
	&lt;TITLE&gt;&lt;/TITLE&gt;
	&lt;META name="GENERATOR" content="OpenOffice.org 3.2  (Unix)" /&gt;
	&lt;STYLE type="text/css"&gt;&amp;lt;!--
		@page { margin: 0.79in }
		P { margin-bottom: 0.08in }
--&amp;gt;&lt;/STYLE&gt;

&lt;P&gt;ooc_max_core_size        got by Env = 
20000&lt;/P&gt;
&lt;P&gt;The file ./pardiso_ooc.cfg was not
opened&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;=== PARDISO is running in Out-Of-Core
mode, because iparam(60)=1 and there is not enough RAM for In-Core
===&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;================  PARDISO: solving a
symm. posit. def. system  ================&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Summary PARDISO: ( reorder to reorder )&lt;/P&gt;
&lt;P&gt;================&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Times:&lt;/P&gt;
&lt;P&gt;======&lt;/P&gt;
&lt;P&gt;      Time fulladj: 0.684705 s&lt;/P&gt;
&lt;P&gt;      Time reorder: 89.446126 s&lt;/P&gt;
&lt;P&gt;      Time symbfct: 9.093952 s&lt;/P&gt;
&lt;P&gt;      Time parlist: 23.178850 s&lt;/P&gt;
&lt;P&gt;      Time malloc : 1.119903 s&lt;/P&gt;
&lt;P&gt;      Time total  : 155.063945 s total
- sum: 31.540409 s&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Statistics:&lt;/P&gt;
&lt;P&gt;===========&lt;/P&gt;
&lt;P&gt; &amp;lt; Parallel Direct Factorization
with #processors: &amp;gt;         8&lt;/P&gt;
&lt;P&gt; &amp;lt; Numerical Factorization with
BLAS3 and O(n) synchronization &amp;gt;&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt; &amp;lt; Linear system Ax = b&amp;gt;
&lt;TRANSPOSE&gt; 
&lt;/TRANSPOSE&gt;&lt;/P&gt;
&lt;P&gt;             #equations:               
                     5389726&lt;/P&gt;
&lt;P&gt;             #non-zeros in A:          
                     42250735&lt;/P&gt;
&lt;P&gt;             non-zeros in A (%):       
                    0.000145&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;             #right-hand sides:        
                     1&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt; &amp;lt; Factors L and U &amp;gt; 
&lt;/P&gt;
&lt;P&gt;             #columns for each panel:  
                     96&lt;/P&gt;
&lt;P&gt;             #independent subgraphs:   
                     0&lt;/P&gt;
&lt;P&gt; &amp;lt; Preprocessing with state of the
art partitioning metis&amp;gt;&lt;/P&gt;
&lt;P&gt;             #supernodes:              
                     2241484&lt;/P&gt;
&lt;P&gt;             size of largest supernode:
                     9951&lt;/P&gt;
&lt;P&gt;             number of nonzeros in L   
                   3046299967&lt;/P&gt;
&lt;P&gt;             number of nonzeros in U   
                   1&lt;/P&gt;
&lt;P&gt;             number of nonzeros in L+U 
                   3046299968&lt;/P&gt;
&lt;P&gt; Percentage of computed non-zeros for
LL^T factorization&lt;/P&gt;
&lt;P&gt; 0 %  1 %  2 %  3 %  4 %  5 %  6 %  7 %
 8 %  9 %  10 %  11 %  12 %  13 %  14 %  15 %  16 %  17 %  18 %  19 %
 20 %  21 %  22 %  23 %  24 %  25 %  26 %  27 %  28 %  29 %  30 %  31
%  32 %  33 %  34 %  35 %  36 %  37 %  38 %  39 %  40 %  41 %  42 % 
43 %  44 %  45 %  46 %  47 %  48 %  49 %  50 %  51 %  52 %  53 %  54
%  55 %  56 %  57 %  58 %  59 %  60 %  61 %  62 %  63 %  64 %  65 % 
66 %  67 %  68 %  69 %  70 %  71 %  72 %  73 %  74 %  75 %  76 %  77
%  78 %  79 %  80 %  81 %  82 %  83 %  84 %  85 %  86 %  87 %  88 % 
89 %  90 %  91 %  92 %  93 %  94 %  95 %  96 %  97 %  98 %  100 %  
&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;================  PARDISO: solving a
symm. posit. def. system  ================&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Summary PARDISO: ( factorize to
factorize )&lt;/P&gt;
&lt;P&gt;================&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Times:&lt;/P&gt;
&lt;P&gt;======&lt;/P&gt;
&lt;P&gt;      Time A to LU: 0.000000 s&lt;/P&gt;
&lt;P&gt;      Factorization:        Time for
writing to files : 0.000000&lt;/P&gt;
&lt;P&gt;      Factorization:        Time for
reading from files : 0.000000&lt;/P&gt;
&lt;P&gt;      Time numfct : 663.914768 s&lt;/P&gt;
&lt;P&gt;      Time malloc : 0.000054 s&lt;/P&gt;
&lt;P&gt;      Time total  : 663.954133 s total
- sum: 0.039310 s&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Statistics:&lt;/P&gt;
&lt;P&gt;===========&lt;/P&gt;
&lt;P&gt; &amp;lt; Parallel Direct Factorization
with #processors: &amp;gt;         8&lt;/P&gt;
&lt;P&gt; &amp;lt; Numerical Factorization with
BLAS3 and O(n) synchronization &amp;gt;&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt; &amp;lt; Linear system Ax = b&amp;gt;
&lt;TRANSPOSE&gt; 
&lt;/TRANSPOSE&gt;&lt;/P&gt;
&lt;P&gt;             #equations:               
                     5389726&lt;/P&gt;
&lt;P&gt;             #non-zeros in A:          
                     42250735&lt;/P&gt;
&lt;P&gt;             non-zeros in A (%):       
                    0.000145&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;             #right-hand sides:        
                     1&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt; &amp;lt; Factors L and U &amp;gt; 
&lt;/P&gt;
&lt;P&gt;             #columns for each panel:  
                     96&lt;/P&gt;
&lt;P&gt;             #independent subgraphs:   
                     0&lt;/P&gt;
&lt;P&gt; &amp;lt; Preprocessing with state of the
art partitioning metis&amp;gt;&lt;/P&gt;
&lt;P&gt;             #supernodes:              
                     2241484&lt;/P&gt;
&lt;P&gt;             size of largest supernode:
                     9951&lt;/P&gt;
&lt;P&gt;             number of nonzeros in L   
                   3046299967&lt;/P&gt;
&lt;P&gt;             number of nonzeros in U   
                   1&lt;/P&gt;
&lt;P&gt;             number of nonzeros in L+U 
                   3046299968&lt;/P&gt;
&lt;P&gt;             gflop   for the numerical
factorization:        11175.376693&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;             gflop/s for the numerical
factorization:        16.832547&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;================  PARDISO: solving a
symm. posit. def. system  ================&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Summary PARDISO: ( solve to solve )&lt;/P&gt;
&lt;P&gt;================&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Times:&lt;/P&gt;
&lt;P&gt;======&lt;/P&gt;
&lt;P&gt;      Solution:        Time for reading
from files : 0.000000&lt;/P&gt;
&lt;P&gt;      Time solve  : 716.844178 s&lt;/P&gt;
&lt;P&gt;      Time total  : 1867.447714 s total
- sum: 1150.603536 s&lt;/P&gt;
&lt;P&gt;
==============================================================&lt;/P&gt;
&lt;P&gt; ----------- Out of core time (in
percent (%)) --------------&lt;/P&gt;
&lt;P&gt;  Factorization step (100 (%)):&lt;/P&gt;
&lt;P&gt;      write to files : 0 
&lt;/P&gt;
&lt;P&gt;      read from files: 0 
&lt;/P&gt;
&lt;P&gt;      factorization - write&amp;amp;read :
100 
&lt;/P&gt;
&lt;P&gt;  Solution step (100 (%)):&lt;/P&gt;
&lt;P&gt;      read from files: 0 
&lt;/P&gt;
&lt;P&gt;      solve - write&amp;amp;read: 100 
&lt;/P&gt;
&lt;P&gt;  Total time (100 (%)):&lt;/P&gt;
&lt;P&gt;      read from files: 0 
&lt;/P&gt;
&lt;P&gt;      total - write&amp;amp;read: 100 
&lt;/P&gt;
&lt;P&gt; ----------- Out of core Mb
--------------&lt;/P&gt;
&lt;P&gt;  Factorization step:&lt;/P&gt;
&lt;P&gt;      write to files :      0.000 Mb 
&lt;/P&gt;
&lt;P&gt;      read from files:      0.000 Mb 
&lt;/P&gt;
&lt;P&gt;  Solution step:&lt;/P&gt;
&lt;P&gt;      read from files:      0.000 Mb 
&lt;/P&gt;
&lt;P&gt;  Total size of data transferred :&lt;/P&gt;
&lt;P&gt;      write&amp;amp;read        :     
0.000 Mb 
&lt;/P&gt;
&lt;P&gt;
==============================================================&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Statistics:&lt;/P&gt;
&lt;P&gt;===========&lt;/P&gt;
&lt;P&gt; &amp;lt; Parallel Direct Factorization
with #processors: &amp;gt;         8&lt;/P&gt;
&lt;P&gt; &amp;lt; Numerical Factorization with
BLAS3 and O(n) synchronization &amp;gt;&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt; &amp;lt; Linear system Ax = b&amp;gt;
&lt;TRANSPOSE&gt; 
&lt;/TRANSPOSE&gt;&lt;/P&gt;
&lt;P&gt;             #equations:               
                     5389726&lt;/P&gt;
&lt;P&gt;             #non-zeros in A:          
                     42250735&lt;/P&gt;
&lt;P&gt;             non-zeros in A (%):       
                    0.000145&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;             #right-hand sides:        
                     1&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt; &amp;lt; Factors L and U &amp;gt; 
&lt;/P&gt;
&lt;P&gt;             #columns for each panel:  
                     96&lt;/P&gt;
&lt;P&gt;             #independent subgraphs:   
                     0&lt;/P&gt;
&lt;P&gt; &amp;lt; Preprocessing with state of the
art partitioning metis&amp;gt;&lt;/P&gt;
&lt;P&gt;             #supernodes:              
                     2241484&lt;/P&gt;
&lt;P&gt;             size of largest supernode:
                     9951&lt;/P&gt;
&lt;P&gt;             number of nonzeros in L   
                   3046299967&lt;/P&gt;
&lt;P&gt;             number of nonzeros in U   
                   1&lt;/P&gt;
&lt;P&gt;             number of nonzeros in L+U 
                   3046299968&lt;/P&gt;
&lt;P&gt;             gflop   for the numerical
factorization:        11175.376693&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;             gflop/s for the numerical
factorization:        16.832547&lt;/P&gt;</description>
      <pubDate>Mon, 24 Jan 2011 22:52:06 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-PARDISO-statistics/m-p/817199#M4374</guid>
      <dc:creator>xian-zhong_guous_cd-</dc:creator>
      <dc:date>2011-01-24T22:52:06Z</dc:date>
    </item>
    <item>
      <title>MKL PARDISO statistics</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-PARDISO-statistics/m-p/817200#M4375</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;In In-Core mode solving step is faster than factorization one.&lt;/P&gt;&lt;P&gt;In OOC mode following situation can be happened: factorisation step doesnt read L-factors, just saves. Solving step reads all L-factor twice: at first during forward step, secondly during backward one. Let me note, if iterative refinement step (&lt;I&gt;iparm&lt;/I&gt;(8)) is switch on, so solving step can be significantly slower because PARDISO should read L-factors several times.&lt;/P&gt;Regards, Sergey</description>
      <pubDate>Tue, 25 Jan 2011 12:51:27 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-PARDISO-statistics/m-p/817200#M4375</guid>
      <dc:creator>Sergey_Solovev__Inte</dc:creator>
      <dc:date>2011-01-25T12:51:27Z</dc:date>
    </item>
    <item>
      <title>MKL PARDISO statistics</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-PARDISO-statistics/m-p/817201#M4376</link>
      <description>Hi,&lt;BR /&gt;May I know how you got Pardiso to run with 8 processors? Of couse you need to have 8 physical processors. My machine has 4 processors and I have the line&lt;BR /&gt;&lt;BR /&gt;CALL OMP_SET_NUM_THREADS(4)&lt;BR /&gt;&lt;BR /&gt;in my subroutine. However, when I run it, the report says that the number of processors used in the factorization is 1. When I printedomp_get_max_threads() it shows value of 4.&lt;BR /&gt;&lt;BR /&gt;Am I missing something?&lt;BR /&gt;&lt;BR /&gt;Thank you.&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;Sincerely,&lt;BR /&gt;EH</description>
      <pubDate>Wed, 16 Nov 2011 12:44:11 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-PARDISO-statistics/m-p/817201#M4376</guid>
      <dc:creator>eh4</dc:creator>
      <dc:date>2011-11-16T12:44:11Z</dc:date>
    </item>
    <item>
      <title>MKL PARDISO statistics</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-PARDISO-statistics/m-p/817202#M4377</link>
      <description>&lt;P&gt;What the mode of PARDISO do youuse: InCore or OOC ? And what the version of MKL?&lt;BR /&gt;Our quick advice is to use call mkl_set_num_threads(4) for setting MKL_NUM_THREADS environment variable.&lt;BR /&gt;Thank you!&lt;BR /&gt;Sergey&lt;/P&gt;</description>
      <pubDate>Thu, 17 Nov 2011 03:42:49 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-Math-Kernel-Library/MKL-PARDISO-statistics/m-p/817202#M4377</guid>
      <dc:creator>Sergey_Solovev__Inte</dc:creator>
      <dc:date>2011-11-17T03:42:49Z</dc:date>
    </item>
  </channel>
</rss>

