xian-zhong_guous_cd-

Beginner

01-24-2011
03:41 PM

59 Views

PARDISO single precision

Summary PARDISO: ( factorize to factorize )

================

Times:

======

Time A to LU: 0.000000 s

Factorization: Time for writing to files : 0.000000

Factorization: Time for reading from files : 0.000000

Time numfct : 4066.776367 s

Time malloc : 0.000056 s

Time total : 4066.812012 s total - sum: 0.035689 s

Statistics:

===========

< Parallel Direct Factorization with #processors: > 8

< Numerical Factorization with BLAS3 and O(n) synchronization >

< Linear system Ax = b>

#equations: 5389726

#non-zeros in A: 42250735

non-zeros in A (%): 0.000145

#right-hand sides: 1

< Factors L and U >

#columns for each panel: 96

#independent subgraphs: 0

< Preprocessing with state of the art partitioning metis>

#supernodes: 2241484

size of largest supernode: 9951

number of nonzeros in L 3046299967

number of nonzeros in U 1

number of nonzeros in L+U 3046299968

gflop for the numerical factorization: 11175.376953

gflop/s for the numerical factorization: 2.747969

------------------------------------------------------------------------------------------------------------

Summary PARDISO: ( factorize to factorize )

================

Times:

======

Time A to LU: 0.000000 s

Factorization: Time for writing to files : 0.000000

Factorization: Time for reading from files : 0.000000

Time numfct : 663.914768 s

Time malloc : 0.000054 s

Time total : 663.954133 s total - sum: 0.039310 s

Statistics:

===========

< Parallel Direct Factorization with #processors: > 8

< Numerical Factorization with BLAS3 and O(n) synchronization >

< Linear system Ax = b>

#equations: 5389726

#non-zeros in A: 42250735

non-zeros in A (%): 0.000145

#right-hand sides: 1

< Factors L and U >

#columns for each panel: 96

#independent subgraphs: 0

< Preprocessing with state of the art partitioning metis>

#supernodes: 2241484

size of largest supernode: 9951

number of nonzeros in L 3046299967

number of nonzeros in U 1

number of nonzeros in L+U 3046299968

gflop for the numerical factorization: 11175.376693

gflop/s for the numerical factorization: 16.832547
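
As a quick sanity check on the two summaries above: the reported gflop/s is the gflop count divided by Time numfct. A minimal C sketch of that arithmetic follows; the function names are made up for illustration and are not part of MKL.

```c
#include <assert.h>

/* Factorization throughput in gflop/s:
 * total gflop divided by the numfct time in seconds. */
double numfct_gflops_rate(double gflop, double seconds)
{
    assert(seconds > 0.0);
    return gflop / seconds;
}

/* How many times longer the slow run took compared with the fast run. */
double slowdown(double slow_seconds, double fast_seconds)
{
    assert(fast_seconds > 0.0);
    return slow_seconds / fast_seconds;
}
```

With the logged values, numfct_gflops_rate(11175.376953, 4066.776367) gives about 2.75 and numfct_gflops_rate(11175.376693, 663.914768) gives about 16.83, matching the two summaries. The operation count is practically identical in both runs, so the whole difference is in the factorization time, roughly a factor of 6.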

Sergey_P_Intel2

Employee

01-25-2011
04:17 AM

The issue you describe looks strange, because in our tests the single precision version of PARDISO performs significantly better than the double precision version. Could you provide us with your test case? Please also tell us which MKL version you use and describe your platform.

Regards,

Sergey

Sergey_Solovev__Inte

New Contributor I

01-25-2011
05:31 AM

Could you provide us with the following info:

What value of iparm(60) do you set? What value of MKL_PARDISO_OOC_MAX_CORE_SIZE do you set, and how much RAM does your system have?

Regards, Sergey

xian-zhong_guous_cd-

Beginner

01-25-2011
09:35 AM

By the way, I set MKL_PARDISO_OOC_MAX_CORE_SIZE=20000 and my machine has 32 GB of RAM.
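
For context on these numbers, here is a rough estimate of how much memory the factor values alone need, based on the "number of nonzeros in L+U" line from the logs. This is only a sketch under stated assumptions: it counts 8 bytes per value in double precision and 4 in single precision, it ignores index arrays and any workspace PARDISO allocates, and the function name is hypothetical. The MKL documentation describes MKL_PARDISO_OOC_MAX_CORE_SIZE as a limit in megabytes.

```c
#include <assert.h>
#include <stdint.h>

/* Size in MiB of nnz factor values stored with bytes_per_value bytes each
 * (8 for double, 4 for float). Values only: no indices, no workspace. */
double factor_values_mib(uint64_t nnz, unsigned bytes_per_value)
{
    assert(bytes_per_value > 0);
    return (double)nnz * (double)bytes_per_value / (1024.0 * 1024.0);
}
```

For nnz = 3046299968 this gives roughly 23241 MiB with 8-byte values and roughly 11621 MiB with 4-byte values, so under this estimate the double precision factors are above the 20000 setting and the single precision factors are below it. Real memory use will be higher than either figure.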

xian-zhong_guous_cd-

Beginner

01-25-2011
01:05 PM

PARDISO (pt, &maxfct, &mnum, &mtype, &phase,
         &N, (float*)mkl_a, mkl_ia, mkl_ja, &idum, &nrhs,
         iparm, &msglvl, &ddum, &ddum, &error);

In addition, I set iparm(28)=1. However, I sometimes get garbage for the solution, even though error=0.

Sergey_P_Intel2

Employee

01-25-2011
10:15 PM

According to the MKL manual, in single precision mode A, x and b should be of float type:

A: DOUBLE PRECISION - for real types of matrices (mtype=1,2, -2 and 11) and for double precision PARDISO (iparm(28)=0)

REAL - for real types of matrices (mtype=1, 2, -2 and 11) and for single precision PARDISO (iparm(28)=1)

Regards,

Sergey
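
One possible reading of the call posted earlier in the thread is that mkl_a is declared as a double array. In that case the cast (float*)mkl_a only changes the pointer type and does not convert the stored values, so PARDISO would read meaningless numbers. This is an assumption, since the declaration of mkl_a is not shown. Under that assumption, a minimal C sketch of converting the values before the call follows; the helper to_single and the name mkl_a_single are hypothetical and not part of MKL.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Convert n double values into a newly allocated float array.
 * Returns NULL if the allocation fails; the caller frees the result. */
float *to_single(const double *src, size_t n)
{
    assert(src != NULL || n == 0);
    float *dst = malloc((n ? n : 1) * sizeof *dst);
    if (dst == NULL)
        return NULL;
    for (size_t i = 0; i < n; ++i)
        dst[i] = (float)src[i];   /* value conversion, not a pointer cast */
    return dst;
}

/* Intended use (assumption: mkl_a holds nnz double values):
 *
 *   float *mkl_a_single = to_single(mkl_a, nnz);
 *   iparm[27] = 1;   // zero-based C index for iparm(28): single precision
 *   ... pass mkl_a_single as the matrix values in the PARDISO call ...
 *   free(mkl_a_single);
 *
 * The right-hand side b and the solution x passed in the solve phase
 * would need to be float arrays as well.
 */
```

Converting into a separate buffer keeps the original double data intact, at the cost of one extra copy of the matrix values.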

