06-17-2011 11:10 PM
I was wondering whether
mkl_dbsrmmhas a parallel implementation. I have set both
MKL_NUM_THREADSto 8 and I checked that number with
[fortran] nthr = mkl_domain_get_max_threads( MKL_BLAS )[/fortran]
However, when I check the CPU usage with top it never goes above 100%. I remember seeing higher CPU usage (400%) when using
We really need the parallellism because we are operating on some 6000x6000 matrices and a single core is really slow.
Thanks a lot,
06-24-2011 03:23 PM
I do seethe User Guide says it should be threaded, so we can check if that is accurate. The scaling will depend on the sparsity pattern so it's possibleyour case is not optimal.Is performance pegged at 100% or just always below it?