Link Copied
that's because of RCI ISS routines (incliding dcg) are not threaded. regarding ddiasymv - the sparse matrix multiplication typically is memory bandwidth limited, with a high cache miss rate. In such cases pretty difficult to reach the good scalability.
--Gennady
that's because of RCI ISS routines (incliding dcg) are not threaded. regarding ddiasymv - the sparse matrix multiplication typically is memory bandwidth limited, with a high cache miss rate. In such cases pretty difficult to reach the good scalability.
--Gennady
For more complete information about compiler optimizations, see our Optimization Notice.