I am using mkl_?coosv, specifically the d (double-precision) variant, and compiled the program with 'ifort -openmp -mkl'. I have set 'mkl_omp_num_threads' and also 'omp_proc_bind=true'. I tested it with three different thread counts (4, 8, and 16) and got the following timings: 0.36, 0.3, and 0.32. I am running it on a machine with 16 cores. Are these timings reasonable, or is there anything else I should be doing before the runs? Thanks.
None of the triangular solver routines are threaded for now, which is why the timings barely change with the thread count. If you call this routine many times in your application, we would recommend converting from COO to CSR format once and then calling mkl_?csrsv. It would be faster because the COO routine performs a COO -> CSR conversion internally, so converting once up front avoids repeating that work.
--Gennady
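To make the suggestion above concrete, here is a minimal C sketch of the pattern: convert COO to CSR once, then reuse the CSR arrays for every triangular solve. The helper names `coo_to_csr` and `csr_lower_solve` are hypothetical and are not MKL routines. The sketch assumes zero-based indices, a non-transposed lower-triangular matrix, and a stored diagonal. It only illustrates what the conversion and the solve involve, not how MKL implements them; in a real application the MKL conversion routine and mkl_?csrsv would take their place.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical helper (not an MKL routine): convert a zero-based COO matrix
   (row, col, val) with nnz entries into CSR arrays.
   row_ptr must hold n + 1 ints; col_idx and csr_val must hold nnz entries.
   Entries within a row keep their COO input order. */
void coo_to_csr(int n, int nnz, const int *row, const int *col,
                const double *val, int *row_ptr, int *col_idx, double *csr_val)
{
    /* Count the entries in each row. */
    for (int i = 0; i <= n; ++i) row_ptr[i] = 0;
    for (int k = 0; k < nnz; ++k) {
        assert(row[k] >= 0 && row[k] < n);
        row_ptr[row[k] + 1]++;
    }
    /* Prefix sum turns the counts into row start offsets. */
    for (int i = 0; i < n; ++i) row_ptr[i + 1] += row_ptr[i];

    /* Scatter each entry into the next free slot of its row. */
    int *next = malloc((size_t)n * sizeof *next);
    assert(next != NULL);
    for (int i = 0; i < n; ++i) next[i] = row_ptr[i];
    for (int k = 0; k < nnz; ++k) {
        int dst = next[row[k]]++;
        col_idx[dst] = col[k];
        csr_val[dst] = val[k];
    }
    free(next);
}

/* Hypothetical helper: solve L x = b by forward substitution, where L is
   lower triangular in CSR form. Assumes every diagonal entry is stored;
   a missing diagonal is treated as 1.0. Entries above the diagonal are ignored. */
void csr_lower_solve(int n, const int *row_ptr, const int *col_idx,
                     const double *csr_val, const double *b, double *x)
{
    for (int i = 0; i < n; ++i) {
        double sum = b[i];
        double diag = 1.0;
        for (int p = row_ptr[i]; p < row_ptr[i + 1]; ++p) {
            int j = col_idx[p];
            if (j < i) sum -= csr_val[p] * x[j];   /* already-solved unknowns */
            else if (j == i) diag = csr_val[p];    /* diagonal entry */
        }
        x[i] = sum / diag;
    }
}
```

The point of the split is that `coo_to_csr` runs once, while `csr_lower_solve` can then be called for each new right-hand side without paying for the conversion again. Note that row i of the solve depends on earlier rows, which is also why this kind of routine is hard to thread.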