I was wondering: are individual MKL calls parallelized internally, or do separate MKL calls run in parallel with each other? For example, I have code similar to this:
for (int i=0; i<N; i++)
    mkl_dot_product_call();
Is each mkl_dot_product_call() executed with multiple threads, or will multiple threads each execute their own mkl_dot_product_call()?
Please take a look at these documented cases: "Calling Intel MKL Functions from Multi-threaded Applications" (https://software.intel.com/en-us/mkl-linux-developer-guide-calling-intel-mkl-functions-from-multi-threaded-applications) and "OpenMP Threaded Functions and Problems" (https://software.intel.com/en-us/mkl-linux-developer-guide-openmp-threaded-functions-and-problems).
You will find that the answer to your questions depends on your usage model.
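To make the two usage models concrete, here is a minimal sketch in C. The names dot_product_call, dots_serial_loop, dots_parallel_loop and the USE_MKL build flag are made up for illustration and are not part of MKL; only cblas_ddot is a real MKL/CBLAS routine. As far as I understand the pages linked above, in the first model any threading happens inside each MKL call (if that routine is threaded at all for your problem size and CPU), while in the second model your own OpenMP threads each make a separate call and MKL by default runs each of those calls on a single thread.

```c
#include <assert.h>
#include <stddef.h>

#ifdef USE_MKL              /* hypothetical build flag, not an MKL macro */
#include <mkl.h>
#endif

/* Hypothetical stand-in for mkl_dot_product_call() from the question. */
static double dot_product_call(const double *x, const double *y, int n)
{
    assert(x != NULL && y != NULL && n >= 0);
#ifdef USE_MKL
    return cblas_ddot(n, x, 1, y, 1);   /* real CBLAS routine shipped with MKL */
#else
    double sum = 0.0;                   /* plain fallback so the sketch builds without MKL */
    for (int k = 0; k < n; k++)
        sum += x[k] * y[k];
    return sum;
#endif
}

/* Usage model 1: serial loop. Calls are issued one after another;
   any parallelism would come from inside the library call itself. */
void dots_serial_loop(const double *x, const double *y, int n,
                      double *out, int count)
{
    for (int i = 0; i < count; i++)
        out[i] = dot_product_call(x + (size_t)i * n, y + (size_t)i * n, n);
}

/* Usage model 2: the application threads the loop. Each OpenMP thread
   makes its own call on its own slice of the data. Without OpenMP
   enabled at compile time this is just the serial loop again. */
void dots_parallel_loop(const double *x, const double *y, int n,
                        double *out, int count)
{
#ifdef _OPENMP
    #pragma omp parallel for
#endif
    for (int i = 0; i < count; i++)
        out[i] = dot_product_call(x + (size_t)i * n, y + (size_t)i * n, n);
}
```

Both functions compute the same results; the difference is only where the threads come from. For many short vectors the second model is usually the one to try first, but it is worth measuring both on your machine.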