When I SSH into that node and run the "top" command, the CPU usage is close to 400%. If I use 1 processor, the CPU usage is close to 100%, but the solve takes the same amount of time. If I use 2 nodes, the solution is twice as fast. My matrix type (mtype) is 6, and I'm linking against these libraries:
-lmkl_intel_lp64 -lmkl_core -lmkl_intel_thread -lguide -lmkl_solver
Why isn't PARDISO any faster with 4 processors than with 1? I'm guessing there's a setting I've missed. Thanks.