I have a coarray program that runs fine on a small 4-core desktop, even when oversubscribed well beyond the number of cores. This is with parallel_studio_xe_2017.0.035.
On a single KNL node (64 cores), using the cluster edition with the same ifort as above, it runs perfectly with up to 15 images, but it hangs forever at 16 or more images without running any code. I'm compiling with -coarray=shared and setting the number of images with, say, -coarray-num-images=16 for 16 images. I can't figure out what configuration detail is missing that makes it hang at 16 or more images. Any insight would be appreciated!
Many thanks, Ted
I had missed the thread URL below because it was in intel-visual-fortran-compiler-for-windows, sorry.
The problem is the same on Linux with Intel cluster edition 17 using ifort coarrays: it hangs when images >= 16, and the interim fix suggested in that thread (setting I_MPI_COLL_INTRANODE=pt2pt) works here too. I guess this means that shared memory mode fails when images >= 16 but MPI is fine.
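For anyone who lands here with the same hang, this is roughly how I apply the workaround. It is only a sketch: hello_images.f90 and hello_images are placeholder names, not my actual program, and the guard around ifort is there just so the snippet does nothing harmful on a machine without the compiler.

```shell
# Interim workaround from the other thread: use point-to-point
# intranode collectives instead of the shared-memory ones.
export I_MPI_COLL_INTRANODE=pt2pt

# Same compile flags as in my original post.
# hello_images.f90 / hello_images are placeholder names (assumption).
if command -v ifort >/dev/null 2>&1; then
    ifort -coarray=shared -coarray-num-images=16 hello_images.f90 -o hello_images
    ./hello_images
else
    echo "ifort not found on PATH; skipping compile and run"
fi
```

The variable has to be set in the environment of the shell that launches the executable; recompiling is not what makes the difference.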