I am a beginner in linux; i am using quantum espresso software; The cluster we have at University has three types of partition short (for jobs within an hour), long (jobs within 4 days) and superlong (jobs within 10 days). each node has 8 processors; however recently when I am running a job, only the SHORT partition works properly; this is not very useful for me as I need to run longer jobs. when i run the other two (long and superlong) I get several errors: running on more than one node say : 16 processors (2 nodes) producesan error:
"veredas60:30606: open_hca: getaddr_netdev ERROR: Connection refused. Is ib0 configured?
veredas60:30606: open_hca: getaddr_netdev ERROR: Connection refused. Is ib1 configured?