Hi, I'm testing a small cluster of 8 nodes with Intel Cluster Checker, and I get failures whose output does not really explain the underlying issue:
Basic network connectivity, (ping).....................................................................................................................Failed
[010100] subtest 'ping request delay is less than 100 ms' passed
node: hadoop5: 0.059 ms
node: hadoop4: 0.070 ms
node: hadoop3: 0.075 ms
node: hadoop2: 0.082 ms
[010000] subtest 'shall contain at least 4 compute nodes per group' passed
node: hadoop2: 4 computes
[010301] subtest 'shall contain at least a head node' failed
node: hadoop2: 0 head
Node remote connectivity, (remote_login)..............................................................................................................Skipped
failed dependencies: ping
My node list file is the following:
hadoop1 # type: head
hadoop2
hadoop3
hadoop4
hadoop5
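As a sanity check that does not depend on Cluster Checker, a minimal Python sketch like the one below can count how many nodes of each type a node list declares. It assumes one hostname per line, an optional trailing `# type: <name>` annotation as in the list above, and that unannotated nodes count as compute nodes. The function name `count_node_types` and the default type are illustrative assumptions, not part of the tool.

```python
import re
from collections import Counter

# Hypothetical helper, not part of Intel Cluster Checker.
# Assumes one hostname per line with an optional "# type: <name>" comment.
def count_node_types(nodelist_text, default_type="compute"):
    counts = Counter()
    for raw in nodelist_text.splitlines():
        # Split the hostname from any trailing comment.
        host, _, comment = raw.partition("#")
        if not host.strip():
            continue  # skip blank and comment-only lines
        # Use the annotated type if present, otherwise the assumed default.
        match = re.search(r"type:\s*(\S+)", comment)
        node_type = match.group(1) if match else default_type
        counts[node_type] += 1
    return dict(counts)

SAMPLE = """hadoop1 # type: head
hadoop2
hadoop3
hadoop4
hadoop5
"""

print(count_node_types(SAMPLE))
```

If a count like this shows one head node while the tool still reports `0 head`, the file contents themselves are probably not the problem, and it may be worth checking which node list file the tool actually reads.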
I also get the following error from the mpi_local test:
Intel(R) MPI Library intranode runtime, (mpi_local)....................................................................................................Failed
[160201] subtest 'mpi runtime' failed
node: hadoop[2-5]: no mpirun output for device shm
I get the same failure when I set the device to tcp in an XML file.
Any help would be appreciated.
Thanks,
Chris