Community
cancel
Showing results for 
Search instead for 
Did you mean: 
Davis__Timothy
Beginner
88 Views

Most of the plat8153 nodes are down

Out of 12 plat8153 nodes (n145 to n156), only 4 are up and running.  Nodes n145 and n146 are offline, and have been offline for a long time.  Nodes n151 to n156 are down, and have been down for several days.

Why is that?  Can you bring up some of these plat8153 nodes?

We are using these for the Intel GAP graph algorithm benchmark effort.  I'm trying to get my final results, and only one job of mine is running.  I've had some jobs queued for 26 hours that haven't started running yet.

 

 

Tags (1)
0 Kudos
4 Replies
ArunJ_Intel
Moderator
88 Views

Hi Timothy,

Thanks for pointing this out. We have observed the same issue of nodes being offline and will be contacting the concerned team regarding this.


Arun Jose

ArunJ_Intel
Moderator
88 Views

Hi Timothy,


This issue is due to Memory failure problem. By end of this week this will be resolved.


Arun Jose

ArunJ_Intel
Moderator
88 Views

Hi Timothy,

 

Most of the plat8153 nodes are up now. Hope this resolves your issue.

Is there anything else you need help with, if not could we close this case ?

 

Arun Jose

ArunJ_Intel
Moderator
88 Views

Hi Timothy,

We are closing this case. Please feel free to raise a new thread in case of further issues.

 

Thanks

Arun Jose

Reply