[Sigma-users] Sigma compute nodes down
Mats Kronberg
kronberg at nsc.liu.se
Tue Jan 14 05:51:56 CET 2020
Dear Sigma users,
Due to a partial power failure (one of three power subcentrals powering
Tetralith and Sigma), most compute nodes in Sigma (all 104 "thin" nodes)
went offline at 05:10 today.
All running jobs on these nodes failed.
Until power is restored to these compute nodes, only the four "fat"
compute nodes and the Sigma login node are available.
Estimated downtime: 4h (best case - after a quick inspection we press
the reset switch on the subcentral and boot the nodes) to 1-2 days
(something is damaged and we need to make repairs or move some compute
nodes from Tetralith to Sigma).
--
Mats Kronberg, NSC Support <support at nsc.liu.se>
More information about the Sigma-users
mailing list