failed disk on fsn-node-01
Date: Thu, 11 May 2023 07:29:41 +0000
From: mdadm monitoring <root@fsn-node-01.torproject.org>
To: root@fsn-node-01.torproject.org
Subject: Fail event on /dev/md/boot:fsn-node-01
This is an automatically generated mail message from mdadm
running on fsn-node-01
A Fail event had been detected on md device /dev/md/boot.
It could be related to component device /dev/nvme0n1p2.
Faithfully yours, etc.
P.S. The /proc/mdstat file currently contains the following:
Personalities : [raid1] [linear] [multipath] [raid0] [raid6] [raid5] [raid4] [raid10]
md125 : active raid1 sda1[0] sdb1[1]
      9766303744 blocks super 1.2 [2/2] [UU]
      bitmap: 6/73 pages [24KB], 65536KB chunk

md126 : active raid1 nvme1n1p3[1]
      936512512 blocks super 1.2 [2/1] [_U]
      bitmap: 5/7 pages [20KB], 65536KB chunk

md127 : active raid1 nvme1n1p2[1] nvme0n1p2[2](F)
      523712 blocks super 1.2 [2/1] [_U]

unused devices: <none>
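
For a quick read of the mdstat dump above, here is a minimal Python sketch that lists degraded arrays and any members flagged `(F)`. The helper name `degraded_arrays` and the regexes are my own and only cover the line shapes seen in this mail (they would miss, for example, inactive arrays), so treat it as a rough aid rather than a general mdstat parser.

```python
import re

# Array entries copied from the mail above.
MDSTAT = """\
Personalities : [raid1] [linear] [multipath] [raid0] [raid6] [raid5] [raid4] [raid10]
md125 : active raid1 sda1[0] sdb1[1]
9766303744 blocks super 1.2 [2/2] [UU]
bitmap: 6/73 pages [24KB], 65536KB chunk
md126 : active raid1 nvme1n1p3[1]
936512512 blocks super 1.2 [2/1] [_U]
bitmap: 5/7 pages [20KB], 65536KB chunk
md127 : active raid1 nvme1n1p2[1] nvme0n1p2[2](F)
523712 blocks super 1.2 [2/1] [_U]
unused devices: <none>
"""

# "md127 : active raid1 nvme1n1p2[1] nvme0n1p2[2](F)"
ARRAY_RE = re.compile(r"^(md\d+) : (\w+) (\w+) (.*)$")
# "[2/1] [_U]" means 2 devices expected, 1 currently active
STATUS_RE = re.compile(r"\[(\d+)/(\d+)\] \[([U_]+)\]")
# "nvme0n1p2[2](F)" gives the device name plus optional flags such as (F)
MEMBER_RE = re.compile(r"(\w+)\[\d+\]((?:\([A-Z]\))*)")


def degraded_arrays(text):
    """Map each degraded array to its device counts and failed members."""
    result = {}
    current, members = None, ""
    for raw in text.splitlines():
        line = raw.strip()  # tolerate indented or flattened mdstat output
        header = ARRAY_RE.match(line)
        if header:
            current, members = header.group(1), header.group(4)
            continue
        status = STATUS_RE.search(line)
        if status and current:
            expected, active = int(status.group(1)), int(status.group(2))
            if active < expected:
                failed = [name for name, flags in MEMBER_RE.findall(members)
                          if "(F)" in flags]
                result[current] = {"expected": expected,
                                   "active": active,
                                   "failed": failed}
            current = None
    return result


if __name__ == "__main__":
    for name, info in degraded_arrays(MDSTAT).items():
        print(name, info)
```

On this dump it reports md126 and md127 as degraded. Only md127 carries a member marked `(F)` (nvme0n1p2); md126 simply lists no nvme0n1 partition at all, which suggests its member on that drive was already dropped from the array.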
plan:
- evacuate node
- rotate master
- check previous incidents (#40805 (closed), #40818 (closed)) to see whether the same drive failed
- shut down node
- ask Hetzner for a replacement
/cc @lavamind