chi-node-05 has hardware (memory, disk) issues
when trying to install chi-node-05 (#40365 (closed)), i had many problems. but most worrying is what seems to be a hardware issue.
the iDRAC says:
Persistent correctable memory errors detected on a memory device at location DIMM_A2.
I believe DIMM_A5 also has issues based on BIOS warnings at boot.
In general, the console is barely useable and i am not sure the machine will be reliable in any way. need to talk with cymru about this.
Update: there's also warnings from smartd...
From: root <root@chi-node-05.torproject.org>
Subject: SMART error (ErrorCount) detected on host: chi-node-05
To: root@chi-node-05.torproject.org
Date: Thu, 09 Sep 2021 19:51:59 +0000
This message was generated by the smartd daemon running on:
host name: chi-node-05
DNS domain: torproject.org
The following warning/error was logged by the smartd daemon:
Device: /dev/bus/0 [megaraid_disk_00] [SAT], ATA error count increased from 0 to 2
Device info:
ST9500620NS, S/N:9XF0AC2F, WWN:5-000c50-0356dad0e, FW:AA02, 500 GB
For details see host's SYSLOG.
You can also use the smartctl utility for further investigation.
Another message will be sent in 24 hours if the problem persists.