Skip to content

chi-node-05 has hardware (memory, disk) issues

when trying to install chi-node-05 (#40365 (closed)), i had many problems. but most worrying is what seems to be a hardware issue.

the iDRAC says:

Persistent correctable memory errors detected on a memory device at location DIMM_A2.

I believe DIMM_A5 also has issues based on BIOS warnings at boot.

In general, the console is barely useable and i am not sure the machine will be reliable in any way. need to talk with cymru about this.

Update: there's also warnings from smartd...

From: root <root@chi-node-05.torproject.org>
Subject: SMART error (ErrorCount) detected on host: chi-node-05
To: root@chi-node-05.torproject.org
Date: Thu, 09 Sep 2021 19:51:59 +0000

This message was generated by the smartd daemon running on:

   host name:  chi-node-05
   DNS domain: torproject.org

The following warning/error was logged by the smartd daemon:

Device: /dev/bus/0 [megaraid_disk_00] [SAT], ATA error count increased from 0 to 2

Device info:
ST9500620NS, S/N:9XF0AC2F, WWN:5-000c50-0356dad0e, FW:AA02, 500 GB

For details see host's SYSLOG.

You can also use the smartctl utility for further investigation.
Another message will be sent in 24 hours if the problem persists.
Edited by anarcat
To upload designs, you'll need to enable LFS and have an admin enable hashed storage. More information