Quellcode durchsuchen

:memo: Replace a failed diske in a RAID config

DricomDragon vor 2 Jahren
Ursprung
Commit
1d2957883e
1 geänderte Dateien mit 46 neuen und 1 gelöschten Zeilen
  1. 46 1
      Guides/nextcloud.md

+ 46 - 1
Guides/nextcloud.md

@@ -160,4 +160,49 @@ Configure HTTPS on reverse-proxy.
 
 Apps are not working (example from my own migration : *Contacts* and *Tasks* not working because of javascript file not found.
 
-I had some errors after `rsync` about those files. That might be related.
+I had some errors after `rsync` about those files. That might be related.
+
+## Manage disks
+
+### Check health
+
+#### Check RAID
+
+```
+cat /proc/mdstat
+```
+
+#### Check individual HDD
+
+```
+smartctl --scan
+sudo smartctl -a /dev/sdc
+```
+
+If one the tested disk is failing, write down its serial number and continue with the replacement procedure.
+
+### Replace the disk
+
+0. RAID disconnection
+	01. `cat /proc/mdstat`
+	02. `mdadm --manage /dev/md0 --fail /dev/sdy1`
+	03. `cat /proc/mdstat`
+	04. `mdadm --manage /dev/md0 --remove /dev/sdy1`
+	05. `cat /proc/mdstat`
+1. Physical replacement
+	11. Shut down the server
+	12. Spot the HDD to remove based on its serial number
+	13. Replace it by the new one
+	14. Boot the server
+	15. Mark the retrieved HDD as dead
+2. RAID update
+	21. `cat /proc/mdstat`
+	22. Become superuser
+	23. `fdisk -l` -> spot the device with no partition `/dev/sdz`
+	24. Copy partition information
+		241. For MBR partitioning (legacy) : `sfdisk -d /dev/sdx | sfdisk /dev/sdz`
+		242. For GPT partitioning : `sgdisk -R /dev/sdz /dev/sdx` then `sgdisk -G /dev/sdz`
+	25. `fdisk -l` -> check that `/dev/sdz` is partitioned
+	26. `mdadm --manage /dev/md0 --add /dev/sdz1`
+	27. `cat /proc/mdstat`
+	28. Hope you first disk won't die during recovery