You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: pages/dedibox-hardware/troubleshooting/diagnose-defective-disk.mdx
+261-5Lines changed: 261 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -10,7 +10,7 @@ dates:
10
10
validation: 2025-02-06
11
11
posted: 2021-11-02
12
12
categories:
13
-
- dedibox-servers
13
+
- dedibox-hardware
14
14
---
15
15
16
16
`Smartmontools` is a set of tools that controls and monitors a disk using the **SMART** standard (Self-Monitoring, Analysis, and Reporting Technology System).
@@ -56,7 +56,7 @@ On these servers, the physical disks are referred to as `sg*` devices.
56
56
As the devices can be positioned a little further away, do not hesitate to test up to `sg5` if you do not have conclusive results.
57
57
</Message>
58
58
59
-
### Dell PERC H310 controller
59
+
### Dell PERC controller (H310, H700, H710, H730-P, LSI9361)
60
60
61
61
Two possibilities exist for this type of controller:
62
62
@@ -83,7 +83,7 @@ The first one displays the status of the RAID volume, whilst the second one disp
83
83
smartctl -s on -a -d megaraid,${i} ${DEVICE} -T permissive
84
84
done
85
85
```
86
-
## How to check an HP multi-disk server
86
+
## How to check an HP multi-disk server (P410, P420, P222)
87
87
88
88
1. Log into your server using SSH.
89
89
2. Run the following command to display the status of the RAID:
@@ -121,7 +121,7 @@ The first one displays the status of the RAID volume, whilst the second one disp
121
121
122
122
### How to configure SMARTD
123
123
124
-
Below, you find an example of a single-disk server installed on a Debian-like machine.
124
+
Below, you will find an example of a single-disk server installed on a Debian-like machine.
125
125
126
126
<Messagetype="note">
127
127
The following commands are to be executed as `root` or via `sudo`.
@@ -193,4 +193,260 @@ Local Time is: Fri Oct 29 11:20:27 2010 CEST
193
193
194
194
<Messagetype="tip">
195
195
For more information on Smartmontools, refer to the [official documentation](https://www.smartmontools.org/wiki/TocDoc).
196
-
</Message>
196
+
</Message>
197
+
198
+
<Tabsid="Smart data examples">
199
+
<TabsTablabel="HDD example">
200
+
The example below shows SMART data for the HDD storage type:
201
+
202
+
```
203
+
=== START OF INFORMATION SECTION ===
204
+
Model Family: Seagate Constellation ES.3
205
+
Device Model: ST1000NM0033-9ZM173
206
+
Serial Number: Z1W2P3WL
207
+
LU WWN Device Id: 5 000c50 0790721c5
208
+
Add. Product Id: DELL(tm)
209
+
Firmware Version: GA0A
210
+
User Capacity: 1 000 204 886 016 bytes [1,00 TB]
211
+
Sector Size: 512 bytes logical/physical
212
+
Rotation Rate: 7200 rpm
213
+
Form Factor: 3.5 inches
214
+
Device is: In smartctl database [for details use: -P show]
215
+
ATA Version is: ACS-2 (minor revision not indicated)
216
+
SATA Version is: SATA 3.0, 3.0 Gb/s (current: 3.0 Gb/s)
217
+
Local Time is: Wed Jan 22 11:26:49 2025 CET
218
+
SMART support is: Available - device has SMART capability.
219
+
SMART support is: Enabled
220
+
221
+
=== START OF READ SMART DATA SECTION ===
222
+
SMART overall-health self-assessment test result: PASSED
223
+
224
+
General SMART Values:
225
+
Offline data collection status: (0x82) Offline data collection activity
226
+
was completed without error.
227
+
Auto Offline Data Collection: Enabled.
228
+
Self-test execution status: ( 0) The previous self-test routine completed
If the `RAW_VALUE` column for `Reallocated_Sector_Ct` or ` Runtime_Bad_Block` or `Current_Pending_Sector` is > 5, the disk can already be considered as unhealthy. If it is > 20, the disk is out of order.
380
+
</TabsTab>
381
+
<TabsTablabel="NVMe example">
382
+
The example below shows SMART data for the NVMe storage type:
SMART overall-health self-assessment test result: PASSED
420
+
421
+
SMART/Health Information (NVMe Log 0x02)
422
+
Critical Warning: 0x00
423
+
Temperature: 42 Celsius
424
+
Available Spare: 100%
425
+
Available Spare Threshold: 10%
426
+
Percentage Used: 1%
427
+
Data Units Read: 5,718,407 [2.92 TB]
428
+
Data Units Written: 9,717,865 [4.97 TB]
429
+
Host Read Commands: 43,061,485
430
+
Host Write Commands: 142,156,172
431
+
Controller Busy Time: 5,906
432
+
Power Cycles: 1,315
433
+
Power On Hours: 2,261
434
+
Unsafe Shutdowns: 56
435
+
Media and Data Integrity Errors: 0
436
+
Error Information Log Entries: 0
437
+
Warning Comp. Temperature Time: 0
438
+
Critical Comp. Temperature Time: 0
439
+
Temperature Sensor 1: 44 Celsius
440
+
Temperature Sensor 2: 42 Celsius
441
+
442
+
Error Information (NVMe Log 0x01, 16 of 256 entries)
443
+
No Errors Logged
444
+
445
+
Read Self-test Log failed: Invalid Field in Command (0x002)
446
+
```
447
+
</TabsTab>
448
+
</Tabs>
449
+
450
+
<Messagetype="note">
451
+
If you encounter **Health status: Failed** or **Failing Now**, the disk is considered out of order. Make sure that you have backups, then open a [support ticket](/account/how-to/open-a-support-ticket/) and ask for the disk to be replaced, indicating the serial number with the result of the `smartctl` command.
0 commit comments