SMART device temperature critical critical
Device temperature critical on {{ $labels.instance }} drive {{ $labels.device }} over 70°C
>>>
(
max_over_time
(
{temperature_type="current"} [5m]) unless on (instance, device)
{temperature_type="drive_trip"}) > 70
The rule looks at each device’s smartctl_device_temperature{temperature_type="current"} metric, takes the highest value observed in the last 5 minutes, and ignores the device if a smartctl_device_temperature{temperature_type="drive_trip"} metric exists for the same instance and device. If the resulting max current temperature exceeds 70 °C, the alert fires for that instance/device.
Get Alert✕
Download
Copy to Clipboard