SMART device temperature critical
critical

Description
Device temperature critical on {{ $labels.instance }} drive {{ $labels.device }} over 70°C
Query
(max_over_time(smartctl_device_temperature{temperature_type="current"}[5m]) unless on (instance, device) smartctl_device_temperature{temperature_type="drive_trip"}) > 70
Query Explanation

The rule takes each device's smartctl_device_temperature{temperature_type="current"} series and computes the highest value observed over the last 5 minutes. The unless on (instance, device) clause then drops any device for which a smartctl_device_temperature{temperature_type="drive_trip"} series exists with the same instance and device labels. If the remaining maximum current temperature is above 70°C, the alert fires for that instance/device pair.
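
As a rough sketch, the query above could be wrapped in a Prometheus rule file like the one below. The expression, severity, and description text come from this page. The group name and alert name are assumptions chosen for illustration, and no `for` duration is set because the page does not specify one.

```yaml
groups:
  - name: smartctl-exporter                      # hypothetical group name
    rules:
      - alert: SmartDeviceTemperatureCritical    # assumed name, derived from the title
        # Max current temperature over 5m, skipping devices that expose a drive_trip series
        expr: (max_over_time(smartctl_device_temperature{temperature_type="current"}[5m]) unless on (instance, device) smartctl_device_temperature{temperature_type="drive_trip"}) > 70
        labels:
          severity: critical
        annotations:
          description: "Device temperature critical on {{ $labels.instance }} drive {{ $labels.device }} over 70°C"
```

A file like this can be validated with `promtool check rules` before it is loaded into Prometheus.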
