SMART device temperature nearing trip value
warning

Description Device temperature at 80% of trip value on {{ $labels.instance }} drive {{ $labels.device }})
Query
>>>
	
				
					
					
						max_over_time
					
				
			
				
					(
				
			
				
					
				
			
				
					{temperature_type="current"} [10m]) >= on(device, instance) (
				
			
				
					
				
			
				
					{temperature_type="drive_trip"} * .80)
				
			
    
Query Explanation

The rule looks at the highest observed current temperature of each SMART device over the past 10 minutes (max_over_time(...[10m])). If that maximum is ≥ 80 % of the device’s configured trip temperature (smartctl_device_temperature{temperature_type="drive_trip"} * 0.80), the alert fires, indicating the device on the given instance and device label is approaching its critical temperature threshold.

Get Alert
Download
Copy to Clipboard