KubeStateMetricsListErrors
critical

Description

kube-state-metrics is experiencing errors at an elevated rate in list operations. As a result, it is likely unable to expose metrics about Kubernetes objects correctly, or at all.
Query for 15m
(sum(rate({job="kube-state-metrics",result="error"}[5m])) by (cluster)
  /
sum(rate({job="kube-state-metrics"}[5m])) by (cluster))
> 0.01

Query Explanation

The rule computes, per cluster, the error ratio of list operations performed by kube-state-metrics: the 5-minute rate of list operations with result="error" divided by the 5-minute rate of all list operations. The alert fires when this ratio stays above 0.01 (1%) for 15 minutes.
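
The threshold arithmetic can be sketched in Python as follows. This is only an illustration of the ratio comparison, not how Prometheus evaluates the rule: the function name `firing_clusters`, the cluster names, and the rate values are all hypothetical, and the sketch ignores the 15-minute hold period.

```python
# Illustrative sketch of the error-ratio check. All names and numbers below
# are hypothetical and are not taken from a real cluster.
THRESHOLD = 0.01  # 1%, matching the rule's "> 0.01" comparison


def firing_clusters(error_rate, total_rate, threshold=THRESHOLD):
    """Return {cluster: error_ratio} for clusters whose ratio exceeds threshold.

    error_rate and total_rate map a cluster name to a per-second rate of
    list operations (errors only, and all results, respectively).
    """
    result = {}
    for cluster, total in total_rate.items():
        if total == 0:
            # No list traffic: the ratio is undefined, so treat it as not firing.
            continue
        ratio = error_rate.get(cluster, 0.0) / total
        if ratio > threshold:
            result[cluster] = ratio
    return result


# Hypothetical per-second rates over the last 5 minutes.
error_rate = {"prod": 0.02, "staging": 0.001}
total_rate = {"prod": 1.0, "staging": 2.0}

firing = firing_clusters(error_rate, total_rate)
print(firing)
```

Here "prod" has a ratio of 0.02 / 1.0 = 0.02 and exceeds the threshold, while "staging" has 0.001 / 2.0 = 0.0005 and does not. In the real rule the condition would additionally have to hold for 15 minutes before the alert fires.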
