Kubernetes API server latency
warning

Description Kubernetes API server has a 99th percentile latency of {{ $value }} seconds for {{ $labels.verb }} {{ $labels.resource }}.
Query for 2m
>>>
	
				
					
					
						histogram_quantile
					
				
			
				
					(0.99, 
				
			
				
					
					
						sum
					
				
			
				
					(
				
			
				
					
					
						rate
					
				
			
				
					(
				
			
				
					
				
			
				
					{verb!~"(?:CONNECT|WATCHLIST|WATCH|PROXY)"} [10m])) WITHOUT (subresource)) > 1
				
			
    
Query Explanation

The query computes the 99th‑percentile latency (in seconds) of all apiserver requests, ignoring CONNECT, WATCHLIST, WATCH and PROXY verbs, by summing the per‑second rate of the histogram buckets over the last 10 minutes and applying histogram_quantile(0.99). The alert fires whenever this 99th‑percentile latency exceeds 1 second for a specific verb and resource.

Get Alert
Download
Copy to Clipboard