Kubernetes API server latency
warning
Get Alert

Description Kubernetes API server has a 99th percentile latency of {{ $value }} seconds for {{ $labels.verb }} {{ $labels.resource }}.

Query for 2m

>>>
	
				
					
					
						histogram_quantile
					
				
			
				
					(0.99, 
				
			
				
					
					
						sum
					
				
			
				
					(
				
			
				
					
					
						rate
					
				
			
				
					(
				
			
				
					
				
			
				
					{verb!~"(?:CONNECT|WATCHLIST|WATCH|PROXY)"} [10m])) WITHOUT (subresource)) > 1

Query Explanation

The query computes the 99th‑percentile latency (in seconds) of all apiserver requests, ignoring CONNECT, WATCHLIST, WATCH and PROXY verbs, by summing the per‑second rate of the histogram buckets over the last 10 minutes and applying histogram_quantile(0.99). The alert fires whenever this 99th‑percentile latency exceeds 1 second for a specific verb and resource.

Source Awesome Prometheus Alerts

Get Alert✕

Download

Copy to Clipboard

Kubernetes API server latency warning Get Alert Download Copy to Clipboard

Kubernetes API server latency
warning
Get Alert