Etcd high number of failed GRPC requests
warning

Description: More than 1% of GRPC requests to Etcd are failing
Query for 2m
sum(rate({grpc_code!="OK"}[1m])) BY (grpc_service, grpc_method) / sum(rate([1m])) BY (grpc_service, grpc_method) > 0.01
Query Explanation

The query computes, for each grpc_service/grpc_method pair, the 1-minute rate of failed GRPC calls (grpc_code!="OK") divided by the 1-minute rate of all GRPC calls, and matches when this failure ratio exceeds 0.01 (1%). In other words, the alert fires once any service-method pair has kept a failure ratio above 1% for the 2m hold period.
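
To make the arithmetic concrete, here is a minimal Python sketch of the same ratio computed outside Prometheus. It assumes the per-second rates are already available as a plain dictionary; the function name failing_pairs, the dictionary layout, and the sample values are hypothetical and only stand in for what the rate(...[1m]) terms would return.

```python
from collections import defaultdict

THRESHOLD = 0.01  # 1% failure ratio, matching the "> 0.01" in the query


def failing_pairs(rates, threshold=THRESHOLD):
    """Return {(service, method): failure_ratio} for pairs above threshold.

    `rates` maps (grpc_service, grpc_method, grpc_code) to a per-second
    request rate (hypothetical input, not a Prometheus API).
    """
    failed = defaultdict(float)
    total = defaultdict(float)
    for (service, method, code), rate in rates.items():
        key = (service, method)
        total[key] += rate          # denominator: all calls
        if code != "OK":
            failed[key] += rate     # numerator: non-OK calls only
    # Pairs without any failing series never appear in the numerator,
    # so they cannot match, which mirrors the division in the query.
    return {
        key: failed[key] / total[key]
        for key in failed
        if total[key] > 0 and failed[key] / total[key] > threshold
    }


# Illustrative sample data (made-up rates).
sample = {
    ("etcdserverpb.KV", "Range", "OK"): 99.0,
    ("etcdserverpb.KV", "Range", "Unavailable"): 2.0,
    ("etcdserverpb.KV", "Put", "OK"): 50.0,
    ("etcdserverpb.KV", "Put", "Unavailable"): 0.1,
}
result = failing_pairs(sample)
```

With this sample, Range has a failure ratio of 2/101 (about 2%) and is returned, while Put sits at roughly 0.2% and is not. The sketch covers only the ratio and threshold; the 2m hold period is applied by the alerting rule, not by the expression.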
