Kubernetes Job failed warning
Job {{ $labels.namespace }}/{{ $labels.job_name }} failed to complete
kube_job_status_failed > 0
The query kube_job_status_failed > 0 evaluates the kube_job_status_failed metric, which reports the number of failed pods for each Kubernetes Job. The alert fires for any Job whose value is greater than zero, that is, any Job with at least one failed pod. The namespace and job_name labels on the resulting series identify the affected Job.
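For reference, the query and message above could be combined into a Prometheus alerting rule roughly as follows. This is a minimal sketch, not the rule as published: the group name, alert name, and the for duration are assumptions chosen for illustration, and the severity label is inferred from the "warning" in the title.

```yaml
groups:
  - name: kubernetes-jobs          # hypothetical group name
    rules:
      - alert: KubernetesJobFailed # assumed alert name
        # Fires for every Job that has at least one failed pod.
        expr: kube_job_status_failed > 0
        for: 0m                    # assumption: fire immediately, with no pending period
        labels:
          severity: warning        # inferred from the title
        annotations:
          summary: "Kubernetes Job failed"
          description: "Job {{ $labels.namespace }}/{{ $labels.job_name }} failed to complete"
```

Raising the for value would delay the alert until the condition has held for that long, which may reduce noise from Jobs that are cleaned up shortly after failing.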