Prometheus 2.0 Overview
5,860,336 5.0 (2 reviews)

Created 11/9/2017
Updated 12/23/2020
Revision 2
Grafana Version >=4.5.0-beta1
Datasources
Prometheus

Description

This dashboard gives a snapshot of Prometheus performance and reliability across three areas: resource usage, query health, and data ingestion. CPU and memory utilization panels track node health, query latency and error-rate panels monitor the Prometheus query path, and ingestion-throughput panels show whether scrapes and remote writes stay within capacity. Key metrics include instance:node_cpu_seconds_total:rate10m, node_memory_bytes{type="available"}, and prometheus_rule_evaluation_duration_seconds, which surfaces rule-evaluation latency.
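
The metrics named above can also be checked outside Grafana through the Prometheus HTTP API instant-query endpoint (`/api/v1/query`). The following is a minimal sketch, not part of the dashboard itself: the helper names (`build_query_url`, `parse_instant_vector`, `fetch`) are hypothetical, and whether a given server actually exposes these series depends on its exporters and recording rules.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Expressions taken from the dashboard description. Whether your server
# exposes them is an assumption; adjust to the series you actually have.
EXPRESSIONS = [
    "instance:node_cpu_seconds_total:rate10m",
    'node_memory_bytes{type="available"}',
    "prometheus_rule_evaluation_duration_seconds",
]


def build_query_url(base_url, expr):
    """Build an instant-query URL for the Prometheus HTTP API."""
    return base_url.rstrip("/") + "/api/v1/query?" + urlencode({"query": expr})


def parse_instant_vector(payload):
    """Turn an instant-query response into a list of (labels, value) pairs."""
    if payload.get("status") != "success":
        raise ValueError("query failed: " + str(payload.get("error")))
    data = payload["data"]
    if data["resultType"] != "vector":
        raise ValueError("expected a vector, got " + data["resultType"])
    # Each sample is {"metric": {...labels...}, "value": [timestamp, "number"]}.
    return [(s["metric"], float(s["value"][1])) for s in data["result"]]


def fetch(base_url, expr, timeout=5.0):
    """Run one instant query against a live server (needs network access)."""
    with urlopen(build_query_url(base_url, expr), timeout=timeout) as resp:
        return parse_instant_vector(json.load(resp))
```

Against a reachable server, calling `fetch(base_url, expr)` for each entry in `EXPRESSIONS` returns the current samples; an empty list suggests the series does not exist on that server, in which case the corresponding dashboard panel would likely show no data either.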

Source Grafana.com