Redpanda Ops Dashboard

Created 2/18/2023
Updated 10/24/2024
Revision 3
Categories Host Metrics
Grafana Version >=9.3.6
Datasources Prometheus

A dashboard focused on Redpanda operations, with the following charts:

  • nodes up (count)
  • node uptime
  • topics (count)
  • partitions (count)
  • throughput (total)
  • leadership transfers per 5min
  • partition balance (percentage)
  • under-replicated partitions (count)
  • leaderless partitions (count)
  • storage used (percentage)
  • storage health
  • allocated memory (total, percentage)
  • CPU utilization (total, percentage)
  • Kafka RPC
  • cluster info
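
The percentage charts above reduce to simple arithmetic over gauge and counter values. The listing does not show the dashboard's actual queries, so the following is only a minimal sketch of how "storage used (percentage)" and "CPU utilization (percentage)" could be derived; the function names and the choice of inputs are assumptions made for illustration.

```python
def storage_used_percent(free_bytes: float, total_bytes: float) -> float:
    """Percentage of disk space in use, given free and total bytes."""
    if total_bytes <= 0:
        raise ValueError("total_bytes must be positive")
    return 100.0 * (1.0 - free_bytes / total_bytes)


def cpu_utilization_percent(busy_start: float, busy_end: float,
                            interval_seconds: float) -> float:
    """Average utilization between two samples of a busy-seconds counter.

    Assumption: the counter did not reset between the two samples.
    """
    if interval_seconds <= 0:
        raise ValueError("interval_seconds must be positive")
    return 100.0 * (busy_end - busy_start) / interval_seconds


# Hypothetical sample values, not taken from a real cluster.
print(storage_used_percent(free_bytes=25.0, total_bytes=100.0))
print(cpu_utilization_percent(busy_start=100.0, busy_end=130.0,
                              interval_seconds=60.0))
```

In Prometheus itself the counter case would normally go through a rate function, which also handles counter resets; the sketch ignores resets to keep the arithmetic visible.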
Source Grafana.com

Used Metrics 20

  • redpanda_application_uptime_seconds_total
  • redpanda_cluster_topics
  • redpanda_kafka_request_bytes_total
  • redpanda_raft_leadership_changes
  • redpanda_kafka_under_replicated_replicas
  • redpanda_kafka_partitions
  • redpanda_kafka_max_offset
  • redpanda_cluster_unavailable_partitions
  • redpanda_storage_disk_free_bytes
  • redpanda_storage_disk_total_bytes
  • redpanda_cpu_busy_seconds_total
  • redpanda_rpc_active_connections
  • redpanda_application_build
  • redpanda_storage_disk_free_space_alert
  • redpanda_kafka_handler_latency_seconds_bucket
  • redpanda_io_queue_total_write_ops
  • redpanda_io_queue_total_read_ops
  • redpanda_memory_available_memory
  • redpanda_memory_available_memory_low_water_mark
  • redpanda_rpc_request_latency_seconds_bucket

PromQL functions listed alongside these metrics: min, stddev, topk
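
Any of the metrics listed here can be inspected outside Grafana through the Prometheus HTTP API's instant-query endpoint, `/api/v1/query`. The sketch below only builds the request URL and sends nothing. The base URL `http://localhost:9090` and the PromQL expression are assumptions for illustration; the expression is not taken from the dashboard's JSON.

```python
from urllib.parse import urlencode

# Hypothetical expression combining two of the listed metrics.
STORAGE_USED_PERCENT = (
    "100 * (1 - sum(redpanda_storage_disk_free_bytes)"
    " / sum(redpanda_storage_disk_total_bytes))"
)


def instant_query_url(base_url: str, promql: str) -> str:
    """Build a Prometheus instant-query URL for the given PromQL expression."""
    query_string = urlencode({"query": promql})
    return f"{base_url.rstrip('/')}/api/v1/query?{query_string}"


# Assumed local Prometheus address; adjust for a real deployment.
url = instant_query_url("http://localhost:9090", STORAGE_USED_PERCENT)
print(url)
```

Fetching the resulting URL with any HTTP client returns a JSON document; the URL is built separately here so the example runs without a Prometheus server.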