Cilium v1.9 Operator Metrics
416,553

Created 12/8/2020
Updated 12/8/2020
Revision 1
Grafana Version >=7.0.1
Datasources
Prometheus

Description

This dashboard monitors the operational health and resource usage of the Cilium v1.9 Operator, providing insights into CPU and memory footprint alongside networking and EC2 interactions. It highlights per-node CPU usage and resident memory, as well as operational metrics like cilium_operator_eni_nodes and cilium_operator_eni_available to track ENI allocation and readiness, while also surfacing EC2 API latency with cilium_operator_eni_aws_api_duration_seconds_sum/count and rate limiting through cilium_operator_ec2_rate_limit_duration_seconds_sum/count. Overall, it combines node-level metrics, ENI management status, and API interaction performance to diagnose capacity, resource pressure, and external API bottlenecks.

Source Grafana.com

Used Metrics 11

  • cilium_operator_ec2_rate_limit_duration_seconds_count

  • cilium_operator_ec2_rate_limit_duration_seconds_sum

  • cilium_operator_eni_available

  • cilium_operator_eni_aws_api_duration_seconds_count

  • cilium_operator_eni_aws_api_duration_seconds_sum

  • cilium_operator_eni_interface_creation_ops

  • cilium_operator_eni_ips

  • cilium_operator_eni_nodes

  • cilium_operator_eni_resync_total

  • cilium_operator_process_cpu_seconds_total

  • cilium_operator_process_resident_memory_bytes

Get Dashboard
Download
Copy to Clipboard