Kubernetes / Kubelet 4,702,803
Description
This dashboard provides an at-a-glance view of a node’s kubelet health and performance, correlating system readiness with pod, container, and volume activity. It emphasizes operation latency and reliability: up surfaces overall availability, while kubelet_runtime_operations_duration_seconds_bucket and kubelet_pod_start_duration_seconds_count surface the latency and throughput of core kubelet tasks. Other panels cover pod and container counts, volume state, and configuration errors, enabling rapid diagnosis of pod startup and runtime issues.
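Several of the metrics this dashboard uses are histogram `_bucket` series, which carry cumulative counts per upper bound rather than individual latencies. The sketch below shows one way a latency quantile can be estimated from such buckets by linear interpolation, broadly in the spirit of Prometheus's `histogram_quantile`. The function name `estimate_quantile` and the sample bucket values are illustrative assumptions, not taken from the dashboard or from real kubelet data.

```python
import math


def estimate_quantile(q, buckets):
    """Estimate the q-quantile from cumulative histogram buckets.

    buckets: list of (upper_bound, cumulative_count) pairs; the highest
    bound must be +Inf, as in Prometheus "le" buckets.
    """
    if not 0 <= q <= 1:
        raise ValueError("q must be between 0 and 1")
    buckets = sorted(buckets)
    if not buckets or not math.isinf(buckets[-1][0]):
        raise ValueError("buckets must end with a +Inf bucket")

    total = buckets[-1][1]
    if total == 0:
        return math.nan  # no observations, nothing to estimate

    rank = q * total  # position of the target observation
    prev_bound, prev_count = 0.0, 0.0  # assume the lowest bucket starts at 0
    for bound, count in buckets:
        if count >= rank:
            if math.isinf(bound) or count == prev_count:
                # Cannot interpolate into +Inf or an empty bucket.
                return prev_bound
            # Assume observations are spread evenly inside the bucket.
            fraction = (rank - prev_count) / (count - prev_count)
            return prev_bound + (bound - prev_bound) * fraction
        prev_bound, prev_count = bound, count
    return prev_bound


# Hypothetical cumulative counts for a *_duration_seconds_bucket series.
sample_buckets = [(0.1, 50), (0.5, 90), (1.0, 100), (math.inf, 100)]
p99 = estimate_quantile(0.99, sample_buckets)
p50 = estimate_quantile(0.50, sample_buckets)
print(f"p99 ~ {p99:.2f}s, p50 ~ {p50:.2f}s")
```

Because the estimate interpolates within a bucket, its accuracy depends on how finely the bucket bounds are spaced around the quantile of interest.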
Used Metrics 24
go_goroutines
kubelet_cgroup_manager_duration_seconds_bucket
kubelet_cgroup_manager_duration_seconds_count
kubelet_node_config_error
kubelet_pleg_relist_duration_seconds_bucket
kubelet_pleg_relist_duration_seconds_count
kubelet_pleg_relist_interval_seconds_bucket
kubelet_pod_start_duration_seconds_count
kubelet_pod_worker_duration_seconds_bucket
kubelet_pod_worker_duration_seconds_count
kubelet_running_container_count
kubelet_running_pod_count
kubelet_runtime_operations_duration_seconds_bucket
kubelet_runtime_operations_errors_total
kubelet_runtime_operations_total
process_cpu_seconds_total
process_resident_memory_bytes
rest_client_request_latency_seconds_bucket
rest_client_requests_total
storage_operation_duration_seconds_bucket
storage_operation_duration_seconds_count
storage_operation_errors_total
up
volume_manager_total_volumes