Skip to content

Kubernetes / Cluster Overview

There is no description on this dashboard

Tags

  • k8s
  • k8s-cluster

Panels

Overview

Name Description Thresholds Repeat
API Servers Shows the number of running API servers depending on the number of endpoints exposing metrics Default:
Mode: absolute
Level 1: 1
Level 2: 2

ETCD Servers Shows number of active ETCD servers. May contain no data for PaaS clouds. Default:
Mode: absolute
Level 1: 1
Level 2: 3

kubelet Nodes Shows number of active kubelet servers
Ingress Pods Shows count of running monitoring pods

Note: Metrics may not be collected from Openshift cluster
Containers Status Show the count of containers in Running, Waiting, and Terminated statuses

Note: Terminated pods not include "Completed" status
Nodes Status Shows count of running and not running nodes in the cluster. May contain no data for PaaS clouds.
Nodes Transition Status Shows transition status of each node in the cluster

1 - OK, 0 - Problem

! This panel doesn't work in restricted mode if kube-state-metrics doesn't have permission to scrape the following resources: nodes
Persistent Volumes Status Shows persistent volumes count grouped by statuses
CPU Usage Shows CPU usage and allocation for the whole cluster

! This panel doesn't work in restricted mode if kube-state-metrics doesn't have permission to scrape the following resources: nodes
Default:
Mode: absolute
Level 1: 0.75
Level 2: 0.9

Storage Shows usage disk space for all mounted volumes to nodes in the cluster Default:
Mode: absolute
Level 1: 0.6
Level 2: 0.8

Memory Usage Shows memory utilization and allocation for the whole cluster

! This panel doesn't work in restricted mode if kube-state-metrics doesn't have permission to scrape the following resources: nodes
Default:
Mode: absolute
Level 1: 0.7
Level 2: 0.9

Not Healthy Pods Show information about the reason the container is currently in waiting or terminated state Default:
Mode: absolute
Level 1: 80

Last Terminated Status Show information about the last reason the container was in terminated state Default:
Mode: absolute
Level 1: 80

Resources

Name Description Thresholds Repeat
Daemon Sets Shows the counts of running/all daemon sets in the cluster
Stateful Sets Shows the counts of running/all stateful sets in the cluster
Deployments Shows the counts of running/all deployments in the cluster
Jobs Shows the counts of running/all jobs in the cluster
Pods Shows the counts of running/all pods in the cluster
Replica Sets Shows the counts of running/all replica sets in the cluster
Replication Controllers Shows the counts of running/all replication controllers in the cluster
Cron Jobs Shows the counts of running/all cron jobs in the cluster

Ingress

Name Description Thresholds Repeat
Controller Request Volume Shows number of operation per second

Note: Metrics may not be collected from Openshift cluster
Ingress Request Volume Show the total number of client requests by ingress

Note: Metrics may not be collected from Openshift cluster
Ingress Controller Success Rate Shows the percentage of successful requests with status codes 2xx-3xx

Note: Metrics may not be collected from Openshift cluster
Default:
Mode: percentage
Level 1: 70
Level 2: 85
Level 3: 90

Certificates

Name Description Thresholds Repeat
Certificates from files Shows count of total, expiring in 30 days and expired certificates from files.
To get the information about certificates cert-exporter must be running.
Certificates from kubeconfig Shows count of total, expiring in 30 days and expired certificates from kubeconfig. To get the information about certificates cert-exporter must be running.
Certificates from secrets Shows count of total, expiring in 30 days and expired certificates from secrets. To get the information about certificates cert-exporter must be running.

CPU

Name Description Thresholds Repeat
CPU Usage Shows CPU usage by each namespace in the cluster
CPU Quota Shows CPU requests and limits quota allocation per namespace in the cluster

Memory

Name Description Thresholds Repeat
Memory Usage Shows memory usage by namespace in the cluster
Memory Quota Shows memory requests and limits quota allocation per namespace in the cluster

Disk

Name Description Thresholds Repeat
Disk I/O utilization per namspaces Shows data read/write per namespace
IOps per namespace Count of writes/reads completed
Disk I/O utilization per node Shows data read/write per node
IOps per node Count of writes/reads completed
Device storage usage Shows the amounts of total and used device storage in the cluster

Volumes

Name Description Thresholds Repeat
Count PVC Show the number of pvc in the cluster
Error: Persistent Volumes Not Bound Show the number of PV with status != "Bound" in the cluster
Error: Persistent Volumes Claim Not Bound Show the number of pvc with status != "Bound" in the cluster
Count PV Show the number of pv in the cluster

Network

Name Description Thresholds Repeat
Current Network Usage Shows network usage details per namespace in the cluster
Network RX/TX Total Shows overall network input and output traffic
Receive/Transmit Bandwidth Shows network traffic per namespace in the cluster
Average Container Bandwidth by Namespace: Received/Transmitted Shows network traffic per container in the cluster
Rate of Received/Transmitted Packets Shows average packets rate per namespace in the cluster
Rate of Received/Transmitted Packets Dropped Shows dropped packets per namespaces in the cluster