Kubernetes / Cluster Overview¶
There is no description on this dashboard
Tags¶
k8s
k8s-cluster
Panels¶
Overview¶
Name | Description | Thresholds | Repeat |
---|---|---|---|
API Servers | Shows the number of running API servers depending on the number of endpoints exposing metrics | Default: Mode: absolute Level 1: 1 Level 2: 2 |
|
ETCD Servers | Shows number of active ETCD servers. May contain no data for PaaS clouds. | Default: Mode: absolute Level 1: 1 Level 2: 3 |
|
kubelet Nodes | Shows number of active kubelet servers | ||
Ingress Pods | Shows count of running monitoring pods Note: Metrics may not be collected from Openshift cluster |
||
Containers Status | Show the count of containers in Running, Waiting, and Terminated statuses Note: Terminated pods not include "Completed" status |
||
Nodes Status | Shows count of running and not running nodes in the cluster. May contain no data for PaaS clouds. | ||
Nodes Transition Status | Shows transition status of each node in the cluster 1 - OK, 0 - Problem ! This panel doesn't work in restricted mode if kube-state-metrics doesn't have permission to scrape the following resources: nodes |
||
Persistent Volumes Status | Shows persistent volumes count grouped by statuses | ||
CPU Usage | Shows CPU usage and allocation for the whole cluster ! This panel doesn't work in restricted mode if kube-state-metrics doesn't have permission to scrape the following resources: nodes |
Default: Mode: absolute Level 1: 0.75 Level 2: 0.9 |
|
Storage | Shows usage disk space for all mounted volumes to nodes in the cluster | Default: Mode: absolute Level 1: 0.6 Level 2: 0.8 |
|
Memory Usage | Shows memory utilization and allocation for the whole cluster ! This panel doesn't work in restricted mode if kube-state-metrics doesn't have permission to scrape the following resources: nodes |
Default: Mode: absolute Level 1: 0.7 Level 2: 0.9 |
|
Not Healthy Pods | Show information about the reason the container is currently in waiting or terminated state | Default: Mode: absolute Level 1: 80 |
|
Last Terminated Status | Show information about the last reason the container was in terminated state | Default: Mode: absolute Level 1: 80 |
Resources¶
Name | Description | Thresholds | Repeat |
---|---|---|---|
Daemon Sets | Shows the counts of running/all daemon sets in the cluster | ||
Stateful Sets | Shows the counts of running/all stateful sets in the cluster | ||
Deployments | Shows the counts of running/all deployments in the cluster | ||
Jobs | Shows the counts of running/all jobs in the cluster | ||
Pods | Shows the counts of running/all pods in the cluster | ||
Replica Sets | Shows the counts of running/all replica sets in the cluster | ||
Replication Controllers | Shows the counts of running/all replication controllers in the cluster | ||
Cron Jobs | Shows the counts of running/all cron jobs in the cluster |
Ingress¶
Name | Description | Thresholds | Repeat |
---|---|---|---|
Controller Request Volume | Shows number of operation per second Note: Metrics may not be collected from Openshift cluster |
||
Ingress Request Volume | Show the total number of client requests by ingress Note: Metrics may not be collected from Openshift cluster |
||
Ingress Controller Success Rate | Shows the percentage of successful requests with status codes 2xx-3xx Note: Metrics may not be collected from Openshift cluster |
Default: Mode: percentage Level 1: 70 Level 2: 85 Level 3: 90 |
Certificates¶
Name | Description | Thresholds | Repeat |
---|---|---|---|
Certificates from files | Shows count of total, expiring in 30 days and expired certificates from files. To get the information about certificates cert-exporter must be running. |
||
Certificates from kubeconfig | Shows count of total, expiring in 30 days and expired certificates from kubeconfig. To get the information about certificates cert-exporter must be running. | ||
Certificates from secrets | Shows count of total, expiring in 30 days and expired certificates from secrets. To get the information about certificates cert-exporter must be running. |
CPU¶
Name | Description | Thresholds | Repeat |
---|---|---|---|
CPU Usage | Shows CPU usage by each namespace in the cluster | ||
CPU Quota | Shows CPU requests and limits quota allocation per namespace in the cluster |
Memory¶
Name | Description | Thresholds | Repeat |
---|---|---|---|
Memory Usage | Shows memory usage by namespace in the cluster | ||
Memory Quota | Shows memory requests and limits quota allocation per namespace in the cluster |
Disk¶
Name | Description | Thresholds | Repeat |
---|---|---|---|
Disk I/O utilization per namspaces | Shows data read/write per namespace | ||
IOps per namespace | Count of writes/reads completed | ||
Disk I/O utilization per node | Shows data read/write per node | ||
IOps per node | Count of writes/reads completed | ||
Device storage usage | Shows the amounts of total and used device storage in the cluster |
Volumes¶
Name | Description | Thresholds | Repeat |
---|---|---|---|
Count PVC | Show the number of pvc in the cluster | ||
Error: Persistent Volumes Not Bound | Show the number of PV with status != "Bound" in the cluster | ||
Error: Persistent Volumes Claim Not Bound | Show the number of pvc with status != "Bound" in the cluster | ||
Count PV | Show the number of pv in the cluster |
Network¶
Name | Description | Thresholds | Repeat |
---|---|---|---|
Current Network Usage | Shows network usage details per namespace in the cluster | ||
Network RX/TX Total | Shows overall network input and output traffic | ||
Receive/Transmit Bandwidth | Shows network traffic per namespace in the cluster | ||
Average Container Bandwidth by Namespace: Received/Transmitted | Shows network traffic per container in the cluster | ||
Rate of Received/Transmitted Packets | Shows average packets rate per namespace in the cluster | ||
Rate of Received/Transmitted Packets Dropped | Shows dropped packets per namespaces in the cluster |