Service with alarms

This example shows how to set up Monitoring to collect metrics from a service and to add an alerting rule to Prometheus.
ServiceMonitor
```yaml
spec:
  endpoints:
    - interval: 30s
      port: http
  jobLabel: k8s-app
  selector:
    matchLabels:
      k8s-app: sample-service
```
This means that Prometheus will scrape the service with the following settings:

- the scrape job is named after the `k8s-app` label (`jobLabel: k8s-app`)
- targets are the endpoints (pods) of every service labeled `k8s-app: sample-service`
- metrics are collected from each discovered pod on the port named `http`, with a scrape interval of `30s`
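For reference, a complete ServiceMonitor built from this spec, together with a Service it would select, might look roughly as follows. The resource names and the port number are assumptions for illustration; only the labels, port name, and scrape settings come from the example above.

```yaml
# Sketch of the full ServiceMonitor resource (Prometheus operator API).
apiVersion: monitoring.coreos.com/v1
kind: ServiceMonitor
metadata:
  name: sample-service            # hypothetical name
  labels:
    k8s-app: sample-service
spec:
  jobLabel: k8s-app
  endpoints:
    - interval: 30s
      port: http                  # must match a named port on the Service
  selector:
    matchLabels:
      k8s-app: sample-service     # selects Services carrying this label
---
# Sketch of a Service that the ServiceMonitor above would select.
apiVersion: v1
kind: Service
metadata:
  name: sample-service            # hypothetical name
  labels:
    k8s-app: sample-service       # label matched by the ServiceMonitor selector
spec:
  selector:
    app: sample-service           # hypothetical pod selector
  ports:
    - name: http                  # port name referenced by the ServiceMonitor endpoint
      port: 8080                  # hypothetical metrics port
      targetPort: 8080
```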
PrometheusRule
```yaml
spec:
  groups:
    - name: general.rules
      rules:
        - alert: TargetDown-service-prom
          annotations:
            description: '{{ $value }}% of {{ $labels.job }} targets are down.'
            summary: Targets are down
          expr: 100 * (count(up == 0) BY (job) / count(up) BY (job)) > 10
          for: 10m
          labels:
            severity: warning
        - alert: DeadMansSwitch-service-prom
          annotations:
            description: This is a DeadMansSwitch meant to ensure that the entire Alerting pipeline is functional.
            summary: Alerting DeadMansSwitch
          expr: vector(1)
          labels:
            severity: none
```
This means that Prometheus will evaluate the specified expressions and send the resulting alerts to AlertManager. In this example:

- Alert `Targets are down`:
  - the description uses templates (see AlertManager Templating)
  - the expression is evaluated and the alert fires only once it has been true for 10 minutes (`for: 10m`)
  - when it fires, the alert carries severity `warning` (`labels.severity: warning`)
- Alert `Alerting DeadMansSwitch`:
  - a simple always-firing alert (`expr: vector(1)`) used to verify that the whole alerting pipeline is working
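As with the ServiceMonitor, the rule spec above has to be wrapped in a full PrometheusRule resource. A minimal sketch follows; the metadata name and labels are assumptions and should match whatever your Prometheus instance's ruleSelector expects, and only one of the two rules is repeated here for brevity:

```yaml
apiVersion: monitoring.coreos.com/v1
kind: PrometheusRule
metadata:
  name: sample-service-rules      # hypothetical name
  labels:
    prometheus: k8s               # assumption: label matched by the Prometheus ruleSelector
spec:
  groups:                         # the spec.groups section shown above goes here
    - name: general.rules
      rules:
        - alert: DeadMansSwitch-service-prom
          expr: vector(1)
          labels:
            severity: none
          annotations:
            summary: Alerting DeadMansSwitch
            description: This is a DeadMansSwitch meant to ensure that the entire Alerting pipeline is functional.
```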
Files

How to apply the example
Kubernetes:
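For example, assuming the manifests above are saved as `servicemonitor.yaml` and `prometheusrule.yaml` (the file names are illustrative, not part of the original example):

```bash
kubectl apply -f servicemonitor.yaml -f prometheusrule.yaml
```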
OpenShift:
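The same can be done with the OpenShift CLI, using the same illustrative file names:

```bash
oc apply -f servicemonitor.yaml -f prometheusrule.yaml
```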
Links
- Prometheus operator
  - API Documentation
  - Alerting Rules
- Victoriametrics operator
  - API Documentation