Deploy a custom Prometheus stack, in which you deploy the kube-prometheus-stack from the upstream Helm chart.
Observe application metrics, in which you redirect the cpu_temperature_celsius
metric to the localhost and the Prometheus UI. You later observe how the metric value changes in the predefined 10 seconds interval in which Prometheus scrapes the metric values from the service's /metrics
endpoint.
Create a Grafana dashboard, in which you create a Grafana dashboard of a Gauge type for the cpu_temperature_celsius
metric. This dashboard shows explicitly when the CPU temperature is equal to or higher than the predefined threshold of 75 degrees Celsius, at which point the dashboard turns red.
Define alerting rules, in which you define the CPUTempHigh
alerting rule by creating a PrometheusRule resource. Prometheus accesses the /metrics
endpoint every 10 seconds and validates the current value of the cpu_temperature_celsius
metric. If the value is equal to or higher than 75 degrees Celsius, Prometheus waits for 10 seconds to recheck it. If the value still exceeds the threshold, Prometheus triggers the rule. You can observe both the rule and the alert it generates on the Prometheus dashboard.