Component Monitoring Configuration Guide

韵味老鸟 2024-06-26 17:58:36

I: Recording Rules

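This file defines a PrometheusRule resource named node-exporter-rules. It contains a single rule group, node-exporter.rules, with one recording rule. The rule evaluates an expression and stores the result in Prometheus as a new series named instance:node_filesystem_avail_ratio:ratio. The expression computes available filesystem space as a percentage of total size, and because of the trailing "< 10" comparison only filesystems with less than 10% free space are recorded.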

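# apiVersion is required by the Prometheus Operator CRD; the original snippet omitted it
apiVersion: monitoring.coreos.com/v1
kind: PrometheusRule
metadata:
  name: node-exporter-rules
spec:
  groups:
    - name: node-exporter.rules
      rules:
        # Percentage of free space per filesystem; only values below 10 are kept
        - expr: node_filesystem_avail_bytes / node_filesystem_size_bytes * 100 < 10
          record: instance:node_filesystem_avail_ratio:ratio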
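
As a rough sketch of how the recorded series might be consumed, an alerting rule could reference it by name instead of repeating the full expression. The group name, alert name, "for" duration, and annotation text below are assumptions made for illustration and are not part of the original configuration.

```yaml
groups:
  - name: node-exporter.alerts            # hypothetical group name
    rules:
      - alert: NodeFilesystemAlmostFull   # hypothetical alert name
        # The recorded series only contains values below 10, so the
        # comparison is redundant; it is kept to make the intent explicit.
        expr: instance:node_filesystem_avail_ratio:ratio < 10
        for: 5m                           # assumed hold time before firing
        labels:
          severity: warning
        annotations:
          summary: Filesystem almost full (instance {{ $labels.instance }})
          description: "Mountpoint {{ $labels.mountpoint }} has {{ $value }}% space left"
```

Referencing the recorded series keeps the alert expression short and lets Prometheus reuse the precomputed result on every evaluation.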

II: Alerting Rules for Kubernetes and Related Components

For alerting rules covering Kubernetes and related components, see:

https://runbooks.prometheus-operator.dev/

1. Prometheus self-monitoring alerts

- alert: PrometheusJobMissing
  expr: absent(up{job="prometheus"})
  for: 0m
  labels:
    severity: warning
  annotations:
    summary: Prometheus job missing (instance {{ $labels.instance }})
    description: "A Prometheus job has disappeared\n VALUE = {{ $value }}\n LABELS = {{ $labels }}"
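
When the Prometheus Operator is in use, an alert like this one would typically be wrapped in a PrometheusRule resource, in the same way as the recording rule in section I. The following is a minimal sketch. The resource name, namespace, selector label, and group name are hypothetical and would need to match the actual cluster setup.

```yaml
apiVersion: monitoring.coreos.com/v1
kind: PrometheusRule
metadata:
  name: prometheus-self-rules        # hypothetical resource name
  namespace: monitoring              # hypothetical namespace
  labels:
    release: kube-prometheus-stack   # hypothetical; must match the Prometheus ruleSelector
spec:
  groups:
    - name: prometheus.self          # hypothetical group name
      rules:
        - alert: PrometheusJobMissing
          # absent() returns a series only when no "up" series exists for the job
          expr: absent(up{job="prometheus"})
          for: 0m
          labels:
            severity: warning
          annotations:
            # absent() only carries labels from equality matchers (here: job),
            # so {{ $labels.instance }} renders as an empty string
            summary: Prometheus job missing (instance {{ $labels.instance }})
            description: "A Prometheus job has disappeared\n VALUE = {{ $value }}\n LABELS = {{ $labels }}"
```

The operator only loads PrometheusRule objects whose labels match the ruleSelector of the Prometheus custom resource, which is why the metadata labels matter here.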
