组件监控配置说明
一:Record说明
在这个文件中,我们定义了一个名为node-exporter-rules的Prometheus Rule。它包含一个名为node-exporter.rules的规则组。该规则组定义了一个名为node_filesystem_avail_ratio的指标,并将其记录在Prometheus中
kind: PrometheusRulemetadata: name: node-exporter-rulesspec: groups: - name: node-exporter.rules rules: - expr: node_filesystem_avail_bytes / node_filesystem_size_bytes * 100 < 10 record: instance:node_filesystem_avail_ratio:ratio二:Kubernetes及相关组件告警规则
Kubernetes及相关组件告警规则,可参考:
https://runbooks.prometheus-operator.dev/
1.prometheus告警自身
- alert: PrometheusJobMissing expr: absent(up{job="prometheus"}) for: 0m labels: severity: warning annotations: summary: Prometheus job missing (instance {{ $labels.instance }}) description: "A Prometheus job has disappeared\n VALUE = {{ $value }}\n LABELS = {{ $labels }}"