Prometheus监控环境错误案例
<h3>【1:已解决】redis_exporter监控问题</h3>
<p><img src="https://www.showdoc.com.cn/server/api/attachment/visitFile?sign=c4bee6c4f03888eac4b506a39b535582&file=file.png" alt="" />
<strong>分析:</strong>
查看Prometheus的github文档后(<a href="https://github.com/prometheus/prometheus/issues/2459">https://github.com/prometheus/prometheus/issues/2459</a>) 发现scrape_interval的周期收集时间设置太短,导致Prometheus收集时间太短导致超时,周期性从监控目标上收集数据更改为30s,scrape_timeout超时时间也设置为30s</p>
<p><img src="https://www.showdoc.com.cn/server/api/attachment/visitFile?sign=b9072691191372c53462e7a75cd55bf5&file=file.png" alt="" /></p>
<h3>【2:已解决】redis_exporter与Prometheus集成后,grafana识别不了 data source,重启也没用</h3>
<p><img src="https://www.showdoc.com.cn/server/api/attachment/visitFile?sign=28e7c808ba48845fde237abab57afe4c&file=file.png" alt="" /></p>
<p>解决方法:更换监控模板即可</p>
<h3>【3:未解决】Prometheus告警模板无法识别,始终是默认模板进行告警</h3>
<p><strong>默认模板:</strong>
<img src="https://www.showdoc.com.cn/server/api/attachment/visitFile?sign=418d950e5b3f090b17c6d3ec25b46fcd&file=file.png" alt="" /></p>
<p><strong>自定义模板:</strong></p>
<p><img src="https://www.showdoc.com.cn/server/api/attachment/visitFile?sign=b4fd453a0fcd033d46b7106563ba70db&file=file.png" alt="" /></p>
<h3>【4:已解决】prometheus指定rule_files的路径后,prometheus无法启动</h3>
<p>systemctl status prometheus 查看状态后发现文件有问题,找到对应文件发现yml文件格式错误,修改后重启问题解决</p>
<h3>【5:已解决】启动rabbitmq时出现Error:</h3>
<p>{:enabled_plugins_mismatch, '/usr/local/rabbitmq/rabbitmq_server-3.7.8/etc/rabbitmq/enabled_plugins', '/usr/local/rabbitmq/etc/rabbitmq/enabled_plugins'}</p>
<p>解决方法:kill掉rabbitmq进程,重新启动</p>
<h3>【6:已解决】redis_exporter无法连接redis实例:</h3>
<p><img src="https://www.showdoc.com.cn/server/api/attachment/visitFile?sign=b6a8e2765a055154a3af5e6115c452e1&file=file.png" alt="" /></p>
<p>解决方法:进入redis_exporter.service服务文件,在ExecStart行中检查-redis.addr和-redis.password是否正确</p>
<h3>【7:已解决】在监控mysqld_exporter时,Prometheus连接超时:</h3>
<p><img src="https://www.showdoc.com.cn/server/api/attachment/visitFile?sign=734428b22a47de9aae7cb48aa83a267e&file=file.png" alt="" /></p>
<p>解决方法:因为端口被防火墙限制了,需要同时放行端口客户端服务端口、服务端也要放行</p>