starsliao / prometheus Goto Github PK

Grafana Dashboards for Prometheus Exporter

Home Page: https://grafana.com/orgs/starsliao/dashboards

License: Apache License 2.0

Python 100.00%

prometheus's Introduction

💎 我的推荐项目

Project	Stars	Forks	Remark
后羿 - TenSunS			后羿 - TenSunS(原ConsulManager)是一个使用Flask+Vue开发，基于Consul的WEB运维平台，弥补了Consul官方UI对Services管理的不足；并且基于Consul的服务发现与键值存储：实现了Prometheus自动发现多云厂商各资源信息；基于Blackbox对站点监控的可视化维护；以及对自建与云上资源的优雅管理与展示。
Prometheus			Grafana Dashboards for Prometheus Exporter：http://grafana.com/orgs/starsliao/dashboards
PythonTools			Python写的小工具集合
StarsL.cn			StarsL.cn
multi_mysqld_exporter			Multi-target support

prometheus's People

Contributors

Stargazers

Watchers

Forkers

sunmaybo rickchen1979 yangmygod yangboyd uglyliu mofelee hwting zx0825 xiaoruiguo mooncats cheyunhua ordinaryfan michellebai jarvis2294 cmdy widerstehen manians61 ming-ddtechcg zoushiwen liliang8858 i-box dr1s forging2012 introvenk shangzhipei justinlee88 jameswsg yuhuayun qxaok snowind liqinsg tonyskapunk rubbish822 jianweixs 1806933 jeandanielrimaz skyducks zhixiangjoy humhunter hazdik ljb-2000 vivikcat mobidyc fucku mountainmist1 leoncai-freestyle leoncaiau912 juliansteam wopost-github nicospec eshun klippo zhanglei allanwong marionxue zjkguo nareshi2k2 sontost zangqianglei edocevol precipit bookgh lbd2013 haokinglong nameless1987 youzx8 zhizhenxin 958015805 qzw1210 nanzm cheng1022 kerven88 rsrrrrrr fatihvrl fengye0705 souktel-user thabetamer uppercaveman lengender erlotsman qiaobz99 xlogin michalazarovitz sonvt1710 gooshy 0521ak47 cnhuashao daliang1215 xieyonglu catslave fancy03 phonglh79 han110zheng tools-env fishzle laashub-soa 284923181 pyy55open rog-maximus boystray

prometheus's Issues

Templating init failed Cannot read property 'result' of undefined

我已阅读README.md
当前的问题：

系统环境

模块名称	版本号
grafana	6.1.3
prometheus	2.9.1.linux-amd64
node_exporter	0.17.0

系统配置

项目	配置
hosts	mycentos
ip	192.168.22.129
操作系统	centos 7

Prometheus 配置文件

global:
  scrape_interval:     15s # Set the scrape interval to every 15 seconds. Default is every 1 minute.
  evaluation_interval: 15s # Evaluate rules every 15 seconds. The default is every 1 minute.

scrape_configs:
  - job_name: 'prometheus'
    static_configs:
    - targets: ['192.168.22.129:9090']
      labels:
        instance: prometheus

  - job_name: "job1"
    static_configs:
    - targets: ['192.168.22.129:9100']
      labels:
        instance: instance1

DataSource配置

之前尝试过的步骤

安装饼状插图
手动修改主机名为mycentos,节点名为192.168.22.129:9100
修改grafana版本为5.2.4

上面的步骤并没有对结果产生影响。

在Prometheus中查询各node的instance格式和版本如下

必须：$node取值node_exporter的instance，IP+端口格式。该看板大部分查询关联了这个变量，请确保该变量有效：

这一段中的$node是手动配置还是自动配置的？形如"instance1,192.168.22.129:9100"这样设置吗？

我一周前开始接触Prometheus和Grafana，现在想跑一个示例来监控机器的基本运行情况，但是目前的情况看起来像是某些变量配置错误，或者Grafana没有正确读取对应的节点信息，所以导致图表数据没有正常返回。

仪表盘的变量如下

我是不是需要手动修改name和node的值？
我该怎么做才能让图表正常显示呢？

以CPU使用率为例，在Grafana中不能正确显示

但是在Prometheus中可以查询到正确结果

Dashboard combine 2 job to 1 job

First of all, Thanks for awesome dashboard.
It's very cool and look interesting.

But I have a little problem with it.

From my prometheus config file

And in dashboard on grafana

All of job name is working well.
But only csag-resource and csag-sv2 look like that combine metrics from 2 job to the same job on dashboard.

Result on job csag-resource is same as csag-sv2 on dashboard

Do you have any reason about that result ?

浏览器控制台显示以下错误:
react-dom.production.min.js:171 TypeError: Cannot read property 'exports' of undefined
at e.loadPlugin (DashboardGrid.tsx:166)
at e.componentDidMount (DashboardGrid.tsx:208)
at da (react-dom.production.min.js:213)
at ca (react-dom.production.min.js:205)
at la (react-dom.production.min.js:204)
at Fo (react-dom.production.min.js:200)
at Object.enqueueSetState (react-dom.production.min.js:130)
at r.y.setState (react.production.min.js:13)
at r.i.strategisedSetState (react-sizeme.js:308)
at react-sizeme.js:336
io @ react-dom.production.min.js:171

Grafana version: v5.4.0

wmi_exporter监控windows数据好像不正确,感觉数据源有问题

上面汇总的数据,内存使用率磁盘使用率带宽使用率,数值永远是一样的
而且下面的数据感觉也不对,刚添加的节点, 竟然有历史数据

应该是有很多panel没有添加数据源引起的, 检查每个panel的datasource后,数据显示正常了

blackbox_export配置

可否提供blackbox_export和prometheus的相关配置进行参考，谢谢

Why are you using fixed range selections instead of $__interval?

Just out of curiosity, since your dashboards are one of the more popular ones available in the Grafana Cloud: Is there any reason why you are using fixed range selections like rate(metric[5m]) over the dynamic variable like rate(metric[$__interval])?

Node Exporter for Prometheus Dashboard CN v20191102显示No Data

您好，打扰到您了，本人想请教一下关于显示No Data的问题。

我的docker-compose文件里面包含了使用的所有镜像版本
Prometheus.zip
如果可以的话，请您远程帮忙看一下，谢谢
我联系方式：qq 786744873 验证666666

分时磁盘使用率

分时磁盘使用率显示不正常，极少数情况下显示100%以内，绝大多数情况下显示超过1000%

Dashboard English Version

Thanks for sharing the dashboard, and it looks great. I was wondering if there is an english version of the dashboard?

内存CPU监控报警怎么做呢？

您好，感谢提供dashboard
请问系统一些基本监控邮件报警怎么做呢？您提供的指标都没有写一些报警情况

English translation

Hi,
Thank you very much for the wonderful Grafana dashboard, would it be possible to provide an English translation for the few values that are not in English?
Thanks.

温度指标没有

node_hwmon_temp_celsius{instance="$node"}

prometheus 没有这项是怎么回事？

node_hwmon_temp_celsius 我是0.17版本

【各分区可用空间】这一栏有问题

分区错误如下：

/etc/hostname
/etc/hosts
/etc/resolv.conf

你好，ext3的文件系统无法显示磁盘的使用情况

如图所示经过测试ext3格式的分区不能获取到数据

点击主机明细后，链接会重写，重写后参数丢失，图形无法显示

http://grafana.xx.com/d/9CWBz0bik/1-node-exporter-for-prometheus-dashboard-cn-v20200628?orgId=1&var-job=&var-hostname=&var-node=&var-device=

我修改了，从新窗口打开，参数正常，可以显示图形。本窗口打开，参数丢失。能看到过程，显示正常后，又自动刷没了。是不是和本窗口的自动刷新有冲突

ip和端口隐藏

你好，我想隐藏ip和端口不想在生产端暴露出来要怎样修改dashboard才可以呢。

中文兼容版json报错“Only queries that return single series/table is supported”

grafana版本：7.0.6
prometheus版本：2.19.2
触发条件：当工具栏-IP（自动关联主机名）选择All的时候，内存使用率，最大分区使用率和交换分区使用率就会报“Only queries that return single series/table is supported”的错误。选择任意一台机器之后问题消失。只要选择All或者主机只有一台的情况下就会出现这种情况

Consider moving to a non contaminating license (eg. Apache 2)

First of all, thank you so much for this repo and dashboard which is extremely well done and useful to us.

As part of our internal infrastructure monitoring, we would like to do an internal fork of your node_exporter dashboard. However, the GPL license is preventing us to import this code into our own internal repo because of the "disclose source" clause of the GPL (see https://tldrlegal.com/license/gnu-general-public-license-v3-(gpl-3)).

Is the choice of GPL on purpose or would you consider switching to a more permissive license? Maybe, it could even encourage us or other enterprise users to contribute PRs in the future.
Thank you ! 谢谢！

8989模板报错

Failed create dashboard model
undefined is not iterable (cannot read property Symbol(Symbol.iterator))

ENV not found

网络宽带不显示

网络进宽带不显示
网络出宽带不显示
默认表达式是：
irate(node_network_receive_bytes_total{instance=~~'$node',device=~~'$nic'}[5m])8
正确表达式是：（有待验证）
进宽带：
irate(node_network_receive_bytes_total{instance=~~'$node',device!~~'tap.|veth.|br.|docker.|virbr|lo*'}[5m])8
出宽带：
irate(node_network_transmit_bytes_total{instance=~~'$node',device!~~'tap.|veth.|br.|docker.|virbr|lo*'}[5m])*8
请核实表达式并更新！

看板无法展示centos6/ubuntu1404的内存使用率

仪表板内存使用率不显示

undefined is not iterable (cannot read property Symbol(Symbol.iterator))

错误：undefined is not iterable (cannot read property Symbol(Symbol.iterator))，我直接报这个错误。

There will be no alarm when the alarm threshold is reached, but there will be recovery alarm.

English translation?

Hello, nice grafana dashboards, do you have plans to translate to English?

Blackbox Exporter data source not Displaying on panel

Blackbox Exporter data source not Displaying on panel, and some panel seem different from the image on grafana dashboard and its not displaying like ssl certificate monitoring and etc, i need your help please thanks

https://grafana.com/grafana/dashboards/9965

节点没有自动更新

感谢提供的 Dashboard，实际是用的过程中发现当集群中有新的 Node 加入之后，节点的下拉框里面并不会同步更新。

我尝试把 node 变量改成了

label_values(node_exporter_build_info,instance)

也不起作用。
请问是我的配置问题吗？

今天docker安装操作了一下，不显示数据

1.求一个qq群，或者微信群。

Node Exporter for Prometheus Dashboard 刚安装使用，竟然不显示数据，深受打击。
但是docker日志都没有报错。
我是2020-08-05安装的docker环境（node_export+premetheus+grafana），docker拉取的都是最新版本。
不知道为什么不显示数据， job node hostname 这几个变量没有获取到数据。

2.官方公布的测试ok版本

Grafana v7.0.1 + node_exporter 0.18.1测试使用正常 , 请告知一下 prometheus 版本是多少，我按照官方重新安装。

3.配置部分


# 看板说明 , 以下这段话涉及到的变量    job  node  这几个在我导入模板后还需要设置什么东西吗？

导入看板后，请根据实际情况在看板右上角点击Dashboard settings--Variables设置好变量：
默认已经设置并关联好job，hostname，node这3个变量。

$node取值node_exporter的instance，IP:端口格式。大部分查询关联了这个变量，请确保该变量有效！
$maxmount用来查询当前主机的最大分区，默认只获取ext.*和xfs类型的分区。

4.我的premotheus配置，我不知道哪里导致的获取不到数据，求指导

# my global config
global:
  scrape_interval:     15s # Set the scrape interval to every 15 seconds. Default is every 1 minute.
  evaluation_interval: 15s # Evaluate rules every 15 seconds. The default is every 1 minute.
  # scrape_timeout is set to the global default (10s).

# Alertmanager configuration
alerting:
  alertmanagers:
    - static_configs:
        - targets:
          # - alertmanager:9093

# Load rules once and periodically evaluate them according to the global 'evaluation_interval'.

#A scrape configuration containing exactly one endpoint to scrape:
#Here it's Prometheus itself.
scrape_configs:
  #The job name is added as a label `job=<job_name>` to any timeseries scraped from this config.
  - job_name: 'prometheus'
    static_configs:
      - targets: ['192.168.6.12:9090']
        labels:
          instance: "prometheus"
  - job_name: "node"
    static_configs:
      - targets: ["192.168.6.12:9100"]
        labels:
          instance: "node"

5.2020-08-07 补充

我发现 premetheus 直接在物理机运行，grafana就可以获取服务器数据，但是通过docker方式启动的话，就获取不到数据。
我检查了prometheus.yml文件，对比了物理机的配置文件，格式和内容都是一样的，如上。
目前依然没有搞明白，为什么以docker运行 prometheus 就无法获取数据？
如果可能，请加一下我qq：1990850157，直接在我服务器远程排查一下，不胜感激！

建议加上实时数据

这里有最大最小平均建议在这里加上实时数据

windows资源监控，看板上CPU、网络等面板没有数据

ip

多node情况下显示不全

多个node情况下显示不全，是否需要独立修改参数？

switch the host name, the resources detail area doesn't change.

the edition is: Node Exporter for Prometheus Dashboard 20200530

node_exporter 0.18版本支持问题

node_exporter 0.18版本，没有一组ip和主机名对应的指标了，是不是需要作出调整 #

No swap causes NaN

When a node does not have swap defined, then it causes the graph to look like 100% and NaN is the value

Adding 1 byte to each will cause it to evaluate to 0%. If the node uses swap, then the 1-byte change should be non-impacting

(1 - ((node_memory_SwapFree_bytes{instance=~"$node"} + 1)/ (node_memory_SwapTotal_bytes{instance=~"$node"} + 1))) * 100

中文版最新dashboard资源总揽未能显示多个主机信息

我安装后，发现最新中文版的Job中没有All选项与https://grafana.com/grafana/dashboards/8919中截图效果有差异。
资源总揽中也只能显示一个被监控主机的信息。需要从Job选择切换。

有几个指标没有显示出来

关于磁盘的几个指标没有显示出来。不知道怎么设置

wmi_exporter 改名为 windows_exporter

如题。
导入下载的 node_exporter for windows 的 json 配置文件后，没有正确展示。所有面板都没有数据。
参见说明

这回是真的问题，哈哈，没图

就是个黑框，从新编

辑又是一个黑框

不显示硬件温度

Node_exporter 版本 0.18.1

原因是这个版本没有node_hwmon_temp_celsius 这个指标了

请问怎么解决呢

Here is one suggestion: The uptime panel is using sum. Results could be not helpful when viewing multiple nodes in the same time. max might be better.