Coder Social home page Coder Social logo

prometheus's Introduction

Star History Chart

💎 我的推荐项目

Project Stars Forks Remark
后羿 - TenSunS 后羿 - TenSunS(原ConsulManager)是一个使用Flask+Vue开发,基于Consul的WEB运维平台,弥补了Consul官方UI对Services管理的不足;并且基于Consul的服务发现与键值存储:实现了Prometheus自动发现多云厂商各资源信息;基于Blackbox对站点监控的可视化维护;以及对自建与云上资源的优雅管理与展示。
Prometheus Grafana Dashboards for Prometheus Exporter:http://grafana.com/orgs/starsliao/dashboards
PythonTools Python写的小工具集合
StarsL.cn StarsL.cn
multi_mysqld_exporter Multi-target support

prometheus's People

Contributors

boystray avatar eravion avatar ilanni2460 avatar micah avatar starsliao avatar tuanna08 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

prometheus's Issues

Templating init failed Cannot read property 'result' of undefined

我已阅读README.md
当前的问题:

7

系统环境

模块名称 版本号
grafana 6.1.3
prometheus 2.9.1.linux-amd64
node_exporter 0.17.0

系统配置

项目 配置
hosts mycentos
ip 192.168.22.129
操作系统 centos 7

41

Prometheus 配置文件

global:
  scrape_interval:     15s # Set the scrape interval to every 15 seconds. Default is every 1 minute.
  evaluation_interval: 15s # Evaluate rules every 15 seconds. The default is every 1 minute.

scrape_configs:
  - job_name: 'prometheus'
    static_configs:
    - targets: ['192.168.22.129:9090']
      labels:
        instance: prometheus

  - job_name: "job1"
    static_configs:
    - targets: ['192.168.22.129:9100']
      labels:
        instance: instance1

DataSource配置

903

之前尝试过的步骤

  • 安装饼状插图
  • 手动修改主机名为mycentos,节点名为192.168.22.129:9100
  • 修改grafana版本为5.2.4

上面的步骤并没有对结果产生影响。

在Prometheus中查询各node的instance格式和版本如下
2831

必须:$node取值node_exporter的instance,IP+端口格式。该看板大部分查询关联了这个变量,请确保该变量有效:

这一段中的$node是手动配置还是自动配置的?形如"instance1,192.168.22.129:9100"这样设置吗?

我一周前开始接触Prometheus和Grafana,现在想跑一个示例来监控机器的基本运行情况,但是目前的情况看起来像是某些变量配置错误,或者Grafana没有正确读取对应的节点信息,所以导致图表数据没有正常返回。

仪表盘的变量如下
14

我是不是需要手动修改namenode的值?
我该怎么做才能让图表正常显示呢?

以CPU使用率为例,在Grafana中不能正确显示
4840

但是在Prometheus中可以查询到正确结果
821

Dashboard combine 2 job to 1 job

First of all, Thanks for awesome dashboard.
It's very cool and look interesting.

But I have a little problem with it.

From my prometheus config file
image

And in dashboard on grafana
image

All of job name is working well.
But only csag-resource and csag-sv2 look like that combine metrics from 2 job to the same job on dashboard.

Result on job csag-resource is same as csag-sv2 on dashboard

Do you have any reason about that result ?

安装新版本后, 不显示任何指标

浏览器控制台显示以下错误:
react-dom.production.min.js:171 TypeError: Cannot read property 'exports' of undefined
at e.loadPlugin (DashboardGrid.tsx:166)
at e.componentDidMount (DashboardGrid.tsx:208)
at da (react-dom.production.min.js:213)
at ca (react-dom.production.min.js:205)
at la (react-dom.production.min.js:204)
at Fo (react-dom.production.min.js:200)
at Object.enqueueSetState (react-dom.production.min.js:130)
at r.y.setState (react.production.min.js:13)
at r.i.strategisedSetState (react-sizeme.js:308)
at react-sizeme.js:336
io @ react-dom.production.min.js:171

Grafana version: v5.4.0

wmi_exporter监控windows数据好像不正确,感觉数据源有问题

上面汇总的数据,内存使用率 磁盘使用率 带宽使用率,数值永远是一样的
而且下面的数据感觉也不对,刚添加的节点, 竟然有历史数据

应该是有很多panel没有添加数据源引起的, 检查每个panel的datasource后,数据显示正常了

windows

分时磁盘使用率

分时磁盘使用率显示不正常,极少数情况下显示100%以内,绝大多数情况下显示超过1000%

Dashboard English Version

Thanks for sharing the dashboard, and it looks great. I was wondering if there is an english version of the dashboard?

English translation

Hi,
Thank you very much for the wonderful Grafana dashboard, would it be possible to provide an English translation for the few values that are not in English?
Thanks.

温度指标没有

node_hwmon_temp_celsius{instance="$node"}

prometheus 没有这项 是怎么回事?

node_hwmon_temp_celsius 我是0.17版本

ip和端口隐藏

你好,我想隐藏ip和端口不想在生产端暴露出来要怎样修改dashboard才可以呢。

中文兼容版json报错“Only queries that return single series/table is supported”

grafana版本:7.0.6
prometheus版本:2.19.2
触发条件:当工具栏-IP(自动关联主机名)选择All的时候,内存使用率,最大分区使用率和交换分区使用率就会报“Only queries that return single series/table is supported”的错误。选择任意一台机器之后问题消失。只要选择All或者主机只有一台的情况下就会出现这种情况

image.png

image.png

Consider moving to a non contaminating license (eg. Apache 2)

First of all, thank you so much for this repo and dashboard which is extremely well done and useful to us.

As part of our internal infrastructure monitoring, we would like to do an internal fork of your node_exporter dashboard. However, the GPL license is preventing us to import this code into our own internal repo because of the "disclose source" clause of the GPL (see https://tldrlegal.com/license/gnu-general-public-license-v3-(gpl-3)).

Is the choice of GPL on purpose or would you consider switching to a more permissive license? Maybe, it could even encourage us or other enterprise users to contribute PRs in the future.
Thank you ! 谢谢 !

8989模板报错

Failed create dashboard model
undefined is not iterable (cannot read property Symbol(Symbol.iterator))

网络宽带不显示

网络进宽带不显示
网络出宽带不显示
默认表达式是:
irate(node_network_receive_bytes_total{instance='$node',device='$nic'}[5m])8
正确表达式是:(有待验证)
进宽带:
irate(node_network_receive_bytes_total{instance='$node',device!'tap.
|veth.|br.|docker.|virbr|lo*'}[5m])8
出宽带:
irate(node_network_transmit_bytes_total{instance='$node',device!'tap.
|veth.|br.|docker.|virbr|lo*'}[5m])*8
请核实表达式并更新!

节点没有自动更新

感谢提供的 Dashboard,实际是用的过程中发现当集群中有新的 Node 加入之后,节点的下拉框里面并不会同步更新。

我尝试把 node 变量改成了

label_values(node_exporter_build_info,instance)

也不起作用。
请问是我的配置问题吗?

今天docker安装操作了一下,不显示数据

1.求一个qq群,或者微信群。

Node Exporter for Prometheus Dashboard 刚安装使用,竟然不显示数据,深受打击。
但是docker日志都没有报错。
我是2020-08-05安装的docker环境(node_export+premetheus+grafana),docker拉取的都是 最新版本。
不知道为什么不显示数据, job node hostname 这几个变量没有获取到数据。

2.官方公布的测试ok版本

Grafana v7.0.1 + node_exporter 0.18.1测试使用正常 , 请告知一下 prometheus 版本是多少,我按照官方重新安装。

3.配置部分


# 看板说明 , 以下这段话涉及到的变量    job  node  这几个在我导入模板后还需要设置什么东西吗?

导入看板后,请根据实际情况在看板右上角点击Dashboard settings--Variables设置好变量:
默认已经设置并关联好job,hostname,node这3个变量。

$node取值node_exporter的instance,IP:端口格式。大部分查询关联了这个变量,请确保该变量有效!
$maxmount用来查询当前主机的最大分区,默认只获取ext.*和xfs类型的分区。

4.我的premotheus配置 ,我不知道哪里导致的获取不到数据,求指导

# my global config
global:
  scrape_interval:     15s # Set the scrape interval to every 15 seconds. Default is every 1 minute.
  evaluation_interval: 15s # Evaluate rules every 15 seconds. The default is every 1 minute.
  # scrape_timeout is set to the global default (10s).

# Alertmanager configuration
alerting:
  alertmanagers:
    - static_configs:
        - targets:
          # - alertmanager:9093

# Load rules once and periodically evaluate them according to the global 'evaluation_interval'.

#A scrape configuration containing exactly one endpoint to scrape:
#Here it's Prometheus itself.
scrape_configs:
  #The job name is added as a label `job=<job_name>` to any timeseries scraped from this config.
  - job_name: 'prometheus'
    static_configs:
      - targets: ['192.168.6.12:9090']
        labels:
          instance: "prometheus"
  - job_name: "node"
    static_configs:
      - targets: ["192.168.6.12:9100"]
        labels:
          instance: "node"

5.2020-08-07 补充

我发现 premetheus 直接在物理机运行,grafana就可以获取服务器数据,但是通过docker方式启动的话,就获取不到数据。
我检查了prometheus.yml文件,对比了物理机的配置文件,格式和内容都是一样的,如上。
目前依然没有搞明白,为什么以docker运行 prometheus 就无法获取数据?
如果可能,请加一下我qq:1990850157, 直接在我服务器远程排查一下,不胜感激!

No swap causes NaN

When a node does not have swap defined, then it causes the graph to look like 100% and NaN is the value
Screen Shot 2019-11-26 at 10 53 12 AM

Adding 1 byte to each will cause it to evaluate to 0%. If the node uses swap, then the 1-byte change should be non-impacting

(1 - ((node_memory_SwapFree_bytes{instance=~"$node"} + 1)/ (node_memory_SwapTotal_bytes{instance=~"$node"} + 1))) * 100

不显示硬件温度

Node_exporter 版本 0.18.1

原因是 这个版本 没有node_hwmon_temp_celsius 这个指标了

请问怎么解决呢

Job Title clean up

I removed a few node explorer but in the dashboard it still list those nodes how can i remove them from the job list?

blackbox_exporter图表不一致

你好版主,我导入了你的blackbox模板后,发现状态明细展示跟你的不一样,没有domain,连通性,ssl显示也不一样,是模板问题吗?

绑定IP看板

Hi,我想要创建多个看板,每个看板绑定一个固定IP,应该如何修改JSON文件?

图表能显示,但是没有数据

node exporter 0.16, prometheus 2.6.1, Grafana cli version 5.4.3/5.4.2
已经安装了grafana显示饼图的插件,但是数据没有显示。
修改参数成为 label_values(node_exporter_build_info,instance)

default

Uptime issue

Hi there,

Nice dashboard! Thanks.

Here is one suggestion: The uptime panel is using sum. Results could be not helpful when viewing multiple nodes in the same time. max might be better.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.