Coder Social home page Coder Social logo

icinga2-diagnostics's People

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

icinga2-diagnostics's Issues

Add VMware as Hypervisor

Could someone please run virt-what on a VMware Guest (Linux) and enter the output in this issue?

Check the schema version of the Director DB

This could be useful for determining:

  • Which Director version is installed (if we don't get the information on any other way)
  • If the Director could apply all needed schema migrations (check needed version of release against installed version)
  • Something else I can't think about right now.

Check for InfluxDB

Check if InfluxDB is installed and if so, get some basic information like version.

Check for proxies

Check environment variables, curlrc, etc. for configured proxies.

Take care to check them for root, icinga and every user that might be involved.

Only show if there is a proxy set (maybe the way it was set) not the proxy itself because the configuration might contain passwords.

"Suggestion mode"

Add an option for a mode which analyzes the setup and gives hints where to look for problems and how to fix them yourself. Maybe this could be the first step when having a problem. Rerunning the script in its current default mode to provide information when asking for help could become the second step when all suggestions did not help with debugging.

Thanks @dnsmichi for the idea.

Mask passwords in output

In standard output there are some passwords (e.g. Api Users). Mask them as a default.
Think about adding an option to have them in clear text though.

Check for asynchronous nodes in HA setups

In rare occasions the sync between nodes, esp. master nodes can break. If it breaks completely you see the nodes as disconnected but in even rarer occasions the sync continues but misses some objects.

e.g. you acknowledge a problem on one node but the other one misses the acknowledgement. This way you end up with an acknowledged problem but you might still get notifications for it.

We should find a way how to check if two nodes within a zone are synced completely.

Add "full" mode

Add an option to have the script output not on stdout but a tarball which includes the whole icinga 2 configuration, logs, etc.

Search for segfaults in logs

A uses setup showed Icinga 2 segfaulting. Search for entries like this in the syslog logs and report them as anomaly.

Get "Director" version

This is a thing we see very often in support.

Since there are no packages for director right now, there are different ways to get Director installed and sometimes it's hard for support engineers and even the user to determine the current version.

Possible ways of installation I can think of:

  • git clone
  • git clone with checkout of specific tag or release
  • Download of specific release
  • Download of master as zip file which leaves us with a some random commit

Show used commands

Work through the logs to see which plugins are called. Don't just list the loglines but only count the plugin with full path. Options to plugins often contain sensitive data so we don't want that to be part of the output.

Show info when no anoamlies were found

The empty headline "Anomalies found" can be misleading and users might think there were anomalies found while this is only a headline. When there is nothing below the headline there were no anomalies found.

Use the `icinga` Check

Verify that the icinga CheckCommand is used.

Even better to use it's return to check for all used Icinga 2 versiones in the setup. Some users can not update all nodes within a reasonable timeframe so checking if they are up-to-date might be important for debugging.

Check if load balancing is unbalanced.

Check if the checks are evently distributred between nodes within an zone.

When this new feature is implemented, we should collect some sample data from setups and then introduce thresholds for anomaly detection.

Ways to find if the nodes are unbalanced are discribed in https://www.icinga.com/docs/icinga2/latest/doc/15-troubleshooting/#late-check-results-in-distributed-environments and https://www.icinga.com/2016/08/11/analyse-icinga-2-problems-using-the-console-api/

Check for Graphite

Check if Graphite is installed and if so, get some basic information like version.

Walk through the list of packages but don't show every package in default mode.

This will significantly shorten the output but not the time it takes to generate the output. Maybe we should print the headline of packages and start searching

Use variables to list all packages not signed by the icinga key. Just add every matching package to a list (and maybe increase a counter). Show the list in the end where anomalies are listed.

Create Markdown output

Just an idea.

  • Should this be the default or and extra option?
  • Is this even something useful?

cat: /etc/redhat-release: No such file or directory

OS Version: cat: /etc/redhat-release: No such file or directory
Ubuntu
14.04
Hypervisor: Running on hardware or unknown hypervisor
CPU cores: 12
RAM: 20G
Firewall: active

Checked script on Ubuntu machine

Check for fpm

The script still finds the old PHP installation used before Icinga Web 2 2.5 or still there from updating. Best would be to check for both so side effects of leaving old PHP when upgrading to FPM can still be seen.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.