Coder Social home page Coder Social logo

Comments (7)

dragoangel avatar dragoangel commented on August 17, 2024

Also another option is to add label with reason of failures, instead creating other metrics :)

It would be cool if all data that now written to logs about check failures will be available via metric label so user can create alert description that will indicate exact reason of failure. aka:

read tcp x:x->x:x: i/o timeout
dial tcp x:x: operation was canceled
regex: ^x didn't match: ...
dial tcp x:25: connect: connection refused
...
etc

from ssl_exporter.

ribbybibby avatar ribbybibby commented on August 17, 2024

The up metric records the success/failure of requests from Prometheus -> the exporter. I don't think it would be right for us to return a non-2xx response if the exporter is fine but the issue is with the upstream.

You make a good point about non-TLS related failures though. Perhaps we should ignore errors from the upstream as long as we can successfully establish a TLS connection and extract certificates?

from ssl_exporter.

dragoangel avatar dragoangel commented on August 17, 2024

My point that I want to see that host is down or timeout separetly from ssl. This could be explicitly each separate metric but it requires a lot of alerts, or reason could be recorded as a label - then only one alert can cover all issues and throw exact reason of failure

from ssl_exporter.

ribbybibby avatar ribbybibby commented on August 17, 2024

What are some examples of SSL related errors? TLS verification failing? Is there anything else that can happen?

I suppose bugs in our regex matching for starttls? Or servers that do things in a way we haven't accounted for?

from ssl_exporter.

ribbybibby avatar ribbybibby commented on August 17, 2024

Putting raw error log strings into metrics strikes me as the wrong approach. Metrics are not designed for that kind of information.

We could have some coarser labels like 'starttls' I guess? What would you actually use this delineation for though? How would you treat a host that is timing out vs a host that is failing the starttls handshake differently?

from ssl_exporter.

dragoangel avatar dragoangel commented on August 17, 2024

Yes I would definetly treat host with failed ssl definitely compared of host that down, because it could be server totally off. Alerting that provide exact reason what is going on always better that alert that could be due to different reasons because you need to check all of them. And in historical view - you would know what it was, without going and reading logs of remote ssl exporter that setuped somewhere far away :)

from ssl_exporter.

dragoangel avatar dragoangel commented on August 17, 2024

What are some examples of SSL related errors? TLS verification failing? Is there anything else that can happen?

I suppose bugs in our regex matching for starttls? Or servers that do things in a way we haven't accounted for?

For example regex can not match in smtp when server is totally overloaded and do not return any data, just open connection, saw it couple of times

from ssl_exporter.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.