Comments (6)

jkwatson commented on June 19, 2024

Hi @skonto. This is definitely something that I am interested in, being at Verta. Have you thought about what kind of metric aggregations would be relevant for model monitoring, and how they might map to the OpenTelemetry metric instruments? Would we need some custom aggregations? Could they be done in-process, or would some sort of collector be more appropriate? I'd love to hear your thoughts on this topic.

from oteps.

skonto commented on June 19, 2024

Hi @jkwatson, here are some initial thoughts. The existing instruments and aggregations already give us enough to start with.
For example, we can split model monitoring into two basic areas: model performance and model operational performance (the terms can change). In the first area we need to identify what we want to measure and then try to map it to the standard or extend it. For example, we might find it useful to measure data drift and use Jensen-Shannon divergence implemented as a histogram. In the second area we might consider disk I/O, uptime, CPU/memory utilization, scoring latency, etc.
Of course, we need input about what is useful to standardize, what people use in practice, etc.
We could start researching this by doing the exercise of putting together a number of metrics that provide a good overview of a model's performance. cc @dineshg13
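
To make the drift example concrete, here is a minimal sketch of Jensen-Shannon divergence computed from two sets of histogram bucket counts. It is plain Python, not an OpenTelemetry API. The function names and the assumption that both histograms share the same bucket boundaries are illustrative only.

```python
import math


def normalize(counts):
    """Turn raw bucket counts into a probability distribution."""
    total = sum(counts)
    if total <= 0:
        raise ValueError("histogram must contain at least one observation")
    return [c / total for c in counts]


def kl_divergence(p, q):
    """Kullback-Leibler divergence in bits; empty buckets in p contribute 0."""
    return sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def js_divergence(ref_counts, live_counts):
    """Jensen-Shannon divergence between two histograms.

    Assumes both histograms use identical bucket boundaries (hypothetical
    setup). With log base 2 the result lies in [0, 1].
    """
    if len(ref_counts) != len(live_counts):
        raise ValueError("histograms must have the same number of buckets")
    p = normalize(ref_counts)
    q = normalize(live_counts)
    # Mixture distribution; it is non-zero wherever p or q is non-zero,
    # so the KL terms below never divide by zero.
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)
```

Identical histograms give a divergence of 0, and histograms with no overlapping buckets give 1, so a drift alert could be expressed as a threshold on this value.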

jkwatson commented on June 19, 2024

I think the interesting new work would be in model performance, and not necessarily in the operational performance, which I think would probably fall under standard observability-style metrics.

With regards to drift... I can imagine a reference histogram being recorded somewhere, and then some kind of time-windowed cumulative histogram being used to compute the divergence. I'm not sure OTel itself really has facilities for "time windowed" cumulative histograms right now... there might be an opportunity to make a proposal to fulfill that need.
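
As a rough illustration of the "time windowed" idea, here is a minimal sketch of a histogram that only counts observations from the last N seconds. It is plain Python and does not use any OpenTelemetry facility. The class name, the explicit timestamp arguments, and the upper-inclusive bucket convention are all assumptions made for the example.

```python
import bisect
from collections import deque


class WindowedHistogram:
    """Bucket counts over a sliding time window (hypothetical helper)."""

    def __init__(self, boundaries, window_seconds):
        self.boundaries = sorted(boundaries)
        self.window_seconds = window_seconds
        # One bucket per boundary plus an overflow bucket.
        self.counts = [0] * (len(self.boundaries) + 1)
        self._events = deque()  # (timestamp, bucket_index), oldest first

    def record(self, value, timestamp):
        """Add one observation; timestamps are expected to be non-decreasing."""
        self._evict(timestamp)
        # bisect_left places value == boundary in the bucket that boundary closes.
        idx = bisect.bisect_left(self.boundaries, value)
        self.counts[idx] += 1
        self._events.append((timestamp, idx))

    def snapshot(self, now):
        """Return a copy of the bucket counts for the window ending at `now`."""
        self._evict(now)
        return list(self.counts)

    def _evict(self, now):
        # Drop observations that are window_seconds old or older.
        cutoff = now - self.window_seconds
        while self._events and self._events[0][0] <= cutoff:
            _, idx = self._events.popleft()
            self.counts[idx] -= 1
```

A snapshot taken from such a structure could then be compared against the stored reference histogram with whatever divergence measure is chosen. Keeping every observation in memory is the simplest approach, and a real implementation would probably rotate fixed sub-windows instead.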

skonto commented on June 19, 2024

I'm not sure OTel itself really has facilities for "time windowed" cumulative histograms right now... there might be an opportunity to make a proposal to fulfill that need.

That would be interesting work, want to start a draft doc?

jkwatson commented on June 19, 2024

I'm not sure OTel itself really has facilities for "time windowed" cumulative histograms right now... there might be an opportunity to make a proposal to fulfill that need.

That would be interesting work, want to start a draft doc?

I don't think I have time to work on that at the moment, but I'd be very happy to take a look at a proposal from someone who did have time.
