Comments (6)
Hi @skonto. This is definitely something that I am interested in, being at verta. Have you thought about what kind of metric aggregations would be relevant for model monitoring, and how they might map to the OpenTelemetry metric instruments? Would we need some custom aggregations? Could they be done in-process, or would some sort of collector be more appropriate? Love to hear your thoughts on this topic.
from oteps.
Hi @jkwatson here are some initial thoughts. We already have enough to start with existing instruments and aggregations.
For example we can split model monitoring into two basic areas like model performance and model operational performance (terms can change). In the first area we need to do the work of identifying what we want to measure and then try map to the standard or extend it. For example we might find useful to measure data drift and use Jense-Shannon divergence implemented as a histogram. In the second domain we might consider disko i/o, uptime, cpu/mem utilization, scoring latency etc.
Of course we need input about what is useful to standardize, what people use in practice etc.
We could start researching this by doing the exercise of putting together a number of metrics that provide a good overview of a model's performance. cc @dineshg13
from oteps.
I think the interesting new work would be in model performance, and not necessarily in the operational performance, which I think would probably fall understand standard observability-style metrics.
With regards to drift... I can imagine a reference histogram being recorded somewhere, and then some kind of time-windowed cumulative histogram being used to compute the divergence. I'm not sure OTel itself really has facilities for "time windowed" cumulative histograms right now... there might be an opportunity to make a proposal to fulfill that need.
from oteps.
I'm not sure OTel itself really has facilities for "time windowed" cumulative histograms right now... there might be an opportunity to make a proposal to fulfill that need.
That would be interesting work, want to start a draft doc?
from oteps.
I'm not sure OTel itself really has facilities for "time windowed" cumulative histograms right now... there might be an opportunity to make a proposal to fulfill that need.
That would be interesting work, want to start a draft doc?
I don't think I have time to work on that at the moment, but I'd be very happy to take a look at a proposal from someone who did have time.
from oteps.
Related Issues (20)
- Proposal: Exemplars HOT 2
- Add labels to entry level tasks for new contributors
- Proposal: Adding profiling as a support event type HOT 33
- Proposal: Establish consistent guidelines for organizing vendor specific contributions to the OpenTelemetry Collector HOT 1
- Proposal: Non-core components like Exporters should live in contrib repos HOT 12
- Proposal: Enable security vulnerability scans on OTel repos HOT 4
- Proposal: Ability to associate tracer by alias with exporter/appender/destination HOT 1
- Proposal: Reduce clock-skew issues in mobile and other client-side trace sources HOT 5
- Proposal: Supporting Real User Monitoring Events in OpenTelemetry HOT 27
- Proposal: Add Sensitive Data Labels HOT 15
- Proposal: Remote Sampling
- Proposal: "Plugable backend" Tracing Client/Query library HOT 16
- Proposal: Add support for Elastic Common Schema (ECS) in OpenTelemetry HOT 6
- Proposal: specify how opentelemetry will deal with idle metrics no longer being reported
- Add link to opentelemetry.io under About section HOT 1
- Proposal: Dynamic configuration of metrics HOT 2
- Proposal: clarify behavior when retrieving non-existent currently active span HOT 13
- Proposal: The OpenTelemetry Spec should allow SDKs to export all the spans regardless of their sampled flag
- Proposal: Resource Scope and Namespace API HOT 6
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from oteps.