Comments (6)
If I were to operate this application in a business context (company with a +20000 user GHES instance) I'd be looking for the following:
- indexing information per connector (last index date, index success/failure, crawling data)
- usage metrics (number of active sessions, tokens generated)
What can I do with those?
The index data and success status tell me if the connectors are working correctly and with the changes in the crawling data I can see if there was suddenly a big change in the ingested content that I otherwise might not be aware of.
With the active sessions I could get a better understanding of how the system is used throughout the day to plan changes or maintenance. And the generated tokens gives me a correlation for usage and cost.
from danswer.
Hey @kavfixnel! Do you have any specific metrics / areas you want observability into?
from danswer.
A chat history could be a useful metric specially on the Enterprise scenario with at least the requesting user and the total tokens generated.
from danswer.
Follow up question, what specific things would be useful to track via prometheus? We're trying to avoid introducing too much extra infra that's not related to the core requirements of the project. Is there a must-have reason for observability via a third party log ingestion tool?
That said we are currently adding better logging on the backend for things such as model response, latency etc.
If this is just better logging, we are already doing it and will have an update soon.
If this is about a third party log ingestion service, please explain the need, thx
from danswer.
@yuhongsun96 A better logging is indeed useful since I could parse to ingest in my own analytics system that is already in place in the company. It's a good step.
I wish to use such observability together with DLP solutions to ensure a safe usage of LLM systems across the company.
from danswer.
Although logging could be used to create metrics about usage, typically it makes more sense to add metrics to the a application directly.
Since Danswer seems to specifically focus on containerized Kubernetes applications, I suggested Prometheus since thatβs a common way to observe metrics in these types of apps.
For metrics I was looking for:
- How often endpoints were called
- Request duration
- Number of times APIs (OpenAI etc.) were called as well as request duration and number of errors to these APIs
- Scrape durations of data sources
The nice thing about adding metrics is you can balsa add alerting for when things go wrong, which seems like something an Enterprise tool should have
from danswer.
Related Issues (20)
- Connector for JAMA Connect (Requirement mgmt) HOT 1
- No ETA for indexing, too slow, can't parallel index HOT 9
- Connector for Service now
- Danswer responds to English prompt in Spanish
- failed to delete connector: key error HOT 1
- I want to delete some files in a connector HOT 3
- Human-to-SQL Query Generation with Charting HOT 2
- Gpt assistants can not use ollama source(Custom LLM Provider) LLM HOT 6
- Slackbot missing persona__prompt entry when using document sets HOT 1
- danswer-model-server services do not survive a reboot when using docker compose
- Documents created via Web connector can not be updated via Ingestion API
- PostgreSQL Error while using GPT-4 Turbo endpoint on Azure
- Support Azure GPT-4 Turbo endpoint for image generation and maximum number of tokens HOT 1
- PDF file upload not correctly parsed HOT 3
- Slack connector PostgreSQL connection closed after long indexing
- Customize Chunk Size Per Connector
- Could a self-hosted danswer cause a netscan? HOT 1
- Danswer GPT4ALL
- add Cloudflare web connector authentication HOT 1
- Getting the error "service "background" refers to undefined volume local_dynamic_storage: invalid compose project"
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
π Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. πππ
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google β€οΈ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from danswer.