Comments (6)
@jmink - We're triaging tickets now. Do you have any more context from this conversation that you can provide here?
from chef-server.
This came out of a brief conversation with Phil from Facebook. Basically he doesn't know how to read erlang stack traces and would prefer some error message that's a bit more focused on what the user can do to fix things.
from chef-server.
Hey @sdelano @jmink -- @mauriciosilva and I are on-site at Facebook, this is still an issue. This could be a CE project that @mauriciosilva can help out on.
from chef-server.
@seth once said of this (roughly) "We totally need to do that - it would save everyone from customers to CSE countless hours. It's easy to do in a way we can re-use across all our erlang daemons"
@jmink echoed the clear wins here.
Filing tickets with a stacktrace because a config file doesn't have the right password in it, waiting for a CSE person to decode and explain possible things and then fixing the config file is a ginormous waste of time and energy. "Authentication to Postgresql failed" means no ticket, no waste of CSE times, no waste of customer's time, and a far, far better customer experience. You chose to use erland for the clear performance and testability wins - and that's awesome. But all languages have pros and cons, and the cost of using erland is there are no sane error messages, so you get to do that yourself.
Versions of this bug keep getting closed. It's incredibly frustrating, please stop. This is a clear win for everyone involved. Multiple engineers agree. Please figure out when to schedule it on a sprint (or your nearest internal equivalent).
from chef-server.
@seth, @sdelano, is there already thinking on how this can be done, e.g. a list of common cases where we can do something better than a stack trace?
from chef-server.
I'll take a first stab after getting some feedback from Phil / CSE:
- Our component data flow can guide the most common cases, i.e. failure to connect to services / resources such as databases, bifrost, bookshelf, solr, rabbit, etc.
- Such connection failures can be classified around authentication, authorization, general availability (resource does not exist), or worst case some "other" failure type.
- The message should include a human meaningful name of the resource, e.g. a hostname, postgres connection string, etc.
- Where possible, such errors should suggest corrective actions or diagnoses for the operator, e.g. ensuring a certificate is valid and verifiable, ensuring the database is reachable through some command line tool, etc.
I have more specifics in some notes. The goal here would be to let operators know information that will let them fix things quickly, and avoid the need for outside help.
from chef-server.
Related Issues (20)
- Installing chef-manage via chef-server-ctl is not working HOT 1
- chef-server-ctl user-create with prompt for password is broken HOT 1
- Upgrade to rails 7 and ruby 3+ in oc-id HOT 2
- OCID: profile email update is throwing error
- New nodes aren't indexed but are known to Chef-Server (Version 14) HOT 7
- Update the version of Chef server in Automate HOT 1
- Unable to upload/delete cookbook with Chef Admin account
- Chef Automate 2022-01 failing chef-server-ctl test HOT 1
- Chef Client Range Search Unexpected Results HOT 4
- API Endpoints to update client certs not accessible PUT HOT 1
- Cookbook parsing fails on restore knife ec backup/restore. HOT 1
- embedded knife commands show warnings HOT 1
- Incorrect metadata in a cookbook causes all client runs on nodes in that org to fail, irrespective of them using the cookbook in question. HOT 1
- Update External Opensearch documentation with the user permissions required for Chef to work correctly with Opensearch. HOT 1
- chef-server-ctl test in failing in FIPS enabled Amazon Linux 2 system. HOT 1
- Chef server install fails at "add internal user to opensearch security plugin" on local proxmox host but not AWS HOT 2
- Unable to `chef-server-ctl reconfigure` a new 15.3.2 install on Ubuntu 22.04 HOT 8
- Cookbook with invalid dependencies causes ALL Chef client runs to begin failing (even on nodes that do not use the cookbook in question) HOT 4
- New OpenSSL requirements in RHEL 9 in fips mode [RHSA-2023:3722-01], cannot connect to Chef Server anymore with no EMS support
- Chef search results are limited to 10,000 records when using external OpenSearch 1.3.x regardless of max_result_window
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from chef-server.