Comments (8)
Can you share the error?
from cortex-helm-chart.
cortex-distributor-5478b6758f-cl457 1/1 Running 0 10m
cortex-distributor-5478b6758f-lgkjc 1/1 Running 0 10m
cortex-ingester-66488dd769-krjmz 1/1 Running 0 2d23h
cortex-ingester-66488dd769-twt2g 1/1 Running 0 2d23h
cortex-nginx-5cf69b45b7-qq9ph 1/1 Running 0 16d
cortex-nginx-5cf69b45b7-tqkzb 1/1 Running 0 16d
cortex-querier-57f85ccccb-v4lsv 1/1 Running 0 2d23h
cortex-query-frontend-6cc65c85c5-bwczv 1/1 Running 0 10m
cortex-query-frontend-6cc65c85c5-rg2xm 1/1 Running 0 10m
Failing pods after upgrade from 1.14.1 to 1.15.3:
cortex-alertmanager-8647857b79-v9qdm 0/1 Running 3 (87s ago) 6m29s
cortex-compactor-0 0/1 Running 0 6m21s
cortex-ingester-64d6b76cc8-bp5d5 0/1 Running 0 6m29s
cortex-querier-74f7f589-2f9js 0/1 Running 3 (87s ago) 6m28s
cortex-ruler-7746ccfd6f-xnjr7 0/1 Running 3 (85s ago) 6m27s
cortex-store-gateway-0 0/1 Running 0 6m23s
k logs -n cortex cortex-querier-74f7f589-2f9js
level=info ts=2023-09-28T10:30:59.065683442Z caller=main.go:194 msg="Starting Cortex" version="(version=1.15.3, branch=HEAD, revision=21e8366)"
level=info ts=2023-09-28T10:30:59.066064447Z caller=server.go:323 http=[::]:8080 grpc=[::]:9095 msg="server listening on addresses"
level=info ts=2023-09-28T10:30:59.067243862Z caller=memberlist_client.go:399 msg="Using memberlist cluster node name" name=cortex-querier-74f7f589-2f9js-36456d1f
level=info ts=2023-09-28T10:30:59.077885501Z caller=memberlist_client.go:575 msg="joined memberlist cluster" reached_nodes=4
level=info ts=2023-09-28T10:30:59.085044494Z caller=memberlist_client.go:536 msg="joined memberlist cluster" reached_nodes=4
ts=2023-09-28T10:31:00.178013763Z caller=memberlist_logger.go:74 level=warn msg="Got ping for unexpected node 'cortex-querier-74f7f589-2f9js-808f43ae' from=100.77.7.114:7946"
ts=2023-09-28T10:31:02.179318929Z caller=memberlist_logger.go:74 level=warn msg="Got ping for unexpected node 'cortex-querier-74f7f589-2f9js-808f43ae' from=100.77.7.50:7946"
ts=2023-09-28T10:31:02.179673135Z caller=memberlist_logger.go:74 level=warn msg="Got ping for unexpected node 'cortex-querier-74f7f589-2f9js-808f43ae' from=100.77.7.42:7946"
ts=2023-09-28T10:31:02.179716035Z caller=memberlist_logger.go:74 level=warn msg="Got ping for unexpected node 'cortex-querier-74f7f589-2f9js-808f43ae' from=100.77.7.109:7946"
ts=2023-09-28T10:31:02.179881038Z caller=memberlist_logger.go:74 level=warn msg="Got ping for unexpected node cortex-querier-74f7f589-2f9js-808f43ae from=100.77.7.114:42498"
ts=2023-09-28T10:31:04.069733066Z caller=memberlist_logger.go:74 level=warn msg="Got ping for unexpected node 'cortex-querier-74f7f589-2f9js-808f43ae' from=100.77.7.109:7946"
k logs cortex-ruler-7746ccfd6f-xnjr7 -n cortex
level=info ts=2023-09-28T10:31:00.404790924Z caller=main.go:194 msg="Starting Cortex" version="(version=1.15.3, branch=HEAD, revision=21e8366)"
level=info ts=2023-09-28T10:31:00.405091428Z caller=server.go:323 http=[::]:8080 grpc=[::]:9095 msg="server listening on addresses"
k logs -n cortex cortex-store-gateway-0
level=info ts=2023-09-28T10:22:49.386296971Z caller=main.go:194 msg="Starting Cortex" version="(version=1.15.3, branch=HEAD, revision=21e8366)"
level=info ts=2023-09-28T10:22:49.386591376Z caller=server.go:323 http=[::]:8080 grpc=[::]:9095 msg="server listening on addresses"
k logs cortex-alertmanager-8647857b79-v9qdm -n cortex
level=info ts=2023-09-28T10:27:38.465985397Z caller=main.go:194 msg="Starting Cortex" version="(version=1.15.3, branch=HEAD, revision=21e8366)"
level=info ts=2023-09-28T10:27:38.466976213Z caller=server.go:323 http=[::]:8080 grpc=[::]:9095 msg="server listening on addresses"
k logs cortex-alertmanager-8647857b79-v9qdm -n cortex
level=info ts=2023-09-28T10:27:38.465985397Z caller=main.go:194 msg="Starting Cortex" version="(version=1.15.3, branch=HEAD, revision=21e8366)"
level=info ts=2023-09-28T10:27:38.466976213Z caller=server.go:323 http=[::]:8080 grpc=[::]:9095 msg="server listening on addresses"
k logs -n cortex cortex-compactor-0
level=info ts=2023-09-28T10:22:48.991038111Z caller=main.go:194 msg="Starting Cortex" version="(version=1.15.3, branch=HEAD, revision=21e8366)"
level=info ts=2023-09-28T10:22:48.991531119Z caller=server.go:323 http=[::]:8080 grpc=[::]:9095 msg="server listening on addresses"
level=info ts=2023-09-28T10:22:48.992992641Z caller=module_service.go:64 msg=initialising module=server
level=info ts=2023-09-28T10:22:48.993143843Z caller=module_service.go:64 msg=initialising module=memberlist-kv
level=info ts=2023-09-28T10:22:48.993146343Z caller=module_service.go:64 msg=initialising module=runtime-config
level=info ts=2023-09-28T10:22:48.993390847Z caller=module_service.go:64 msg=initialising module=compactor
k logs -n cortex cortex-ingester-64d6b76cc8-bp5d5
level=info ts=2023-09-28T10:22:37.989674144Z caller=main.go:194 msg="Starting Cortex" version="(version=1.15.3, branch=HEAD, revision=21e8366)"
level=info ts=2023-09-28T10:22:37.989985548Z caller=server.go:323 http=[::]:8080 grpc=[::]:9095 msg="server listening on addresses"
from cortex-helm-chart.
cortex-distributor-5478b6758f-cl457 1/1 Running 0 10m cortex-distributor-5478b6758f-lgkjc 1/1 Running 0 10m cortex-ingester-66488dd769-krjmz 1/1 Running 0 2d23h cortex-ingester-66488dd769-twt2g 1/1 Running 0 2d23h cortex-nginx-5cf69b45b7-qq9ph 1/1 Running 0 16d cortex-nginx-5cf69b45b7-tqkzb 1/1 Running 0 16d cortex-querier-57f85ccccb-v4lsv 1/1 Running 0 2d23h cortex-query-frontend-6cc65c85c5-bwczv 1/1 Running 0 10m cortex-query-frontend-6cc65c85c5-rg2xm 1/1 Running 0 10m Failing pods after upgrade from 1.14.1 to 1.15.3: cortex-alertmanager-8647857b79-v9qdm 0/1 Running 3 (87s ago) 6m29s cortex-compactor-0 0/1 Running 0 6m21s cortex-ingester-64d6b76cc8-bp5d5 0/1 Running 0 6m29s cortex-querier-74f7f589-2f9js 0/1 Running 3 (87s ago) 6m28s cortex-ruler-7746ccfd6f-xnjr7 0/1 Running 3 (85s ago) 6m27s cortex-store-gateway-0 0/1 Running 0 6m23s k logs -n cortex cortex-querier-74f7f589-2f9js level=info ts=2023-09-28T10:30:59.065683442Z caller=main.go:194 msg="Starting Cortex" version="(version=1.15.3, branch=HEAD, revision=21e8366)" level=info ts=2023-09-28T10:30:59.066064447Z caller=server.go:323 http=[::]:8080 grpc=[::]:9095 msg="server listening on addresses" level=info ts=2023-09-28T10:30:59.067243862Z caller=memberlist_client.go:399 msg="Using memberlist cluster node name" name=cortex-querier-74f7f589-2f9js-36456d1f level=info ts=2023-09-28T10:30:59.077885501Z caller=memberlist_client.go:575 msg="joined memberlist cluster" reached_nodes=4 level=info ts=2023-09-28T10:30:59.085044494Z caller=memberlist_client.go:536 msg="joined memberlist cluster" reached_nodes=4 ts=2023-09-28T10:31:00.178013763Z caller=memberlist_logger.go:74 level=warn msg="Got ping for unexpected node 'cortex-querier-74f7f589-2f9js-808f43ae' from=100.77.7.114:7946" ts=2023-09-28T10:31:02.179318929Z caller=memberlist_logger.go:74 level=warn msg="Got ping for unexpected node 'cortex-querier-74f7f589-2f9js-808f43ae' from=100.77.7.50:7946" ts=2023-09-28T10:31:02.179673135Z caller=memberlist_logger.go:74 level=warn msg="Got ping for unexpected node 'cortex-querier-74f7f589-2f9js-808f43ae' from=100.77.7.42:7946" ts=2023-09-28T10:31:02.179716035Z caller=memberlist_logger.go:74 level=warn msg="Got ping for unexpected node 'cortex-querier-74f7f589-2f9js-808f43ae' from=100.77.7.109:7946" ts=2023-09-28T10:31:02.179881038Z caller=memberlist_logger.go:74 level=warn msg="Got ping for unexpected node cortex-querier-74f7f589-2f9js-808f43ae from=100.77.7.114:42498" ts=2023-09-28T10:31:04.069733066Z caller=memberlist_logger.go:74 level=warn msg="Got ping for unexpected node 'cortex-querier-74f7f589-2f9js-808f43ae' from=100.77.7.109:7946" k logs cortex-ruler-7746ccfd6f-xnjr7 -n cortex level=info ts=2023-09-28T10:31:00.404790924Z caller=main.go:194 msg="Starting Cortex" version="(version=1.15.3, branch=HEAD, revision=21e8366)" level=info ts=2023-09-28T10:31:00.405091428Z caller=server.go:323 http=[::]:8080 grpc=[::]:9095 msg="server listening on addresses" k logs -n cortex cortex-store-gateway-0 level=info ts=2023-09-28T10:22:49.386296971Z caller=main.go:194 msg="Starting Cortex" version="(version=1.15.3, branch=HEAD, revision=21e8366)" level=info ts=2023-09-28T10:22:49.386591376Z caller=server.go:323 http=[::]:8080 grpc=[::]:9095 msg="server listening on addresses" k logs cortex-alertmanager-8647857b79-v9qdm -n cortex level=info ts=2023-09-28T10:27:38.465985397Z caller=main.go:194 msg="Starting Cortex" version="(version=1.15.3, branch=HEAD, revision=21e8366)" level=info ts=2023-09-28T10:27:38.466976213Z caller=server.go:323 http=[::]:8080 grpc=[::]:9095 msg="server listening on addresses" k logs cortex-alertmanager-8647857b79-v9qdm -n cortex level=info ts=2023-09-28T10:27:38.465985397Z caller=main.go:194 msg="Starting Cortex" version="(version=1.15.3, branch=HEAD, revision=21e8366)" level=info ts=2023-09-28T10:27:38.466976213Z caller=server.go:323 http=[::]:8080 grpc=[::]:9095 msg="server listening on addresses" k logs -n cortex cortex-compactor-0 level=info ts=2023-09-28T10:22:48.991038111Z caller=main.go:194 msg="Starting Cortex" version="(version=1.15.3, branch=HEAD, revision=21e8366)" level=info ts=2023-09-28T10:22:48.991531119Z caller=server.go:323 http=[::]:8080 grpc=[::]:9095 msg="server listening on addresses" level=info ts=2023-09-28T10:22:48.992992641Z caller=module_service.go:64 msg=initialising module=server level=info ts=2023-09-28T10:22:48.993143843Z caller=module_service.go:64 msg=initialising module=memberlist-kv level=info ts=2023-09-28T10:22:48.993146343Z caller=module_service.go:64 msg=initialising module=runtime-config level=info ts=2023-09-28T10:22:48.993390847Z caller=module_service.go:64 msg=initialising module=compactor k logs -n cortex cortex-ingester-64d6b76cc8-bp5d5 level=info ts=2023-09-28T10:22:37.989674144Z caller=main.go:194 msg="Starting Cortex" version="(version=1.15.3, branch=HEAD, revision=21e8366)" level=info ts=2023-09-28T10:22:37.989985548Z caller=server.go:323 http=[::]:8080 grpc=[::]:9095 msg="server listening on addresses"
There is not a single error in your logs.
from cortex-helm-chart.
Yep - the readiness checks are not healthy (as below) so pods never get to a ready state and are eventually killed.
1s Warning Unhealthy pod/cortex-store-gateway-0 Startup probe failed: Get "http://100.77.7.153:8080/ready": context deadline exceeded (Client.Timeout exceeded while awaiting headers)
25s Warning Unhealthy pod/cortex-compactor-0 Startup probe failed: HTTP probe failed with statuscode: 503
NAME ENDPOINTS AGE
cortex-alertmanager 237d
cortex-compactor 237d
cortex-distributor 100.77.7.148:8080,100.77.7.154:8080 237d
cortex-distributor-headless 100.77.7.148:9095,100.77.7.154:9095 237d
cortex-ingester 100.77.7.42:8080,100.77.7.50:8080 237d
cortex-ingester-headless 100.77.7.42:9095,100.77.7.50:9095 237d
cortex-memberlist 100.77.7.148:7946,100.77.7.154:7946,100.77.7.42:7946 + 1 more... 237d
cortex-nginx 100.77.7.16:80,100.77.7.23:80 237d
cortex-querier 100.77.7.47:8080 237d
cortex-query-frontend 100.77.7.151:8080,100.77.7.155:8080 237d
cortex-query-frontend-headless 100.77.7.151:9095,100.77.7.155:9095 237d
cortex-ruler 237d
cortex-store-gateway 237d
cortex-store-gateway-headless 237d
from cortex-helm-chart.
Yep - the readiness checks are not healthy (as below) so pods never get to a ready state and are eventually killed.
1s Warning Unhealthy pod/cortex-store-gateway-0 Startup probe failed: Get "http://100.77.7.153:8080/ready": context deadline exceeded (Client.Timeout exceeded while awaiting headers) 25s Warning Unhealthy pod/cortex-compactor-0 Startup probe failed: HTTP probe failed with statuscode: 503
NAME ENDPOINTS AGE cortex-alertmanager 237d cortex-compactor 237d cortex-distributor 100.77.7.148:8080,100.77.7.154:8080 237d cortex-distributor-headless 100.77.7.148:9095,100.77.7.154:9095 237d cortex-ingester 100.77.7.42:8080,100.77.7.50:8080 237d cortex-ingester-headless 100.77.7.42:9095,100.77.7.50:9095 237d cortex-memberlist 100.77.7.148:7946,100.77.7.154:7946,100.77.7.42:7946 + 1 more... 237d cortex-nginx 100.77.7.16:80,100.77.7.23:80 237d cortex-querier 100.77.7.47:8080 237d cortex-query-frontend 100.77.7.151:8080,100.77.7.155:8080 237d cortex-query-frontend-headless 100.77.7.151:9095,100.77.7.155:9095 237d cortex-ruler 237d cortex-store-gateway 237d cortex-store-gateway-headless 237d
Can't reproduce. For me everything works fine. Try running
helm dep update .
before upgrading/installing to get the latest memcached.
I tested with:
helm install cortex . -n cortex --create-namespace -f ci/test-sts-values.yaml --set image.tag=v1.15.3
from cortex-helm-chart.
Have you tried an upgrade from 1.14.1 to 1.15.3, thanks? Looking at the CHANGELOG - https://github.com/cortexproject/cortex/blob/master/CHANGELOG.md?plain=1 doesn't appear that you need to do anything specific to upgrade on the application side.
If you can confirm the upgrade is okay via the helm chart method than I'll close this call and raise one on the cortex application, thanks.
from cortex-helm-chart.
https://github.com/cortexproject/cortex/blob/master/CHANGELOG.md?plain=1
Okay.
from cortex-helm-chart.
Found reason why my pods not starting - cortexproject/cortex#5449 - I'm using Azure and needed:
endpoint_suffix: blob.core.windows.net
Thanks for confirming chart was okay.
from cortex-helm-chart.
Related Issues (20)
- Exposing 9094 tcp port for alertmanager HOT 12
- Error with usage of Max Chunk Age HOT 1
- Memcache installed regardless of whether or not it is enabled
- Enable HPA on QueryFrontend when QueryScheduler is enabled HOT 1
- Add targetLabels to Prometheus ServiceMonitor CRDs HOT 1
- Add securityContext to memcached metrics containers
- New release? HOT 2
- Upgrade Image from v1.13.0 to v1.14.1 Using chart v2.1.0 fails error loading config from /etc/cortex/cortex.yaml: Error parsing config file: yaml: unmarshal errors HOT 4
- Expose ServiceMonitor labels for memcached instances and make format consistent HOT 2
- Default use of memberlist (ingester) and consul (distributor/ruler)? HOT 2
- Default values continue to cause alertmanager templates to be deleted HOT 4
- cortex-store-gateway-0 stateful set always shows as starting and also restarting frequently HOT 1
- Do not set .spec.replicas for store-gateway if autoscaling is enabled HOT 2
- Create a production ready values files HOT 4
- Optionally expose Ruler rules endpoint in nginx ingress
- Chart.yaml on release 2.1.0 uses cortex v1.14.1, should use 1.15.3 HOT 2
- Should purger work with query-frontend? HOT 2
- Disable livenessprobe by default HOT 1
- Create a new release HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from cortex-helm-chart.