Comments (10)
Currently we are out of monitoring due to the issue mentioned ,it will be really great if some one can have can assist on this.
from loki.
Was able to fix the affinity issue .Only issue can't able to figure out was config issue. PFB values.yaml.Kindly do the needful.Thanks
from loki.
Was able to fix all the issues .Thanks
from loki.
Was able to fix all the issues .Thanks
Care for sharing how you did solve the issue ?
from loki.
@krptg0 : Issue was related to the "parallelise_shardable_queries: true" variable used to be under "loki.query_range" in the chart version we used in 5.44.4 ,but after upgrade to 6.5.0 it should be moved to loki.structuredConfig.query_range ,which also needs to updated in the grafana documentation page for now until permanent fix . Seems this is a bug in the latest chart and I saw some user already derived case for the same few weeks back.
5.44.4
loki:
query_scheduler:
max_outstanding_requests_per_tenant: 2048
query_range:
parallelise_shardable_queries: false
split_queries_by_interval: 0
6.5.0
loki:
commonConfig:
replication_factor: 1
query_scheduler:
max_outstanding_requests_per_tenant: 2048
structuredConfig:
query_range:
parallelise_shardable_queries: true
Thanks
from loki.
@krptg0 I don't see that solving the issue. would you be able to share your config which worked for you .
helm upgrade --reset-values my-loki -f values-loki.yaml grafana/loki -n vector --debug --version 6.5.2 upgrade.go:155: [debug] preparing upgrade for my-loki upgrade.go:536: [debug] resetting values to the chart's original version coalesce.go:286: warning: cannot overwrite table with non table for loki.singleBinary.affinity (map[podAntiAffinity:map[requiredDuringSchedulingIgnoredDuringExecution:[map[labelSelector:map[matchLabels:map[app.kubernetes.io/component:single-binary]] topologyKey:kubernetes.io/hostname]]]]) coalesce.go:286: warning: cannot overwrite table with non table for loki.read.affinity (map[podAntiAffinity:map[requiredDuringSchedulingIgnoredDuringExecution:[map[labelSelector:map[matchLabels:map[app.kubernetes.io/component:read]] topologyKey:kubernetes.io/hostname]]]]) coalesce.go:286: warning: cannot overwrite table with non table for loki.tableManager.affinity (map[podAntiAffinity:map[requiredDuringSchedulingIgnoredDuringExecution:[map[labelSelector:map[matchLabels:map[app.kubernetes.io/component:table-manager]] topologyKey:kubernetes.io/hostname]]]]) coalesce.go:286: warning: cannot overwrite table with non table for loki.write.affinity (map[podAntiAffinity:map[requiredDuringSchedulingIgnoredDuringExecution:[map[labelSelector:map[matchLabels:map[app.kubernetes.io/component:write]] topologyKey:kubernetes.io/hostname]]]]) coalesce.go:286: warning: cannot overwrite table with non table for loki.gateway.affinity (map[podAntiAffinity:map[requiredDuringSchedulingIgnoredDuringExecution:[map[labelSelector:map[matchLabels:map[app.kubernetes.io/component:gateway]] topologyKey:kubernetes.io/hostname]]]]) coalesce.go:286: warning: cannot overwrite table with non table for loki.backend.affinity (map[podAntiAffinity:map[requiredDuringSchedulingIgnoredDuringExecution:[map[labelSelector:map[matchLabels:map[app.kubernetes.io/component:backend]] topologyKey:kubernetes.io/hostname]]]]) coalesce.go:286: warning: cannot overwrite table with non table for loki.singleBinary.affinity (map[podAntiAffinity:map[requiredDuringSchedulingIgnoredDuringExecution:[map[labelSelector:map[matchLabels:map[app.kubernetes.io/component:single-binary]] topologyKey:kubernetes.io/hostname]]]]) coalesce.go:286: warning: cannot overwrite table with non table for loki.read.affinity (map[podAntiAffinity:map[requiredDuringSchedulingIgnoredDuringExecution:[map[labelSelector:map[matchLabels:map[app.kubernetes.io/component:read]] topologyKey:kubernetes.io/hostname]]]]) coalesce.go:286: warning: cannot overwrite table with non table for loki.gateway.affinity (map[podAntiAffinity:map[requiredDuringSchedulingIgnoredDuringExecution:[map[labelSelector:map[matchLabels:map[app.kubernetes.io/component:gateway]] topologyKey:kubernetes.io/hostname]]]]) coalesce.go:286: warning: cannot overwrite table with non table for loki.backend.affinity (map[podAntiAffinity:map[requiredDuringSchedulingIgnoredDuringExecution:[map[labelSelector:map[matchLabels:map[app.kubernetes.io/component:backend]] topologyKey:kubernetes.io/hostname]]]]) coalesce.go:286: warning: cannot overwrite table with non table for loki.write.affinity (map[podAntiAffinity:map[requiredDuringSchedulingIgnoredDuringExecution:[map[labelSelector:map[matchLabels:map[app.kubernetes.io/component:write]] topologyKey:kubernetes.io/hostname]]]]) coalesce.go:286: warning: cannot overwrite table with non table for loki.tableManager.affinity (map[podAntiAffinity:map[requiredDuringSchedulingIgnoredDuringExecution:[map[labelSelector:map[matchLabels:map[app.kubernetes.io/component:table-manager]] topologyKey:kubernetes.io/hostname]]]]) Error: UPGRADE FAILED: execution error at (loki/templates/validate.yaml:40:4): You must provide a schema_config for Loki, one is not provided as this will be individual for every Loki cluster. See https://grafana.com/docs/loki/latest/operations/storage/schema/ for schema information. For quick testing (with no persistence) add
--set loki.useTestSchema=true helm.go:84: [debug] execution error at (loki/templates/validate.yaml:40:4): You must provide a schema_config for Loki, one is not provided as this will be individual for every Loki cluster. See https://grafana.com/docs/loki/latest/operations/storage/schema/ for schema information. For quick testing (with no persistence) add
--set loki.useTestSchema=true UPGRADE FAILED main.newUpgradeCmd.func2 helm.sh/helm/v3/cmd/helm/upgrade.go:229 github.com/spf13/cobra.(*Command).execute github.com/spf13/[email protected]/command.go:983 github.com/spf13/cobra.(*Command).ExecuteC github.com/spf13/[email protected]/command.go:1115 github.com/spf13/cobra.(*Command).Execute github.com/spf13/[email protected]/command.go:1039 main.main helm.sh/helm/v3/cmd/helm/helm.go:83 runtime.main runtime/proc.go:267 runtime.goexit runtime/asm_amd64.s:1650
I am currently on 5.47.2
helm ls -a NAME NAMESPACE REVISION UPDATED STATUS CHART APP VERSION my-grafana vector 2 2024-05-21 11:41:27.3348885 +0530 IST deployed grafana-7.3.11 10.4.1 my-loki vector 1 2024-05-21 10:58:05.3864634 +0530 IST deployed loki-5.47.2 2.9.6
from loki.
helm upgrade --reset-values my-loki -f values_29042024.yaml grafana/loki -n vector --debug --version 6.5.2
The Helm file attached was suitable for upgrading, but a couple of pods encountered errors still.
output.txt
from loki.
in gateway pod:
Events:
Type Reason Age From Message
---- ------ ---- ---- -------
Normal Scheduled 17m default-scheduler Successfully assigned vector/my-loki-gateway-548dd78cd8-wrgnd to ip-10-0-1-223.eu-west-2.compute.internal
Normal Pulled 17m kubelet Container image "docker.io/nginxinc/nginx-unprivileged:1.24-alpine" already present on machine
Normal Created 17m kubelet Created container nginx
Normal Started 17m kubelet Started container nginx
Warning Unhealthy 2m8s (x101 over 16m) kubelet Readiness probe errored: strconv.Atoi: parsing "http": invalid syntax
from loki.
NAME READY STATUS RESTARTS AGE
loki-backend-0 2/2 Running 3 (12m ago) 12m
loki-backend-1 1/2 CrashLoopBackOff 3 (13s ago) 75s
loki-canary-9hrdt 1/1 Running 0 25m
loki-canary-gqktk 1/1 Running 0 24m
loki-canary-q6r28 1/1 Running 0 23m
loki-canary-rbgbl 1/1 Running 0 26m
loki-read-b76c4bff4-kv9qj 1/1 Running 0 81s
loki-read-b76c4bff4-sjjg4 1/1 Running 0 50s
loki-write-0 1/1 Running 0 25m
loki-write-1 0/1 Running 0 8s
loki-write-2 1/1 Running 0 80s
my-grafana-7cfd6ffc59-cjhtp 1/1 Running 0 27m
my-loki-chunks-cache-0 2/2 Running 0 12m
my-loki-gateway-548dd78cd8-wrgnd 0/1 Running 0 27m
my-loki-gateway-66f8b59d65-75z95 0/1 Running 0 34m
my-loki-grafana-agent-operator-6b4f987557-655hx 1/1 Running 0 27m
my-loki-logs-5sr6b 2/2 Running 0 2d10h
my-loki-logs-cdskt 2/2 Running 0 2d11h
my-loki-logs-jvdnv 2/2 Running 0 21m
my-loki-logs-z28sp 2/2 Running 0 2d11h
my-loki-results-cache-0 2/2 Running 0 12m
my-vector-0 1/1 Running 0 26m
$ kubectl logs loki-backend-1 -c loki
panic: runtime error: invalid memory address or nil pointer dereference
[signal SIGSEGV: segmentation violation code=0x1 addr=0x288 pc=0x22f02b0]
goroutine 1 [running]:
github.com/grafana/loki/v3/pkg/loki.(*Loki).updateConfigForShipperStore(0xc000a2ff40?)
/src/loki/pkg/loki/modules.go:755 +0xb0
github.com/grafana/loki/v3/pkg/loki.(*Loki).initBloomStore(0xc000bf3500)
/src/loki/pkg/loki/modules.go:715 +0x68
github.com/grafana/dskit/modules.(*Manager).initModule(0xc000a62708, {0x7ffd42c2c27d, 0x7}, 0x1?, 0xc0016800c0?)
/src/loki/vendor/github.com/grafana/dskit/modules/modules.go:136 +0x1f7
github.com/grafana/dskit/modules.(*Manager).InitModuleServices(0x0?, {0xc0008f4910, 0x1, 0xc000c36360?})
/src/loki/vendor/github.com/grafana/dskit/modules/modules.go:108 +0xd8
github.com/grafana/loki/v3/pkg/loki.(*Loki).Run(0xc000bf3500, {0x0?, {0x4?, 0x3?, 0x4912940?}})
/src/loki/pkg/loki/loki.go:453 +0x9d
main.main()
/src/loki/cmd/loki/main.go:122 +0x113b
from loki.
fixed this making change to helm
https://github.com/grafana/loki/blob/main/production/helm/loki/values.yaml#L337-L345
readinessProbe:
httpGet:
path: /
port: http-metrics
initialDelaySeconds: 15
timeoutSeconds: 1``
from loki.
Related Issues (20)
- loki-rules in alertmanager not creating ALERTS metrics
- Loki: Logs Disappear from Logs volume:
- Loki write pods in crashBackOff loop - Migrating to TSDB HOT 1
- Ruler config CRD remote_write client authorisation type needs to be basic and bearer.
- Unrecognized storage client after schema upgrade
- Docs feedback: /docs/sources/send-data/promtail/stages/pack.md
- track query bytes used by explore logs queries
- promtail support log rotations
- Container not found error for azure blob storage HOT 1
- no matches for kind "Statefulset" in group "apps"
- Failed to get shards for expressions
- patterns API: always returns 1-2 data points no matter the time range
- Slow performance/timeout for count_over_time on < 1,000 logs
- Queries with multiple patterns chained with `or` produce incorrect results
- Grafana/Loki slow to load more than 2k logs
- [Helm] Ingester rollout-group collides with Mimir
- Low performance of bloom compaction in Loki 3.0: OOMs and crashes HOT 1
- Can't connect Loki to Grafana in k8s
- Alertmanager client config present in Ruler CRD, however no config is generated
- [otlp/logrecords]: Ingestion of native OTLP LogRecords blew up memory usage in Distributors and Ingesters..
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from loki.