Comments (4)
simply open a shell on node and kill the kubelet by command: kill
That isn't simulating a crash. A plain kill sends a SIGTERM to the process; kubelet traps the signal and terminates normally (with an exit code of zero). If you send a SIGKILL (kill -9), you get the auto-restart that you'd expect, which is what happens when kubelet exits unexpectedly due to a panic, etc.:
> systemctl status kubelet
● kubelet.service - Kubernetes Kubelet
Loaded: loaded (/etc/systemd/system/kubelet.service; disabled; preset: disabled)
Active: active (running) since Fri 2024-05-10 18:10:35 UTC; 2min 7s ago
Docs: https://github.com/kubernetes/kubernetes
Process: 2744679 ExecStartPre=/sbin/iptables -P FORWARD ACCEPT -w 5 (code=exited, status=0/SUCCESS)
Main PID: 2744680 (kubelet)
Tasks: 12 (limit: 18811)
Memory: 29.3M
CPU: 1.407s
CGroup: /runtime.slice/kubelet.service
└─2744680 /usr/bin/kubelet --node-ip=192.168.174.139 --cloud-provider=external --hostname-override=ip-192-168-174-139.us-west-2.compute.internal --config=/etc/kubernetes/kube>
> sudo kill -9 2744680
> systemctl status kubelet
● kubelet.service - Kubernetes Kubelet
Loaded: loaded (/etc/systemd/system/kubelet.service; disabled; preset: disabled)
Active: activating (auto-restart) (Result: signal) since Fri 2024-05-10 18:13:02 UTC; 1s ago
Docs: https://github.com/kubernetes/kubernetes
Process: 2744679 ExecStartPre=/sbin/iptables -P FORWARD ACCEPT -w 5 (code=exited, status=0/SUCCESS)
Process: 2744680 ExecStart=/usr/bin/kubelet $NODEADM_KUBELET_ARGS (code=killed, signal=KILL)
Main PID: 2744680 (code=killed, signal=KILL)
CPU: 1.526s
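If you want to see the SIGTERM vs. SIGKILL difference without touching a real node, here is a minimal sketch. It does not run kubelet; the child process is just a stand-in that traps SIGTERM and exits with code zero, and the names CHILD and run_and_signal are made up for this illustration. It assumes a POSIX system with Python 3.

```python
import signal
import subprocess
import sys

# Stand-in for a daemon that shuts down gracefully on SIGTERM.
# This is NOT kubelet; it only mimics "trap the signal, exit 0".
CHILD = """
import signal, sys, time
signal.signal(signal.SIGTERM, lambda signum, frame: sys.exit(0))
print("ready", flush=True)  # tell the parent the handler is installed
while True:
    time.sleep(1)
"""


def run_and_signal(sig):
    """Start the stand-in child, send it `sig`, and return its exit status.

    subprocess reports death-by-signal as a negative number (-N for signal N).
    """
    proc = subprocess.Popen(
        [sys.executable, "-c", CHILD],
        stdout=subprocess.PIPE,
        text=True,
    )
    proc.stdout.readline()  # block until the child has printed "ready"
    proc.send_signal(sig)
    status = proc.wait()
    proc.stdout.close()
    return status


if __name__ == "__main__":
    # SIGTERM is trapped, so the child exits normally with status 0.
    print("SIGTERM ->", run_and_signal(signal.SIGTERM))
    # SIGKILL cannot be trapped, so the child is killed by the signal.
    print("SIGKILL ->", run_and_signal(signal.SIGKILL))
```

The first case is a clean exit from the supervisor's point of view, while the second is reported as a death by signal, which matches the (code=killed, signal=KILL) lines in the systemctl output above.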
from amazon-eks-ami.
I will try, but it doesn't matter that much.
We have experienced an issue (twice) where the kubelet crashed because of memory pressure and never recovered.
As a result, the pods were stuck forever. The only way to recover was to manually delete the EC2 node in the AWS console.
GKE and AKS implement a simple kubelet watchdog, so the kubelet restarts immediately.
If kubelet crashes, it's going to be restarted by systemd; see my example above. It sounds like you're running into problems with kubeReserved memory being insufficient, which is legit. That's discussed in #1141 and #1145.
Thanks
I think this is what we actually had: #1145