- HW Assembling
- Hosts Setup
- Control Plane Configuration
- Kubernetes Networking
- Adding Nodes
- Essential Add-ons
- Deploying an ML Web App
- Licensing, Authors, and Acknowledgements
I used Raspberry Pi Imager to flash the microSD cards, with Ubuntu Server 20.04.2 LTS 64-bit as the OS.
I am assuming the Raspberry Pi will be connected to your local network through an Ethernet cable; however, if you want to use WiFi, you can follow the instructions below.
Boot your host. Since Ubuntu comes with ssh already installed, you can ssh into it from your Mac or whatever system you prefer to work from. I decided to install under the default user ubuntu; however, if you want you can create a different user, just remember to add it to the sudoers group with:
sudo usermod -aG sudo newuser
It is better to assign your hosts static IPs, to avoid trouble when you reboot, as your router might otherwise assign a different IP.
If your Ubuntu image is provisioned with cloud-init, you will need to disable its network configuration. To do so, create the following file:
sudo nano /etc/cloud/cloud.cfg.d/99-disable-network-config.cfg
network: {config: disabled}
To assign a static IP to your network interface, edit the file /etc/netplan/01-netcfg.yaml
with your favourite text editor:
sudo nano /etc/netplan/01-netcfg.yaml
and add the following:
network:
  version: 2
  renderer: networkd
  ethernets:
    eth0:
      dhcp4: no
      addresses:
        - 192.168.0.201/24
      gateway4: 192.168.0.1
      nameservers:
        addresses: [8.8.8.8, 1.1.1.1]
where 192.168.0.201 is the IP you want to assign to your host and 192.168.0.1 is the IP of your router.
You can now apply the new configuration:
sudo netplan apply
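You can verify that the interface picked up the static address (assuming the eth0 interface name used above) and that the gateway is reachable:
ip addr show eth0
ping -c 3 192.168.0.1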
Alternatively, you can assign the static IP from your router's configuration, by reserving the host's MAC address in its DHCP settings.
SSH keys provide an easy, secure way of logging into your server and are recommended for all users. You will log in to your hosts from your machine without even having to enter a password. If you already have an ssh key on your machine, all you have to do is:
ssh-copy-id username@remote_host
You can check if you have one by running:
ls ~/.ssh
If you have the two files id_rsa and id_rsa.pub, then you are set. Otherwise you will have to first create an RSA key pair by running:
ssh-keygen
Now you can log in to your hosts:
ssh ubuntu@remote_host
Now install Docker:
sudo apt-get update
sudo apt install -y docker.io
cat <<EOF | sudo tee /etc/docker/daemon.json
{
  "exec-opts": ["native.cgroupdriver=systemd"],
  "log-driver": "json-file",
  "log-opts": {
    "max-size": "100m"
  },
  "storage-driver": "overlay2"
}
EOF
You need to change the default cgroup driver Docker uses from cgroupfs to systemd, so that systemd acts as the cgroup manager and there is only one cgroup manager in use. On the Raspberry Pi you also need to enable the memory cgroup in the kernel command line:
sudo sed -i '$ s/$/ cgroup_enable=cpuset cgroup_enable=memory cgroup_memory=1 swapaccount=1/' /boot/firmware/cmdline.txt
Enable docker to start at boot:
sudo systemctl enable docker.service
You will have to reboot your host, then check that Docker has been installed correctly by running:
sudo docker info
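As a further smoke test (it pulls a tiny image from Docker Hub and runs it), you can try:
sudo docker run --rm hello-world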
Kubernetes needs iptables to be configured to see bridged network traffic. You can do this by changing the sysctl config:
cat <<EOF | sudo tee /etc/sysctl.d/k8s.conf
net.bridge.bridge-nf-call-ip6tables = 1
net.bridge.bridge-nf-call-iptables = 1
EOF
sudo sysctl --system
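These settings only take effect while the br_netfilter kernel module is loaded; the kubeadm docs also recommend loading it explicitly and persisting it across reboots:
sudo modprobe br_netfilter
echo br_netfilter | sudo tee /etc/modules-load.d/k8s.conf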
# Add the packages.cloud.google.com apt key
curl https://packages.cloud.google.com/apt/doc/apt-key.gpg | sudo apt-key add -
# Add the Kubernetes repo
cat <<EOF | sudo tee /etc/apt/sources.list.d/kubernetes.list
deb https://apt.kubernetes.io/ kubernetes-xenial main
EOF
# Update the apt cache and install kubelet, kubeadm, and kubectl
# (Output omitted)
sudo apt update && sudo apt install -y kubelet kubeadm kubectl
# Disable (mark as held) updates for the Kubernetes packages
sudo apt-mark hold kubelet kubeadm kubectl
That's it for the host configuration. Remember you have to do this for your control-plane host and for all your worker nodes.
Generate a bootstrap token:
# Generate a bootstrap token to authenticate nodes joining the cluster
TOKEN=$(sudo kubeadm token generate)
echo $TOKEN
Initialize the Control Plane:
sudo kubeadm init --token=${TOKEN} --kubernetes-version=v1.21.0 --pod-network-cidr=10.244.0.0/16
If everything went ok, you should see a message like this one:
Your Kubernetes control-plane has initialized successfully!
To start using your cluster, you need to run the following as a regular user:
mkdir -p $HOME/.kube
sudo cp -i /etc/kubernetes/admin.conf $HOME/.kube/config
sudo chown $(id -u):$(id -g) $HOME/.kube/config
You should now deploy a pod network to the cluster.
Run "kubectl apply -f [podnetwork].yaml" with one of the options listed at:
https://kubernetes.io/docs/concepts/cluster-administration/addons/
Then you can join any number of worker nodes by running the following on each as root:
kubeadm join 192.168.0.201:6443 --token zqqoy7.9oi8dpkfmqkop2p5 \
--discovery-token-ca-cert-hash sha256:71270ea137214422221319c1bdb9ba6d4b76abfa2506753703ed654a90c4982b
You can now validate that the Control Plane has been installed with the kubectl get nodes command:
kubectl get nodes
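The control-plane node will report NotReady until a pod network add-on is installed (next section); the output will look roughly like this (illustrative only, your hostname, age, and version will differ):
NAME                  STATUS     ROLES                  AGE   VERSION
ubuntu-controlplane   NotReady   control-plane,master   2m    v1.21.0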
Kubernetes defines a network plugin interface called the CNI (Container Network Interface), and third parties provide the plugins that actually implement the pod network. Plugins differ in the details, but the general idea is a single large, flat address space: each node is allocated a subset of addresses from it, and as pods are spun up and scheduled to a particular node, each pod gets an IP from the range allocated to that node.
We will use the Flannel CNI add-on.
# Download the Flannel YAML manifest and apply it
kubectl apply -f https://raw.githubusercontent.com/coreos/flannel/master/Documentation/kube-flannel.yml
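You can check that the Flannel pods come up and that the node switches to Ready (this version of the manifest deploys Flannel into kube-system with the label app=flannel):
kubectl -n kube-system get pods -l app=flannel
kubectl get nodes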
To join a worker node to the cluster, first run the following on the control plane:
# Add a worker node
# Get join token
kubeadm token list
# If the token is expired, generate a new one with the command:
sudo kubeadm token create
# Get Discovery Token CA cert Hash
openssl x509 -pubkey -in /etc/kubernetes/pki/ca.crt | openssl rsa -pubin -outform der 2>/dev/null | openssl dgst -sha256 -hex | sed 's/^.* //'
# Get your API server endpoint with kubectl command:
kubectl cluster-info
Now log into the node and run the following command:
# Now we have everything we need to join a worker node to the Cluster
kubeadm join \
<control-plane-host>:<control-plane-port> \
--token <token> \
--discovery-token-ca-cert-hash sha256:<hash>
sudo kubeadm join 192.168.0.201:6443 --token w4r8s6.w8yzb2pvvnlhb9cp --discovery-token-ca-cert-hash sha256:f6d91b2f0bf4bd26a3c66894c5ea9db5fe6bc7499ef817d8847e074477261d10
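Back on the control plane, the new node should appear in the node list (it can take a minute to become Ready):
kubectl get nodes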
I had only two Raspberry Pis at my disposal, so I did not want to dedicate one of them solely to the control plane. That is why I removed the taint automatically applied to the control-plane node, by running:
kubectl edit node ubuntu-controlplane
and deleted the following 2 lines:
- effect: NoSchedule
  key: node-role.kubernetes.io/master
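Alternatively, the same taint can be removed non-interactively; the trailing dash removes the taint with the key shown above:
kubectl taint nodes --all node-role.kubernetes.io/master-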
If you need to access a dashboard served from inside the cluster, you have to create an SSH tunnel from your local machine to the control plane to forward the requests. Run the following command on your local machine:
ssh -L 9999:control-plane-ip:port -N -f -l user control-plane-ip
For example, if you started a proxy on the control plane to access the API server with:
kubectl proxy &
since the proxy listens on port 8001, you can run:
ssh -L 8001:control-plane-ip:8001 -N -f -l user control-plane-ip
and you will be able to access the API server on your local machine on port 8001.
- Helm: Kubernetes package manager, to install packaged applications (charts)
- Nginx Ingress controller: a specialized load balancer for Kubernetes
- Node Exporter: a tool that collects health and hardware metrics from each node and exports them in a format that Prometheus can scrape.
- Prometheus: an open-source systems monitoring and alerting toolkit
- Grafana: an open-source analytics and visualization platform; we will use its customizable dashboards to make sense of the collected metrics and monitor our apps.
- cert-manager: a tool to issue and manage TLS certificates
- Kubernetes Dashboard: a web UI to manage cluster resources
Helm is a package manager for Kubernetes that allows developers and operators to more easily package, configure, and deploy applications and services onto Kubernetes clusters. With Helm you can:
- Install software.
- Automatically install software dependencies.
- Upgrade software.
- Configure software deployments.
- Fetch software packages from repositories.
To install it, run the following:
curl -s https://raw.githubusercontent.com/helm/helm/master/scripts/get-helm-3 | bash
Add the chart repositories:
helm repo add prometheus-community https://prometheus-community.github.io/helm-charts
helm repo add grafana https://grafana.github.io/helm-charts
helm repo add infobloxopen https://infobloxopen.github.io/cert-manager/
helm repo add stable https://charts.helm.sh/stable/
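After adding the repositories, refresh the local chart index:
helm repo update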
A Kubernetes Ingress controller is a specialized load balancer for Kubernetes environments. Ingress controllers:
- Accept traffic from outside the Kubernetes platform, and load balance it to pods (containers) running inside the platform
- Can manage egress traffic within a cluster for services which need to communicate with other services outside of a cluster
- Are configured using the Kubernetes API to deploy objects called “Ingress Resources”
- Monitor the pods running in Kubernetes and automatically update the load‑balancing rules when pods are added or removed from a service
Run the following command, as given in the deployment section of the Nginx Ingress controller documentation:
kubectl apply -f https://raw.githubusercontent.com/kubernetes/ingress-nginx/controller-v1.0.4/deploy/static/provider/cloud/deploy.yaml
You can now check the status:
kubectl -n ingress-nginx get pod
In traditional cloud environments, where network load balancers are available on-demand, a single Kubernetes manifest suffices to provide a single point of contact to the NGINX Ingress controller to external clients and, indirectly, to any application running inside the cluster. However, Kubernetes does not have a built-in network load-balancer implementation. Bare-metal environments need another solution, like MetalLB. MetalLB is a network load balancer and can expose cluster services on a dedicated IP address on the network, allowing external clients to connect to services inside the Kubernetes cluster.
To install MetalLB, proceed as follows:
kubectl apply -f https://raw.githubusercontent.com/metallb/metallb/v0.11.0/manifests/namespace.yaml
kubectl apply -f https://raw.githubusercontent.com/metallb/metallb/v0.11.0/manifests/metallb.yaml
# On first install only
kubectl create secret generic -n metallb-system memberlist --from-literal=secretkey="$(openssl rand -base64 128)"
MetalLB remains idle until configured. You need to supply a ConfigMap with the addresses it can assign to Kubernetes LoadBalancer services. The addresses must be free for MetalLB to use and not assigned to other hosts, and the DHCP server in your router should not hand out addresses from that range. In the config map below I assigned the range 192.168.1.240-192.168.1.250; pick a free range in your own subnet.
# Create the config map
cat <<EOF | kubectl create -f -
apiVersion: v1
kind: ConfigMap
metadata:
  namespace: metallb-system
  name: config
data:
  config: |
    address-pools:
    - name: default
      protocol: layer2
      addresses:
      - 192.168.1.240-192.168.1.250
EOF
Now you can expose a deployment using LoadBalancer-type Kubernetes services.
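As a quick test (nginx is used here just as a placeholder image), you can create a deployment, expose it as a LoadBalancer service, and watch MetalLB assign an external IP from the pool:
kubectl create deployment nginx --image=nginx
kubectl expose deployment nginx --type=LoadBalancer --port=80
kubectl get svc nginx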
You need to run the following on each node. First, download and extract the Node Exporter binary:
curl -SL https://github.com/prometheus/node_exporter/releases/download/v1.0.1/node_exporter-1.0.1.linux-armv7.tar.gz > node_exporter.tar.gz && sudo tar -xvf node_exporter.tar.gz -C /usr/local/bin/ --strip-components=1
then create the file /etc/systemd/system/nodeexporter.service so that Node Exporter starts automatically each time the node boots:
[Unit]
Description=NodeExporter
[Service]
TimeoutStartSec=0
ExecStart=/usr/local/bin/node_exporter
[Install]
WantedBy=multi-user.target
Now you can start the services:
sudo systemctl daemon-reload && sudo systemctl enable nodeexporter && sudo systemctl start nodeexporter
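You can verify that the exporter is serving metrics on its default port 9100:
curl -s http://localhost:9100/metrics | head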
Prometheus is an open-source systems monitoring and alerting toolkit originally built at SoundCloud.
Edit the targets line in the prometheus.yaml file below with the addresses or hostnames of your nodes:
apiVersion: v1
kind: Namespace
metadata:
  name: monitoring
---
apiVersion: v1
kind: ConfigMap
metadata:
  name: prometheus-config
  namespace: monitoring
data:
  prometheus.yaml: |
    global:
      scrape_interval: 15s
      external_labels:
        monitor: 'k3s-monitor'
    scrape_configs:
      - job_name: 'prometheus'
        scrape_interval: 5s
        static_configs:
          - targets: ['localhost:9090']
      - job_name: 'K3S'
        static_configs:
          - targets: ['192.168.0.201:9100', '192.168.0.202:9100', '192.168.0.203:9100', '192.168.0.204:9100']
---
apiVersion: apps/v1
kind: Deployment
metadata:
  name: prometheus-deployment
  namespace: monitoring
  labels:
    app: prometheus
spec:
  replicas: 1
  selector:
    matchLabels:
      app: prometheus
  template:
    metadata:
      labels:
        app: prometheus
    spec:
      containers:
        - name: prometheus
          image: prom/prometheus
          volumeMounts:
            - name: config-volume
              mountPath: /etc/prometheus/prometheus.yml
              subPath: prometheus.yaml
          ports:
            - containerPort: 9090
      volumes:
        - name: config-volume
          configMap:
            name: prometheus-config
---
kind: Service
apiVersion: v1
metadata:
  namespace: monitoring
  name: prometheus-service
spec:
  selector:
    app: prometheus
  ports:
    - name: promui
      protocol: TCP
      port: 9090
      targetPort: 9090
To install, run the following:
kubectl apply -f prometheus.yaml
This sets up our scrape config for Prometheus and deploys it into our cluster in the monitoring namespace. We create a service for other pods in the cluster to be able to access it, but we don't give it a load balancer or ingress route because we explicitly do not want to expose it outside our cluster.
The service is only reachable from inside the cluster; to reach it from your machine, run a port-forward on the control plane:
kubectl port-forward service/prometheus-service -n monitoring --address 0.0.0.0 9090:9090
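With the port-forward running you can open the Prometheus UI at http://<control-plane-ip>:9090; from another terminal you can also probe Prometheus's readiness endpoint:
curl -s http://localhost:9090/-/ready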
You can install Grafana using the manifest files in the grafana-kustom subdir:
kubectl apply --kustomize grafana-kustom
# get admin password
kubectl get secret --namespace monitoring grafana -o jsonpath="{.data.admin-password}" | base64 --decode ; echo
# check the service is up and running
kubectl get svc -n monitoring
It will show the external IP that the Load Balancer has assigned to the service. You can access the grafana dashboard at that address and port.
Once you are logged in to the Grafana dashboard (user admin, with the password retrieved above, or the default admin/admin), you can add the Prometheus data source to Grafana using the URL http://prometheus-service:9090, and import the Prometheus Node Exporter dashboard from https://grafana.com/grafana/dashboards/1860.
Add the Jetstack chart repository and install cert-manager with Helm:
helm repo add jetstack https://charts.jetstack.io
kubectl create ns cert-manager
helm install \
cert-manager jetstack/cert-manager \
--namespace cert-manager \
--version v0.16.0 \
--set installCRDs=true
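Check that the cert-manager pods come up before issuing certificates:
kubectl get pods -n cert-manager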
You can find a guide to issuing certificates with Cert-manager here.
You can install the Kubernetes dashboard:
kubectl apply -f https://raw.githubusercontent.com/kubernetes/dashboard/v2.3.1/aio/deploy/recommended.yaml
To access the dashboard you need to start a proxy on the control plane to access the API server:
kubectl proxy
Now you can access the dashboard:
http://localhost:8001/api/v1/namespaces/kubernetes-dashboard/services/https:kubernetes-dashboard:/proxy/
Alternatively you can forward the port for the dashboard only:
kubectl port-forward -n kubernetes-dashboard service/kubernetes-dashboard 8080:443
and access the dashboard with:
https://localhost:8080
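The token you log in with belongs to a service account. If you have not created one yet, here is a minimal sketch (the dashboard-admin-sa name matches the secret referenced below; binding cluster-admin is very permissive, but acceptable for a home lab):
kubectl create serviceaccount dashboard-admin-sa
kubectl create clusterrolebinding dashboard-admin-sa --clusterrole=cluster-admin --serviceaccount=default:dashboard-admin-sa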
Either way you will need the service account's authentication token. List secrets using:
kubectl get secrets
You will see a secret named dashboard-admin...
kubectl describe secret dashboard-admin-sa-token-kw7vn
Copy and paste the token into the authentication page.
We will now deploy an ML web app in our cluster: a semantic search engine built with sentence transformers and Faiss. A semantic search engine builds numerical representations of text queries using state-of-the-art language models, indexes documents in a high-dimensional vector space, and measures how similar a query vector is to the indexed documents. The web app is built with Streamlit, an open-source Python library that makes it easy to create applications for machine learning and data science. You can find more details in this article by Kostas Stathoulopoulos. I assume you have some familiarity with Docker; otherwise please have a look here. You can also create a free account on Docker Hub.
On your machine, download the git repo:
git clone https://github.com/kstathou/vector_engine
# Build the Docker image: use your Docker Hub username and choose a name for the app
docker build -t <user>/<image_name> .
# Run the image
docker run -p 8501:8501 <user>/<image_name>
Now that the web app is running on your local machine, you can push it to Docker Hub so that it can be retrieved from anywhere:
docker push <user>/<image_name>
It will be a bit slow the first time you launch it, as it has to download the models into cache.
In order to deploy on Kubernetes, you must create a deployment YAML file like the one below. For a more detailed description of how to deploy applications into Kubernetes, have a look at this. First, we create a deployment, i.e. the pods that host the application containers.
deployment.yaml
:
apiVersion: apps/v1
kind: Deployment
metadata:
  labels:
    app: frontend
  name: frontend
spec:
  replicas: 2
  selector:
    matchLabels:
      app: frontend
  template:
    metadata:
      labels:
        app: frontend
    spec:
      containers:
        - image: santamm/ml-pi-app:latest
          imagePullPolicy: IfNotPresent
          name: frontend
          ports:
            - containerPort: 8501  # Streamlit listens on 8501
To make the application accessible from outside the cluster, we need to create a service. A service is a Kubernetes object that receives HTTP requests and load balances them among the Pods under its control. Note that since the Streamlit container listens on port 8501, the service maps its port 5000 to targetPort 8501.
service.yaml
:
apiVersion: v1
kind: Service
metadata:
  labels:
    app: frontend
  name: frontend-svc
spec:
  ports:
    - port: 5000
      protocol: TCP
      targetPort: 8501
  selector:
    app: frontend
  type: ClusterIP
An Ingress resource listens on port 80 and routes requests to the backend service.
ingress.yaml
:
apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
  name: frontend-ingress
spec:
  rules:
    - http:
        paths:
          - path: /api
            pathType: Prefix
            backend:
              service:
                name: frontend-svc
                port:
                  number: 5000
Now you can deploy from your control plane:
kubectl apply -f deployment.yaml
kubectl apply -f service.yaml
kubectl apply -f ingress.yaml
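You can check that everything came up:
kubectl rollout status deployment/frontend
kubectl get svc frontend-svc
kubectl get ingress frontend-ingress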
- Identify the name of your wireless network interface:
ls /sys/class/net
# or
ip link show
You will get a list of network interfaces; the wireless one usually starts with a 'w' (e.g. wlan0).
- Navigate to the /etc/netplan directory and locate the appropriate Netplan configuration file. Since we disabled the cloud-init network config, this is the same file we used to assign a static IP to the eth0 network interface:
sudo nano /etc/netplan/01-netcfg.yaml
You will have to add a wifis block under the top-level network: key, at the same level as ethernets:, like this:
  wifis:
    wlan0:
      dhcp4: no
      access-points:
        "SSID_name":
          password: "00000000"
      addresses: [192.168.0.102/24]
      gateway4: 192.168.0.1
      nameservers:
        addresses: [8.8.8.8, 4.2.2.2]
Otherwise, if you are connecting through a WiFi network that uses WPA-Enterprise authentication (for example an institutional hotspot), add the following lines instead; here the address is assigned via DHCP, so no static address is configured:
  wifis:
    wlan0:
      dhcp4: true
      optional: true
      access-points:
        "SSID_name":
          identity: "yourUsername@yourInstitution"
          password: "YOUR_password_here"
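Apply the new configuration as before:
sudo netplan apply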
For licensing see LICENSE file.