jthomperoo / custom-pod-autoscaler
Custom Pod Autoscaler program and base images, allowing the creation of Custom Pod Autoscalers.
License: Apache License 2.0
Each commit should result in a new image being pushed to Docker Hub.
Rather than being triggered only by the timer at set intervals, the Custom Pod Autoscaler evaluation could also be triggered manually, through a REST endpoint.
This would allow users to send an HTTP request to the Custom Pod Autoscaler and start an evaluation immediately, rather than waiting for the interval to expire.
Allow configuration of the following for the API:
2 flask vulnerabilities found in …/app/requirements.txt
Remediation
Upgrade flask to version 1.0.0 or later. For example:
flask>=1.0.0
Should have two modes, api and scaler. api is when the evaluation is triggered through the API, as a read-only event. scaler is when the evaluation is triggered by the regular scaler logic.
Implement a configurable timeout for fetching metrics/evaluations - stops the CPA being unresponsive if a script fails, e.g. due to a pod terminating/crashing halfway through an evaluation.
The CPA evaluator could decide which pods to terminate when scaling down, rather than relying on the Kubernetes decision making which bases it on how old the pod is.
This could be a list of pods with priorities assigned to them, with the lowest priority pods terminated when scaling down as needed.
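A sketch of what the evaluator's output might look like under this proposal; all field names here are hypothetical:

```yaml
targetReplicas: 2
podPriorities:
- pod: example-deployment-gpu-worker-a
  priority: 10 # higher priority, terminated last
- pod: example-deployment-gpu-worker-b
  priority: 1  # lowest priority, terminated first when scaling down
```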
Env vars should be lowercase, rather than uppercase to match the YAML config.
This will help when writing unit tests for other packages.
Hooks would be points at which a user-defined shell command is executed, to allow users to have greater control of the Custom Pod Autoscaler.
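For example, hooks might be configured like this; the hook names and fields are illustrative assumptions, not the project's actual configuration:

```yaml
preScale:
  type: "shell"
  shell: "echo 'about to scale'"
postScale:
  type: "shell"
  shell: "curl -X POST https://0.0.0.0:5000/notify"
```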
This code seems to be preventing me from scaling a deployment to zero replicas:
custom-pod-autoscaler/scale/scale.go
Line 112 in 7b1d427
My use case is a queue-based processing system with a bunch of GPUs, so scale to zero is rather important (and also why I can't use the built-in HorizontalPodAutoscaler).
At the minute, the metric is run for every pod in a deployment; however, it would be useful if you could run the metric only once per deployment, rather than once per pod.
A new configuration option, run-mode, would help this, with these options:
per-pod - runs the metric per pod.
per-deployment - runs the metric per deployment.
Metrics should be retrievable through an API endpoint:
GET /metrics
Returning:
[
{
"deployment": "example-deployment",
"metrics": [
{
"pod": "example-pod",
"value": "value"
}
]
}
]
These metrics should be calculated at request time.
Would require a change to the current configuration to support this, e.g. instead of
metric: "shell command"
It should allow you to specify which way to pass data, for shell commands:
metric:
type: "pipe"
pipe: "shell command"
For something like an HTTP request:
metric:
type: "http"
http:
method: "GET"
endpoint: "https://0.0.0.0:5000/metrics"
This would also apply to evaluations, so a more general configuration option to specify a series of different data transfer options would be useful.
This would be useful for introducing the framework.
If a new release is built on GitHub, it should result in a new image being pushed to Docker Hub and that image should be tagged as latest.
Allow setting a start time for the scaler, for example you could provide the time:
0001-01-01 00:00:15 +0000 UTC
This would cause the scaler to start only at the next round 15 seconds. It would be set up with a default of
0001-01-01 00:00:00 +0000 UTC
which would by default just start at the next full minute/hour/second etc.
At the minute, if there are no deployments, the /evaluate.py script still seems to be called with no metric JSON, causing it to error out. If there are no deployments, the evaluation should be skipped.
Depends on jthomperoo/custom-pod-autoscaler-operator#14
Relies on #75
For something like an HTTP request:
metric:
type: "http"
http:
method: "GET"
headers:
- key: value
- key: value
url: "https://0.0.0.0:5000/metrics"
parameterMethod: query
The dryRun field should be provided to the hooks and metric gathering to allow them to determine if it is a dry run.
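The value passed to a hook or metric script might then look like the following JSON; apart from dryRun, the field names are illustrative:

```json
{
  "resource": "example-deployment",
  "runType": "api",
  "dryRun": true
}
```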
Thrashing is when a deployment is scaled up and down repeatedly in a short period of time, caused by being right on the threshold of an evaluation. For example, if the number of pods in a deployment rapidly changes between 2 and 3 because of small changes in the metrics as it is directly on a boundary/threshold.
The Cooldown feature would allow setting a delay or a cool down period on when to scale, avoiding rapid changes in number of pods for minor changes in the metric being evaluated. For example, a cooldown could be set to not scale again if a deployment has been scaled in the past 5 minutes.
Add the glog logging framework to allow for logging levels with severity and verbosity. Allow log verbosity to be set by Custom Pod Autoscaler configuration.
The selector should be consistent with the HPA, using scaleTargetRef to select which object to manage; for now this should only support Deployments.
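The HPA's scaleTargetRef has this shape, which the CPA could adopt:

```yaml
scaleTargetRef:
  apiVersion: apps/v1
  kind: Deployment
  name: example-deployment
```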
At the moment, if the CPA is shut down it takes a long time to terminate, I think this is due to the goroutine still running. When the shutdown command is given, the goroutine should exit immediately.
Should be able to specify minimum and maximum replica counts. Autoscaling should be allowed to be disabled by setting the replica value to 0 in a resource's spec.
Change JSON returned to use the same naming convention as Kubernetes, with camelCase rather than snake_case.
Running golint pulls down dependencies rather than using the vendor directory, slowing down the CI.
When a release is created, the CI should deploy the binary that is built to GitHub releases.
Interval between evaluations should be configurable in environment variables. This would allow it to be specified in the Custom Pod Autoscaler YAML.
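For example, the container spec in the Custom Pod Autoscaler YAML could set it like this; the variable name and unit are assumptions:

```yaml
env:
- name: interval
  value: "15000" # evaluation interval, assumed milliseconds
```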
Currently the shell is fixed to /bin/sh; allow setting it to other values, e.g.:
/bin/bash
/usr/local/bin/python3
This allows for greater flexibility and avoids tying the CPA to a requirement for /bin/sh.
Horizontal Pod Autoscaler supports this, so the Custom Pod Autoscaler should too.
From https://kubernetes.io/docs/tasks/run-application/horizontal-pod-autoscale/
The Horizontal Pod Autoscaler automatically scales the number of pods in a replication controller, deployment, replica set or stateful set...
Evaluations should be retrievable through an API endpoint:
GET /evaluations
Returning:
[
{
"deployment": "example-deployment",
"evaluation": {
"target_replicas": 2
}
}
]
This evaluation should be calculated at request time.
The ResourceMetrics struct has redundant data (the resource) and should be simplified by removing and replacing it.
Support JSON alongside YAML for configuration file, consistent with Kubernetes.