Coder Social home page Coder Social logo

traas-stack / kapacity Goto Github PK

View Code? Open in Web Editor NEW
162.0 11.0 14.0 797 KB

An open cloud native capacity solution which helps you achieve ultimate resource utilization in an intelligent and risk-free way.

Home Page: https://kapacity.netlify.app

License: Apache License 2.0

Dockerfile 0.34% Makefile 1.56% Go 76.22% Python 21.88%
aiops autoscaling capacity cloud-native finops hpa kubernetes monitoring risk-mitigation time-series-forecasting

kapacity's People

Contributors

archerny avatar dayko2019 avatar zqzten avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

kapacity's Issues

No nginx_ingress_controller_requests_rate metric found in prometheus

What happened?

image
image
image
image
image

I followed the tutorial and found that the training was incorrect and promethus did not have the nginx_ingress_controller_requests_rate indicator.

What did you expect to happen?

success to train the model

How can we reproduce it (as minimally and precisely as possible)?

follow the tutorial

Anything else we need to know?

No response

Kapacity version

Kubernetes version

$ kubectl version
# paste output here

in which time. the release version cloud get

What would you like to be added?

according the wechat article (point this). It says in June. the v0.2 will be publish. but I find it not get. what I more concerned is , in which time. I could get release version in order to use it in prod env.
wait for your reply. thanks.

Why is this needed?

want to know release version in which time could get

Introduce time series forecasting model training script

What would you like to be added?

Introduce the Python script to train the time series forecasting model for predictive scaling.

Why is this needed?

The time series forecasting model need to be trained beforehand so that the acutal prediction job can utilize it.

Support controller reconcile concurrency configuration

What would you like to be added?

Add a reconcile-concurrency flag to manager to configure the reconcile concurrency of every controller.

Why is this needed?

The reconcile concurrency defaults to 1 which does not meet the efficiency requirement in production.

Reasonable behavior on selection of portraits with the same priority

What would you like to be added?

Currently, the behavior of the portrait selection is undefined when we have multiple portraits with the same priority. We need to introduce a reasonable behavior to it, that is to select the one which desires most replicas.

Why is this needed?

To provide a more reasonable and predictable behavior.

fetch_resource_metric_history failed

What happened?

image
I followed the tutorial to create a ihpa : https://kapacity.netlify.app/zh-cn/docs/getting-started/quick-start/ihpa/predictive-scaling/#%E5%88%9B%E5%BB%BA%E9%85%8D%E7%BD%AE%E4%BA%86%E5%8A%A8%E6%80%81%E9%A2%84%E6%B5%8B%E5%BC%8F%E7%94%BB%E5%83%8F%E6%BA%90%E7%9A%84-ihpa, and I failed.

What did you expect to happen?

success to run the job

How can we reproduce it (as minimally and precisely as possible)?

https://kapacity.netlify.app/zh-cn/docs/getting-started/quick-start/ihpa/predictive-scaling/#%E5%88%9B%E5%BB%BA%E9%85%8D%E7%BD%AE%E4%BA%86%E5%8A%A8%E6%80%81%E9%A2%84%E6%B5%8B%E5%BC%8F%E7%94%BB%E5%83%8F%E6%BA%90%E7%9A%84-ihpa

Anything else we need to know?

No response

Kapacity version

Kubernetes version

$ kubectl version
# paste output here

the burst algorithm

What would you like to be added?

When is the code implementation of the burst algorithm expected to be released?

// TODO
BurstPortraitType PortraitType = "Burst"

Why is this needed?

“Algorithm which detects abnormal traffic or potential capacity risks, and suggests a safe replica count proactively.”
I originally thought the burst algorithm mentioned in the project introduction was already supported.

Expose the unified metric provider interface as a service

What would you like to be added?

Expose the unified metric provider interface as a (probably gRPC) service.

Why is this needed?

This enables external algorithm jobs to utilize the internal unified metric provider interface so that they don't need to impl metrics query themselves.

request help: when could I get dashboard project?

What would you like to be added?

Hello, I learned that kapacity has a dashboard project which will introducted,I want to know when I could get it.

Why is this needed?

use dashboard as a control plane

Support automatic readiness gate injection

What would you like to be added?

Introduce an admission webhook to automatically inject the kapacity.traas.io/online readiness gate to pods which needed.

Why is this needed?

To support the readiness gate pod traffic controller.

Introduce replicas prediction job

What would you like to be added?

Introduce the Python program which gathers metrics, does replicas prediction, and writes out results. This program can be run as Kubernetes CronJob. The model used by the program should be trained beforehand so that it would not rely on GPUs.

Why is this needed?

It is needed to do the real prediction work for the predictive portrait.

Support arbitrary metrics query for promethues metric provider

What would you like to be added?

Support arbitrary metrics query for promethues metric provider.

Why is this needed?

Currently the Promethues metric provider only supports resource metrics query which limits the metrics that algorithms can use. We need a machinism to support arbitrary PQL query to make most of this provider.

as a user, if I can use http api control kapacity

What would you like to be added?

Hello,I want to know, if I can use api(such as http invoke),to control the kapacity behavior . If it is yes, where could I get the api?

Why is this needed?

when I could use http method to control kapacity behavior, I could make our own control portal page. and use http api to publish config into k8s cluster.

[FAQ] More details about `recommendation of Pod resource specifications (CPU, memory, etc.) intelligent algorithms`

Thank you very much for open-sourcing Kapacity. Sharing experiences from the industry is very important for this field. My main research direction includes container workload resource recommendation, so even if the resource recommendation algorithm of Kapacity will be open-sourced in the future, I still want to learn about your ideas as soon as possible. Could you please provide a detailed introduction of the models and methods you use for resource recommendation?

Ref: https://kapacity.netlify.app/docs/roadmap/

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.