Coder Social home page Coder Social logo

akshjain83 / awesome-machine-learning-operations Goto Github PK

View Code? Open in Web Editor NEW

This project forked from ethicalml/awesome-production-machine-learning

0.0 2.0 0.0 187 KB

a curated list of awesome resources to deploy, monitor, version and scale your machine learning (ML + DevOps = MLOps)

License: MIT License

awesome-machine-learning-operations's Introduction

Machine Learning Operations (ML + DevOps = MLOps)

What is Machine Learning Operations (MLOps)?

Machine Learning Operations involves the infrastructure required to scale your ML capabilities. You can find an overview of MLOps in Alejandro Saucedo's talk at the 2018 EuroSciPy on Scalable Data Science: The State of MLOps in 2018.

This repository covers two of the core principles of MLOps:
  • Reproducibility: Model & Data Versioning
  • Orchestration: Model Deployment Orchestration

Overview of this repo

  • This repository contains a curated list of awesome resources that will help you kick-start or enhance your machine learning operations
  • Machine Learning Operations involve everything that is required to serve your ML, including deploying, monitoring, scaling, versioning, etc

Main Contributors

Index

  • Model standard formats
  • Model versioning
  • Data storage / standardisation
  • Feature engineering automation
  • Model deployment frameworks
  • Computation load distribution frameworks
  • Function as a service / Lambda frameworks
  • Computational optimisation libraries
  • Commercial data science platforms
  • Commercial ETL platforms

Main Contents

Model Standard Formats for interoperability

  • ONNX - Open Neural Network Exchange Format
  • Neural Network Exchange Format (NNEF) - A standard format to store models across Torch, Caffe, TensorFlow, Theano, Chainer, Caffe2, PyTorch, and MXNet
  • PFA - Created by the same organisation as PMML, the Predicted Format for Analytics is an emerging standard for statistical models and data transformation engines.
  • PMML - The Predictive Model Markup Language standard in XML - (Video)
  • Java PMML API - Java libraries for consuming and producing PMML files containing models from different frameworks, including:

Model & Pipeline Versioning

  • Data Version Control (DVC) - A git fork that allows for version management of models
  • ModelDB - Framework to track all the steps in your ML code to keep track of what version of your model obtained which accuracy, and then visualise it and query it via the UI
  • Pachyderm - Open source distributed processing framework build on Kubernetes focused mainly on dynamic building of production machine learning pipelines - (Video)
  • steppy - Lightweight, Python3 library for fast and reproducible machine learning experimentation. Introduces simple interface that enables clean machine learning pipeline design.
  • Jupyter Notebooks - Web interface python sandbox environments for reproducible development
  • Quilt Data - Versioning, reproducibility and deployment of data and models.
  • H2O Flow - Jupyter notebook-like inteface for H2O to create, save and re-use "flows"
  • ModelChimp - Framework to track and compare all the results and parameters from machine learning models (Video)
  • PredictionIO - An open source Machine Learning Server built on top of a state-of-the-art open source stack for developers and data scientists to create predictive engines for any machine learning task
  • MLflow - Open source platform to manage the ML lifecycle, including experimentation, reproducibility and deployment.
  • Sacred - Tool to help you configure, organize, log and reproduce machine learning experiments.
  • FGLab - Machine learning dashboard, designed to make prototyping experiments easier.
  • Studio.ML - Model management framework which minimizes the overhead involved with scheduling, running, monitoring and managing artifacts of your machine learning experiments.

Data Storage / Standardisation / Privacy

  • EdgeDB - NoSQL interface for Postgres that allows for object interaction to data stored
  • BayesDB - Database that allows for built-in non-parametric Bayesian model discovery and queryingi for data on a database-like interface - (Video)
  • Apache Arrow - In-memory columnar representation of data compatible with Pandas, Hadoop-based systems, etc
  • Apache Parquet - On-disk columnar representation of data compatible with Pandas, Hadoop-based systems, etc
  • Apache Kafka - Distributed streaming platform framework
  • Uber SQL Differencial Privacy - Uber's open source framework that enforces differential privacy for general-purpose SQL queries.
  • ClickHouse - ClickHouse is an open source column oriented database management system supported by Yandex - (Video)

Feature Engineering Automation

  • auto-sklearn - Framework to automate algorithm and hyperparameter tuning for sklearn
  • TPOT - Automation of sklearn pipeline creation (including feature selection, pre-processor, etc)
  • tsfresh - Automatic extraction of relevant features from time series
  • Featuretools - An open source framework for automated feature engineering
  • Colombus - A scalable framework to perform exploratory feature selection implemented in R
  • automl - Automated feature engineering, feature/model selection, hyperparam. optimisation

Model Deployment Frameworks

  • Seldon - Open source platform for deploying and monitoring machine learning models in kubernetes - (Video)
  • Redis-ML - Module available from unstable branch that supports a subset of ML models as Redis data types
  • Model Server for Apache MXNet (MMS) - A model server for Apache MXNet from Amazon Web Services that is able to run MXNet models as well as Gluon models (Amazon's SageMaker runs a custom version of MMS under the hood)
  • Tensorflow Serving - High-performant framework to serve Tensofrlow models via grpc protocol able to handle 100k requests per second per core
  • Clipper - Model server project from Berkeley's Rise Rise Lab which includes a standard RESTful API and supports TensorFlow, Scikit-learn and Caffe models
  • DeepDetect - Machine Learning production server for TensorFlow, XGBoost and Cafe models written in C++ and maintained by Jolibrain
  • MLeap - Standardisation of pipeline and model serialization for Spark, Tensorflow and sklearn
  • OpenScoring - REST web service for scoring PMML models built and maintained by OpenScoring.io
  • NVIDIA TensorRT - Model server created by NVIDIA that runs models in ONNX format, including frameworks such as TensorFlow and MATLAB

Computation load distribution frameworks

  • Hadoop Open Platform-as-a-service (HOPS) - A multi-tenency open source framework with RESTful API for data science on Hadoop which enables for Spark, Tensorflow/Keras, it is Python-first, and provides a lot of features
  • PyWren - Answer the question of the "cloud button" for python function execution. It's a framework that abstracts AWS Lambda to enable data scientists to execute any Pyhton function - (Video)
  • NumPyWren - Scientific computing framework build on top of pywren to enable numpy-like distributed computations
  • BigDL - Deep learning framework on top of Spark/Hadoop to distribute data and computations across a HDFS system
  • Horovod - Uber's distributed training framework for TensorFlow, Keras, and PyTorch
  • Apache Spark MLib - Apache Spark's scalable machine learning library in Java, Scala, Python and R
  • Dask - Distributed parallel processing framework for Pandas and NumPy computations - (Video)

Data Pipeline Frameworks

  • Apache Airflow - Data Pipeline framework built in Python, including scheduler, DAG definition and a UI for visualisation
  • Luigi - Luigi is a Python module that helps you build complex pipelines of batch jobs, handling dependency resolution, workflow management, visualisation, etc
  • Genie - Job orchestration engine to interface and trigger the execution of jobs from Hadoop-based systems
  • Oozie - Workflow scheduler for Hadoop jobs

Function as a Service / Lambda Frameworks

  • OpenFaaS - Serverless functions framework with RESTful API on Kubernetes
  • Fission - (Early Alpha) Serverless functions as a service framework on Kubernetes
  • Hydrosphere ML Lambda - Open source model management cluster for deploying, serving and monitoring machine learning models and ad-hoc algorithms with a FaaS architecture
  • Hydrosphere Mist - Serverless proxy for Apache Spark clusters
  • Apache OpenWhisk - Open source, distributed serverless platform that executes functions in response to events at any scale.

Infrastructure Orchestration Frameworks

  • Kubeflow - A cloud native platform for machine learning based on Google’s internal machine learning pipelines.
  • Polyaxon - A platform for reproducible and scalable machine learning and deep learning on kubernetes. - (Video)

Optimization of Computation

  • Numba - A compiler for Python array and numerical functions

Commercial Data-science Platforms

  • Comet.ml - Machine learning experiment management. Free for open source and students (Video)
  • Skytree 16.0 - End to end machine learning platform (Video)
  • Algorithmia - Cloud platform to build, deploy and serve machine learning models (Video)
  • y-hat - Deployment, updating and monitoring of predictive models in multiple languages (Video)
  • Amazon SageMaker - End-to-end machine learning development and deployment interface where you are able to build notebooks that use EC2 instances as backend, and then can host models exposed on an API
  • Google Cloud Machine Learning Engine - Managed service that enables developers and data scientists to build and bring machine learning models to production.
  • Microsoft Azure Machine Learning service - Build, train, and deploy models from the cloud to the edge.
  • IBM Watson Machine Learning - Create, train, and deploy self-learning models using an automated, collaborative workflow.
  • neptune.ml - community-friendly platform supporting data scientists in creating and sharing machine learning models. Neptune facilitates teamwork, infrastructure management, models comparison and reproducibility.
  • Datmo - Workflow tools for monitoring your deployed models to experiment and optimize models in production.
  • Valohai - Machine orchestration, version control and pipeline management for deep learning.
  • Dataiku - Collaborative data science platform powering both self-service analytics and the operationalization of machine learning models in production.
  • MCenter - MLOps platform automates the deployment, ongoing optimization, and governance of machine learning applications in production.
  • Skafos - Skafos platform bridges the gap between data science, devops and engineering; continuous deployment, automation and monitoring.
  • SKIL - Software distribution designed to help enterprise IT teams manage, deploy, and retrain machine learning models at scale.
  • MLJAR - Platform for rapid prototyping, developing and deploying machine learning models.
  • MissingLink - MissingLink helps data engineers streamline and automate the entire deep learning lifecycle.
  • DataRobot - Automated machine learning platform which enables users to build and deploy machine learning models.
  • RiseML - Machine Learning Platform for Kubernetes: RiseML simplifies running machine learning experiments on bare metal and cloud GPU clusters of any size.
  • Datatron - Machine Learning Model Governance Platform for all your AI models in production for large Enterprises.

Commercial ETL Platforms

awesome-machine-learning-operations's People

Contributors

akshjain83 avatar axsaucedo avatar gidim avatar maxbenchrist avatar navy-xie avatar tuomastik avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.