
Adlik: Toolkit for Accelerating Deep Learning Inference

License: Apache License 2.0


Adlik


Adlik is an end-to-end optimizing framework for deep learning models. The goal of Adlik is to accelerate the deep learning inference process in both cloud and embedded environments.

Adlik consists of two sub-projects: the model compiler and the serving platform.

The model compiler supports several optimization techniques, such as pruning, quantization and structural compression, and can easily be applied to models developed with TensorFlow, Keras, PyTorch, etc.

The serving platform provides deep learning models with an optimized runtime based on the deployment environment. Put simply, a user of Adlik can optimize a deep learning model with the model compiler and then deploy it to a target platform with the Adlik serving platform.

Adlik schematic diagram

With the Adlik framework, different deep learning models can be deployed to different platforms with high performance in a flexible and easy way.

Adlik: Model Compiler

Model Compiler schematic diagram

  1. Supports optimization of models from different deep learning frameworks, e.g. TensorFlow, Caffe and PyTorch.
  2. Supports compiling models into different formats (OpenVINO IR, ONNX, TensorRT) for different runtimes, e.g. CPU, GPU and FPGA.
  3. Simplified interfaces for the workflow.

Adlik: Serving Engine

Serving Engine schematic diagram

  1. Model uploading & upgrading, model inference & monitoring.
  2. Unified inference interfaces for different models.
  3. Management and scheduling for solutions with multiple models in various runtimes.
  4. Automatic selection of the inference runtime.
  5. Ability to add customized runtimes.

Build

This guide is for building Adlik on Ubuntu systems.

First, install Git and Bazel.
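
On Ubuntu, installing these prerequisites might look like the following. This is a hedged sketch based on the upstream Bazel apt instructions; the Bazel version Adlik actually requires may differ, so check the project's build documentation.

    # Install Git and the tools needed to add the Bazel apt repository.
    sudo apt-get update && sudo apt-get install -y git curl gnupg
    # Add Bazel's signing key and apt repository (per the upstream Bazel docs).
    curl -fsSL https://bazel.build/bazel-release.pub.gpg | gpg --dearmor | \
        sudo tee /usr/share/keyrings/bazel-archive-keyring.gpg > /dev/null
    echo "deb [arch=amd64 signed-by=/usr/share/keyrings/bazel-archive-keyring.gpg] https://storage.googleapis.com/bazel-apt stable jdk1.8" | \
        sudo tee /etc/apt/sources.list.d/bazel.list
    sudo apt-get update && sudo apt-get install -y bazel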

Then, clone Adlik and change the working directory into the source directory:

git clone https://github.com/ZTE/Adlik.git
cd Adlik

Build clients

  1. Install the following packages:

    • python3-setuptools
    • python3-wheel
  2. Build clients:

    bazel build //adlik_serving/clients/python:build_pip_package -c opt --incompatible_no_support_tools_in_action_inputs=false
  3. Build pip package:

    mkdir /tmp/pip-packages && bazel-bin/adlik_serving/clients/python/build_pip_package /tmp/pip-packages
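
The resulting wheel can then be installed with pip. The exact wheel filename depends on the package version and platform, so a glob is used here:

    # Install the client package built in the previous step.
    python3 -m pip install /tmp/pip-packages/*.whl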

Build serving

First, install the following packages:

  • automake
  • libtool
  • make
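
On Ubuntu, these can be installed with apt:

    sudo apt-get install -y automake libtool make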

Build serving with OpenVINO runtime

  1. Install the intel-openvino-ie-rt-core package from OpenVINO.

  2. Assuming the installation path of OpenVINO is /opt/intel/openvino_VERSION, run the following commands:

    export INTEL_CVSDK_DIR=/opt/intel/openvino_VERSION
    export InferenceEngine_DIR=$INTEL_CVSDK_DIR/deployment_tools/inference_engine/share
    bazel build //adlik_serving \
        --config=openvino \
        -c opt \
        --incompatible_no_support_tools_in_action_inputs=false \
        --incompatible_disable_nocopts=false

Build serving with TensorFlow CPU runtime

Run the following command:

bazel build //adlik_serving \
    --config=tensorflow-cpu \
    -c opt \
    --incompatible_no_support_tools_in_action_inputs=false \
    --incompatible_disable_nocopts=false

Build serving with TensorFlow GPU runtime

Assume building with CUDA version 10.0.

  1. Install the following CUDA and cuDNN packages (a sample installation command is sketched after this list):

    • cuda-cublas-dev-10-0
    • cuda-cufft-dev-10-0
    • cuda-cupti-10-0
    • cuda-curand-dev-10-0
    • cuda-cusolver-dev-10-0
    • cuda-cusparse-dev-10-0
    • libcudnn7=*+cuda10.0
    • libcudnn7-dev=*+cuda10.0
  2. Run the following command:

    env TF_CUDA_VERSION=10.0 \
        bazel build //adlik_serving \
            --config=tensorflow-gpu \
            -c opt \
            --incompatible_no_support_tools_in_action_inputs=false \
            --incompatible_disable_nocopts=false \
            --incompatible_use_specific_tool_files=false
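
As referenced in step 1, installing the listed packages with apt might look like the sketch below. The cuDNN packages need a concrete version string built for CUDA 10.0; <VERSION> is a placeholder you can resolve with apt-cache madison libcudnn7.

    # Sketch: install the CUDA 10.0 development packages listed in step 1.
    sudo apt-get install -y \
        cuda-cublas-dev-10-0 cuda-cufft-dev-10-0 cuda-cupti-10-0 \
        cuda-curand-dev-10-0 cuda-cusolver-dev-10-0 cuda-cusparse-dev-10-0
    # cuDNN: pin a concrete version built for CUDA 10.0, e.g. one listed by:
    #   apt-cache madison libcudnn7
    sudo apt-get install -y libcudnn7=<VERSION>+cuda10.0 libcudnn7-dev=<VERSION>+cuda10.0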

Build serving with TensorRT runtime

Assume building with CUDA version 10.0.

  1. Install the following CUDA, cuDNN and TensorRT packages:

    • cuda-cublas-10-0
    • cuda-cufft-10-0
    • cuda-cupti-10-0
    • cuda-curand-10-0
    • cuda-cusolver-10-0
    • cuda-cusparse-10-0
    • cuda-nvml-dev-10-0
    • cuda-nvrtc-10-0
    • libcudnn7=*+cuda10.0
    • libcudnn7-dev=*+cuda10.0
    • libnvinfer7=*+cuda10.0
    • libnvinfer-dev=*+cuda10.0
    • libnvonnxparsers7=*+cuda10.0
    • libnvonnxparsers-dev=*+cuda10.0
  2. Run the following command:

     env TF_CUDA_VERSION=10.0 \
         bazel build //adlik_serving \
             --config=tensorrt \
             -c opt \
             --action_env=LIBRARY_PATH=/usr/local/cuda-10.0/lib64/stubs \
             --incompatible_no_support_tools_in_action_inputs=false \
             --incompatible_disable_nocopts=false \
             --incompatible_use_specific_tool_files=false
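
After any of the serving builds above succeeds, the serving binary should be available under bazel-bin. The exact path below is an assumption based on the //adlik_serving Bazel target; verify it against your build output.

    # Expected location of the built serving binary (assumption, see above).
    ls -l bazel-bin/adlik_serving/adlik_serving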

Build in Docker

The ci/docker/build.sh script can be used to build a Docker image that contains all the requirements for building Adlik. You can then build Adlik inside that Docker image.
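
A hedged sketch of that workflow is shown below. The script's arguments and the image tag adlik-build are assumptions; check ci/docker/build.sh for its actual usage and the tag it produces.

    # Build the development image (any required script arguments are not shown here).
    ci/docker/build.sh
    # Build Adlik inside a container; "adlik-build" is a hypothetical image tag.
    docker run -it --rm -v "$PWD":/work -w /work adlik-build bash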
