Coder Social home page Coder Social logo

jaredcasper / dali Goto Github PK

View Code? Open in Web Editor NEW

This project forked from nvidia/dali

1.0 1.0 0.0 25.17 MB

A library containing both highly optimized building blocks and an execution engine for data pre-processing in deep learning applications

Home Page: https://docs.nvidia.com/deeplearning/sdk/dali-developer-guide/index.html

License: Apache License 2.0

CMake 6.45% Dockerfile 0.14% C 0.42% C++ 75.17% Python 7.84% Cuda 8.70% Shell 1.27%

dali's Introduction

License

NVIDIA DALI v0.1

Today’s deep learning applications include complex, multi-stage pre-processing data pipelines that include compute-intensive steps mainly carried out on the CPU. For instance, steps such as load data from disk, decode, crop, random resize, color and spatial augmentations and format conversions are carried out on the CPUs, limiting the performance and scalability of training and inference tasks. In addition, the deep learning frameworks today have multiple data pre-processing implementations, resulting in challenges such as portability of training and inference workflows and code maintainability.

NVIDIA Data Loading Library (DALI) is a collection of highly optimized building blocks and an execution engine to accelerate input data pre-processing for deep learning applications. DALI provides both performance and flexibility of accelerating different data pipelines, as a single library, that can be easily integrated into different deep learning training and inference applications.

Key highlights of DALI include:

  • Full data pipeline accelerated from reading from disk to getting ready for training/inference
  • Flexibility through configurable graphs and custom operators
  • Support for image classification and segmentation workloads
  • Ease of integration through direct framework plugins and open source bindings
  • Portable training workflows with multiple input formats - JPEG, LMDB, RecordIO, TFRecord
  • Extensible for user specific needs through open source license

Installing prebuilt DALI packages

Prerequisities

  • Linux x64
  • NVIDIA Driver_ supporting CUDA 9.0 or later (i.e., 384.xx or later driver releases)
  • One or more of the following Deep Learning frameworks:
    • MXNet 1.3 beta_ mxnet-cu90==1.3.0b20180612 or later
    • pyTorch 0.4_
    • TensorFlow 1.7_ or later

Installation

pip install --extra-index-url https://developer.download.nvidia.com/compute/redist nvidia-dali

Compiling DALI from source

Prerequisities

  • Linux x64
  • NVIDIA CUDA 9.0_ (CUDA 8.0 compatibility is provided unofficially)
  • nvJPEG library_ (This can be unofficially disabled; see below)
  • protobuf_ version 2 or later (version 3 or later is required for TensorFlow TFRecord file format support)
  • CMake 3.5_ or later
  • libjpeg-turbo 1.5.x_ or later (This can be unofficially disabled; see below)
  • OpenCV 3_ or later (OpenCV 2.x compatibility is provided unofficially)
  • (Optional) liblmdb 0.9.x_ or later
  • One or more of the following Deep Learning frameworks:
    • MXNet 1.3 beta_ mxnet-cu90==1.3.0b20180612 or later
    • pyTorch 0.4_
    • TensorFlow 1.7_ or later

Note

TensorFlow installation is required to build the TensorFlow plugin for DALI

Note

Items marked "unofficial" are community contributions that are believed to work but not officially tested or maintained by NVIDIA.

Get the DALI source

git clone --recursive https://github.com/NVIDIA/dali
cd dali

Make the build directory

mkdir build
cd build

Compile DALI

To build DALI without LMDB support:

cmake ..
make -j"$(nproc)" install

To build DALI with LMDB support:

cmake -DBUILD_LMDB=ON ..
make -j"$(nproc)" install

Optional CMake build parameters:

  • BUILD_PYTHON - build Python bindings (default: ON)
  • BUILD_TEST - include building test suite (default: ON)
  • BUILD_BENCHMARK - include building benchmarks (default: ON)
  • BUILD_LMDB - build with support for LMDB (default: OFF)
  • BUILD_NVTX - build with NVTX profiling enabled (default: OFF)
  • BUILD_TENSORFLOW - build TensorFlow plugin (default: OFF)
  • (Unofficial) BUILD_JPEG_TURBO - build with libjpeg-turbo (default: ON)
  • (Unofficial) BUILD_NVJPEG - build with nvJPEG (default: ON)

Install Python bindings

pip install dali/python

Getting started

docs/examples_ directory contains a series of examples (in the form of Jupyter notebooks) of different features of DALI. It also contains examples of how to use DALI to interface with DL frameworks.

Documentation for the latest stable release is available here. Nightly version of the documentation that stays in sync with the master branch is available here.

Contributing to DALI

Contributions to DALI are more than welcome. To make the pull request process smooth, please follow these guidelines.

Contributors

DALI was built with major contributions from Trevor Gale, Przemek Tredak, Simon Layton, Andrei Ivanov, Serge Panev

dali's People

Contributors

januszl avatar ptrendx avatar cliffwoolley avatar kh4l avatar slayton58 avatar winggan avatar zsef123 avatar romerojosh avatar moconnor725 avatar drivanov avatar maxcuda avatar

Stargazers

Ashley Turing avatar

Watchers

Jared Casper avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.