Coder Social home page Coder Social logo

dataset_loaders's Introduction

This repository contains a framework to load the most commonly used datasets for image and video semantic segmentation. The framework can perform some on-the-fly preprocessing/data augmentation, as well as run on multiple threads (if enabled) to speed up the I/O operations.

NEWS: You might be interested in checking out Main loop TF, a python main loop that integrates the Dataset loaders with Tensorflow!

Attribution

If you use this code, please cite:

  • [1] Francesco Visin, Adriana Romero - Dataset loaders: a python library to load and preprocess datasets (BibTeX)

How to install it:

  1. Clone the repository with --recursive in some path, e.g. to your $HOME:

    git clone --recursive https://github.com/fvisin/dataset_loaders.git "$HOME/dataset_loaders"
  2. Install the package:

    pip install [--user] -e <dataset_loaders path>
  3. The framework assumes that the datasets are stored in some shared paths, accessible by everyone, and should be copied locally on the machines that run the experiments. The framework automatically takes care for you to copy the datasets from the shared paths to a local path.

    Create a configuration file with these paths in /dataset_loaders/dataset_loaders/config.ini (see the config.ini.example in the same directory for guidance).

    Note: if you want to disable the copy mechanism, just specify the same path for the local and the shared path:

    [general]
    datasets_local_path = /a/local/path
    [camvid]
    shared_path = /a/local/path/camvid
    [cityscapes]
    shared_path = /a/local/path/cityscapes/
    (and so on...)
  4. To use the MS COCO dataset, you also need to do the following:

    cd dataset_loaders/images/coco/PythonAPI
    make all
  5. You will need to install SimpleITK and openCV if you intend to use the warp_spline or the optical flow data augmentations respectively.


Notes

  • The code is provided as is, please expect minimal-to-none support on it.
  • This framework is provided for research purposes only. Although we tried our best to test it, the code might be bugged or unstable. Use it at your own risk!
  • The framework currently supports image or video based datasets. It could be easily extended to support other kinds of data (e.g., text corpora), but there is no plan on our side to work on that at the moment.
  • Feel free to contribute to the code with a PR if you find bugs, want to improve the existing code or add support for other datasets.



THE SOFTWARE IS PROVIDED "AS IS" AND THE AUTHOR DISCLAIMS ALL WARRANTIES WITH REGARD TO THIS SOFTWARE INCLUDING ALL IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS. IN NO EVENT SHALL THE AUTHOR BE LIABLE FOR ANY SPECIAL, DIRECT, INDIRECT, OR CONSEQUENTIAL DAMAGES OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS, WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION, ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS SOFTWARE.

dataset_loaders's People

Contributors

adri-romsor avatar carriepl avatar david-vazquez avatar erakra avatar etdej avatar fvisin avatar marcociccone avatar michaldrozdzal avatar simjeg avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.