Coder Social home page Coder Social logo

pose-estimation-starter's Introduction

Human Pose Estimation Starter

A PyTorch-based starter kit for 2D human pose estimation projects, designed for researchers working on optimizing human pose estimation networks using knowledge distillation and neural architecture search. It provides an interface for training, inference, and evaluation, as well as the dataloaders for common human pose estimation datasets.

screenshot

This repository is a fork of PyTorch-Pose, and also uses code from several other repositories as listed in the Acknowledgments section.

Table of Contents

Getting Started

These instructions will help you set up the project on your local machine for development and testing purposes.

Features

  • Multi-thread data loading
  • Multi-GPU training
  • Logger
  • Training/testing results visualization

See Getting Started for notes on how to use the project for training and evaluation.

Prerequisites

  • PyTorch 0.4.1 or higher

Note that the original code was developed with Python2 and has not been tested with Python3 yet. If you would like to contribute to the project and update the code to Python3 and a newer version of PyTorch, please feel free to submit a pull request.

Installation

  1. Clone the repository:
git clone --recursive https://github.com/saifkhichi96/human-pose-estimation-starter.git
  1. Install the required Python packages:
pip install -r requirements.txt
  1. Create a symbolic link to the images directory of the MPII dataset:
    ln -s PATH_TO_MPII_IMAGES_DIR data/mpii/images
    
    For training/testing on COCO, please refer to COCO Readme.
  2. Download annotation file:

Usage

  1. Download and prepare the dataset following the instructions in the data/ folder.
  2. Train a human pose estimation model by running the training script:
python src/train.py --config configs/config.yaml
  1. Evaluate the model on the test dataset:
python src/evaluate.py --config configs/config.yaml

Structure

  • data/: A folder containing a small dataset for testing purposes and instructions for obtaining larger datasets.
  • notebooks/: A folder containing Jupyter notebooks to demonstrate the implementation.
  • src/: A folder containing the main source code for human pose estimation.
  • models/: A folder with code for various pose estimation models (ViTPose, TokenPose, etc.).
  • utils/: A folder with utility functions for data loading, preprocessing, and postprocessing.
  • configs/: A folder containing configuration files for different models and training settings.
  • requirements.txt: A list of required Python packages.

Supported Models

The following models are supported:

  • Alejandro Newell, Kaiyu Yang, and Jia Deng, Stacked Hourglass Networks for Human Pose Estimation, arXiv:1603.06937, 2016. (GitHub)
  • Xiao et al., Simple Baselines for Human Pose Estimation and Tracking, arxiv:1804.06208, ECCV 2018 (GitHub)

For more information on the models, please refer to the models documentation.

Supported Datasets

Dataloaders for the following datasets are provided:

For more information on the datasets, please refer to the datasets documentation.

Roadmap

  • Convert the code to Python3 and PyTorch 2.0
  • Use PyTorch Lightning to reduce boilerplate code
  • Add support for more datasets
  • Add code to visualize data samples
  • Add support for more models
  • Improve the documentation
    • Create the models documentation
    • Create the datasets documentation
    • Restructure the documentation to make it easier to navigate

Contributing

Please read CONTRIBUTING.md for details on our code of conduct and the process for submitting pull requests.

Acknowledgments

pose-estimation-starter's People

Contributors

bearpaw avatar madhav-bansal avatar pranavbudhwant avatar saifkhichi96 avatar tete1030 avatar weigq avatar xiaoyong avatar xmyqsh avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.