Coder Social home page Coder Social logo

sharier / object-3d-coordinate Goto Github PK

View Code? Open in Web Editor NEW
5.0 1.0 0.0 3.68 MB

Generate 3D coordinate of object from Kitchen & Dinner Scenario Video by using Convolution Neural Network Outputs and Motion Sensor Data.

Python 85.05% Jupyter Notebook 14.95%

object-3d-coordinate's Introduction

Motivation:

The ability of robotic agents performing everyday activity such as loading a dishwasher and setting a table autonomously in a wide range of context is challenging. Mastering such everyday activities by robotic agents is an important step for robots to become the competent (co-)workers or assistants. Everyday Activity Science and Engineering(EASE) is a research program that investigates the everyday activites of humans and uses this informations for modelling autonomous robotic agents.

Objectives:

The main goal of the project was to determine the 3D coordinate of each specific objects that are being moved during the recording session. This project is a small part of EASE research program in which two cameras where used to detect and track objects above the two table setting scenerio named as Kitchen table and Dinner table. The objects are moved by a participant from kitchen to dinner and vice versa. The participant performs the recording session wearing a motion sensor suit which records the coordinate of the body with respect to the origin of the motion capture system.

GOALS:

  1. Detect the objects using YOLO-V3 (You Only Look Once).
  2. Track the Detected objects using OpenCV Multi-tracker.
  3. Camera Calibration.
  4. Combine Motion Capture Data with obtained coordinates from computer vision technique to generate a 3D coordinate of each object.

Requirements:

  1. OpenCV 4.0.0
  2. Darknet YOLO. Darknet_YOLO-V3
  3. Pandas, Numpy, Imutils
  4. Linux OS (Ubuntu 16.04 LTS)
  5. Python 3.6.6

Procedures To use the script:

There are six arguments required for using the script YOLO_MOCAP_TRACKING.py:

  1. kitchenvdo: Path to video file of the kitchen scene.
  2. dinnervdo: Path to video file of the dinner scene.
  3. timestampkitchen: Path to time stamp file of the kitchen scene.
  4. timestampdinner: Path to time stamp file of the dinner scene.
  5. optitrackJSON: Path to mocap .JSON file of the motion capture data.
  6. optitrackCSV: Path to csv file of the motion capture CSV.

Detection of Object (Bowl) On Dinner Table Scenerio

Additional Information:

Procedures to use Trained YOLO Network for Detection:

A configuration file config.txt is used to configure the YOLO network for forward propagation in the main python script.

The following parameters are needed for setting the YOLO network:

  1. ClassFile: The path of the file with extension ”.names” is the file with the class names of the trained YOLO network
  2. ModelConfiguration: The path of file with extension ”.cfg” is the configuration file of the YOLO network.
  3. ModelWeights: The path of file with extension ”.weights” is the trained weights of YOLO.

object-3d-coordinate's People

Contributors

sharier avatar

Stargazers

 avatar  avatar  avatar Awsaf Rahman avatar MD Atif Bin Karim avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.