The bootstrap_paradox from praveen5959

bootstrap_paradox's Introduction

Problem : To filter out the background during the live stream of educational videos

Solution: Given the parameters and constraints presented by the problem, in terms of quality of the output, speed (in terms of fps) of the output, and cost of the resources required (CPU/GPU cost) , we chose to implement a version DeepLabV3 which is based on the encoder decoder architecture. The video if fed to the model frame by frame and the primary person (educator) is identified and segmented. The background is subtracted in this way from every frame and the output frames are compressed together to form the video which would not have any background objects or people. Implementing DeepLabV3 also ensures that the latency caused by this processing is reduced to a fraction of a second.

Model Architecture

Instructions to run:

Run sudo pip3 install -r requirements.txt to install all packages.
Run python inference_video.py for video files.
Run python inference_webcam.py for live streaming.

Recommend Projects

praveen5959 / bootstrap_paradox Goto Github PK

bootstrap_paradox's Introduction

bootstrap_paradox's People

Contributors

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent