Rohan Dubey's Projects
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
DDPG algorithm for PID tuning
An AI solution based on YOLOv5 model to detect those who are wearing mask or not.
In this project we are presenting the real time facial expression recognition of seven most basic human expressions: ANGER, DISGUST, FEAR, HAPPY, NEUTRAL SAD, SURPRISE.
Feature Detection is very important understanding an image and marking out the point/region in an image responsible for bringing out the good features of an image. This project contains different methods/approaches how feature detection can be implemented.
Codes regarding the paper: Handwritten Image Detection using DCGAN with SIFT and ORB Optical Features
Homography is technique involving mapping one image to another based on their corresponding related good features . In this project, we can find out the given image from the video feed provided real-time. This project involves feature detection using SIFT algorithm and images are matched by FLANN and K- Nearest Neighbor Technique
Profile README + Site made with 11ty and TailwindCSS
A Program that creates a bounding box that enables you to construct a predictable pipeline of high-quality training data that will teach your ML/DL-powered computer vision system to find and identify objects in image and video data.
Numerical Analysis Techniques
A project on Optical Image Tracking covering Optical Flow, Dense Optical Flow, MeanShift Technique, CamShift Technique, Single Object Tracking and Multi Object Tracking.
Introduction of OpenCV and pillow packages to make a program like paint.
A python program that detects the pulse rate of an individual through the webcam. This is a contact-less method for pulse detection and can work on any camera/environment
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
YOLOv5 in PyTorch > ONNX > CoreML > TFLite