Coder Social home page Coder Social logo

aruto-code / realtimetranslator Goto Github PK

View Code? Open in Web Editor NEW

This project forked from amaanamalani/realtimetranslator

0.0 0.0 0.0 20.82 MB

Real Time Sign Language Translator to Speech. This is the capstone project I worked on in my final year of BTech (Data Science) degree.

Python 0.11% Jupyter Notebook 99.89%

realtimetranslator's Introduction

realtimetranslator

Real Time Sign Language Translator to Speech. This is the capstone project I worked on in my final year of BTech (Data Science) degree. Dataset: Key Point data extracted using MediaPipe Training data: 25 sign language actions, 20 sequences per action Key Points: Face, Pose, Left Hand, Right Hand Key Points Face Key Points = 468 x 3(x,y,z) = 1404 Key Points Pose Key Points = 33 x 4(x,y,z, visibility) = 132 Key Points Right Hand Key Points = 21 x 3(x,y,z) = 63 Key Points Left Hand Key Points = 21 x 3(x,y,z) = 63 Key Points Total Key Points: 1662 per frame 20 Sequences, 30 frames each - 500 frames in total Sequences Flattening: Data sequences are flattened and mapped to labels for machine learning model training. Labeling: Labels assigned based on the corresponding sign language gesture, facilitating supervised learning.

  1. Data Structure: Organized into 25 action categories, each containing 20 sequences with 30 frames each.
  2. Data Preprocessing: Flattened frames and mapped labels for training.
  3. Machine Learning Models: Two models utilized - LSTM and Simple RNN.
  4. Real-Time Translation: Application captures video from a webcam, processes it with MediaPipe and the trained model to generate real-time predictions.
  5. Visualization: Recognized actions are displayed, and a sentence of recognized signs is maintained.
  6. Speech Synthesis: pyttsx3 library used for text-to-speech synthesis.
  7. User Interface: Streamlit employed to create an intuitive, user-friendly virtual environment.

realtimetranslator's People

Contributors

amaanamalani avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.