Coder Social home page Coder Social logo

prathamshankwalker / vakta_voicebot Goto Github PK

View Code? Open in Web Editor NEW

This project forked from krrish-v/vakta

0.0 0.0 0.0 15.86 MB

AI-powered voice learning platform designed to assist individuals with visual impairments.

License: Apache License 2.0

Python 62.75% Jupyter Notebook 37.25%

vakta_voicebot's Introduction

Vakta Voicebot - Empowering Visually Impaired Community with AI

Image

Introduction

Introducing "Vakta 0.0.1" - an innovative and cutting-edge GUI based project developed for the Hackathon. Vakta is a state-of-the-art, fully voice-activated, and AI-enabled assistant, thoughtfully designed with the primary objective of empowering the visually impaired community. with adaptive learning technology. With an unwavering commitment to providing an inclusive and seamless user experience, Vakta will revolutionize the way blind individuals interact with technology.

"Vakta" is a Sanskrit word for "speaker." In our project, Vakta embodies this essence by becoming a compassionate and intelligent mentor for the visually impaired. Just as ancient speakers conveyed wisdom, Vakta aims to create a future of inclusivity and empowerment, bridging the gap between the visually impaired and knowledge.

Features

  1. Voice-activated Information (General Mode):

    • Engage in voice-based conversations with OpenAI's LLM, powered by Eleven Lab's voice model and Langchain, utilizing memory to retain context throughout the interaction.
    • Allows the individual to ask any question and get the answer through voice. Questions like “ What is the Capital of France? ” or “ Define Osmosis” are some examples.
    • To conclude, this feature is simply our favorite ChatGPT in voice form.
  2. Listen to your favorite book (Book Mode):

    • Download the pdf of the requested book and play it like an audiobook.
    • Control to stop and continue listening.
    • Powered by various NLP algorithms and search techniques using Google Books API.
  3. Know the weather around you (Weather Mode):

    • Ask the AI about the weather of a particular city.
    • AI responds with accurate temperature, humidity, and wind speed. For example, "What is the weather of Delhi?"
  4. Stay Updated with the latest news (News Mode):

    • Request for news headlines of a particular category or in general.
    • AI responds with the latest headlines from categories like Sports, Technology, Business, etc.
  5. Search and Listen to Music or Podcasts (YouTube Mode):

    • Empowers the user to search any song or video from YouTube.
    • Example: "Simon Sinek: How Great Leaders Inspire Action | TED Talk" or "Believer | Imagine Dragons."
  6. Send a message to your contacts (Messaging Mode):

    • Makes the user to send message easily to their contacts
    • Example: "Send a message to 'Contact Name' "

Voice Commands:

  • General Mode: The AI operates in the general chat mode (Feature 1) if no specific command is used.
  • Book Mode: Enable book mode by saying "Download book <book_name>."
  • Weather Mode: Enable weather mode by saying "weather <your_city_name>."
  • News Mode: Enable news mode by saying "Search News ."
  • YouTube Mode: Enable YouTube mode by saying "YouTube" and then provide the video title when prompted.
  • Messaging Mode: Enabel this mode by saying "send message" and then provie the message you want to send
  • Exit Program: Say "Exit Program" to close the application.

Tech Stack

  • GUI: KivyMD Framework (Python)

APIs Used

Developer Machine Setup

  1. Clone the repository: git clone https://github.com/krrish-v/vakta.git && cd vakta
  2. Install python if not installed on your system
  1. Create a virtual environment
  • If virtualenv is not installed : pip install virtualenv && virtualenv env
  • Activate virtual environment
    a. In Windows : env/Scripts/activate
    b. In Linux or MacOS: source env/bin/activate
  1. Install dependencies: pip install --upgrade pip -r requirements.txt
  2. pip install https://github.com/yt-dlp/yt-dlp/archive/master.tar.gz
  3. Move to GUI directory and run main.py file

Future Scope

  • Make the app available for Android and iOS devices since Kivy is a cross-platform Python framework.
  • Enhance the features and make it a fully ready platform for voice-based learning.
  • Release the app in the market, possibly on a subscription-based model.

-Currently working only on Linux OS system

Demo

(https://youtu.be/UiNxtdCrQMA)

References

vakta_voicebot's People

Contributors

krrish-v avatar prathamshankwalker avatar puneetmpatil avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.