Coder Social home page Coder Social logo

avert / ai-powered-video-tutorial-generator Goto Github PK

View Code? Open in Web Editor NEW

This project forked from akshitireddy/ai-powered-video-tutorial-generator

0.0 0.0 0.0 12.54 MB

Create AI-Generated Video Tutorials with Character Animation and Slides!

Home Page: https://www.linkedin.com/company/alystria-ai

License: The Unlicense

JavaScript 49.49% Python 49.01% CSS 1.50%

ai-powered-video-tutorial-generator's Introduction

AI-Generated Video Tutorials with Character Animation and Slides! ๐ŸŽฅ๐Ÿค–๐Ÿ“๐ŸŽจ

This AI tool helps you create captivating and informative AI-generated video tutorials on any topic! With a charming character featuring facial animation and informative slides, it can explain any topic with ease. The best part? You have full control over the tutorial's creativity, humor, level of explanation, character appearance, and voice. โค๏ธโœจ

Give it a try for free! ๐Ÿ”ฅ It leverages the powerful capabilities of various tools.

๐Ÿš€ Features

  • ๐Ÿง  The script is generated using Cohere's language model. (You can obtain a trial API key for free!)
  • ๐Ÿ—ฃ๏ธ Seamless integration with Edge TTS for high-quality voiceovers.
  • ๐Ÿ˜„ Engaging facial animation powered by SadTalker.
  • ๐Ÿ–ผ๏ธ Eye-catching and relevant images from Google for slides.
  • ๐ŸŽจ Customizable creativity, humor, explanation level, character appearance, and voice.

โœจ Quick Demo

quick_demo_compressed.mp4

โœจ Full Demo

Sophia_compressed.mp4
Benjamin_compressed.mp4

๐Ÿšจ Requirements

Open up a terminal and go to backend directory

For Windows

python -m venv .venv
.venv\Scripts\activate
pip install -r requirements.txt

For Linux or Mac

python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt

Also install sadtalker(https://github.com/OpenTalker/SadTalker#%EF%B8%8F-1-installation), the sadtalker directory must be inside backend. Run the sadtalker webui once so that the venv environment is created.

Add your Cohere API key to backend/apikeys.json

๐Ÿ”Œ How to use

Open up a terminal and go to frontend/topic2explanation

npm run dev

Open up another terminal and go to backend

uvicorn main:app

๐ŸŽจ Customizability

To change the character and voice, you need to put your desired character in characters directory. It must be in 640x720 resolution and for the voice you can choose a voice from the voice list which can be seen using this command.

edge-tts --list-voices

Once you find a voice you like add it in the create_audio_image function

character_dict = {'Benjamin': "en-GB-RyanNeural",
                  'Sophia': 'en-IE-EmilyNeural'}

๐Ÿ“ How it works

โœจ It all starts with the creation of a script using the create_script function. ๐Ÿ“œ This function takes various parameters such as topic, level of explanation, target audience age, creativity, and humor. ๐ŸŽญ With these parameters in mind, the script is carefully crafted to explain the chosen topic. To accomplish this, we leverage the power of the Cohere API and Langchain. ๐Ÿค

๐ŸŽ™๏ธ Once the script is ready, we move on to the create_audio_image function. This function splits the script into smaller sentences, which are then used to generate audio dialogues using Microsoft's Edge Text-to-Speech (TTS) service. ๐Ÿ—ฃ๏ธ In parallel, we generate a search query for each sentence using Cohere and Langchain once again. These search queries help us retrieve relevant images from Google, which will be used as slides in the presentation. ๐Ÿ–ผ๏ธ

๐ŸŽฅ With the audio files and character images in place, we proceed to create the videos using the Sadtalker library. ๐ŸŽฌ First, we generate videos for the character animations, and then we transform the still images from Google into slide videos. ๐ŸŽž๏ธ These slide videos will be seamlessly integrated into the final video presentation. To add an element of randomness, we assign a random number to each video and slide pair. Based on this number, we position the slides and talking character either to the left or right, or even use the image as the background. The talking character may appear in the bottom left or right corner of the screen. ๐ŸŽฏ

๐Ÿ“ผ Finally, we save the completed video, combining the slide videos and character animation videos. The resulting video is now ready to be shared! ๐ŸŽ‰ To bring it all together, we rely on FastAPI for the backend, and for the frontend, we utilize Next.js and Tailwind CSS. ๐Ÿš€

โค๏ธ Thanks

If you found this interesting check out Alystria AI for more fun projects

https://www.linkedin.com/company/alystria-ai

ai-powered-video-tutorial-generator's People

Contributors

akshitireddy avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.