alm-lab / pace

PACE (Podcast AI for Chapters and Episodes) is a semantic search engine that helps you find the information you need, inter- and intra-podcasts (Project for the AssemblyAI Winter 2022 Hackathon).


🎧 PACE: Podcast AI for Chapters and Episodes

DEMO: Try it out for a limited time on our demo page.

๐Ÿ“ Table of Contents


🎯 Our IDEA

How many times have you been listening to a podcast and wanted to go back to a specific part, but couldn't remember the exact time? We have all been there. Now we've got you covered!

We present PACE, Podcast AI for Chapters and Episodes, which allows you to search for podcasts at your own pace.
PACE is a semantic search engine that helps you find the information you need, in a fast and easy way.

To get more superpowers and let the magic begin, switch the checkbox on. PACE will use the power of AI to search inside the podcasts and give you back the exact part you're looking for.

In addition to providing fast and easy access to information within podcasts, PACE also uses advanced text-to-image generative models to automatically create chapter covers for each podcast episode. This makes it easy for listeners to quickly identify and navigate to the specific information they are looking for within an episode.

With PACE, accessing the information you need within podcasts has never been easier.
Get ready to experience the future of podcasting. Everything you need, just a click away!


🔎 How does it work?

Podcasts are usually long-form audio content. They may cover many interesting topics, but you may be interested in only one or a few of them.

PACE has got you covered!

  • 🔈 Speech Processing: generate chapters from audio tracks
  • 📃 NLP: search for episodes or chapters using natural language
  • 🎨 Computer Vision: generate artwork for chapters using text-to-image models
  • 💻 Web: Flask + Elasticsearch to build a web app

📺 Demo & Slides

Watch the video

Find the slides of our project here.


โณ Pipeline

This is the pipeline of our project.

We ingest podcast RSS feeds and enrich the dataframe with episodes and related metadata. We then filter the episode descriptions using the BERT Podcast Classifier.
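As a rough illustration of the ingestion step, the sketch below parses an RSS 2.0 feed with Python's standard library and produces one row per episode. The sample feed, the `parse_feed` helper, and the row field names are assumptions made for this example; they are not taken from the project's code.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample feed, inlined so the example runs offline.
SAMPLE_FEED = """<rss version="2.0">
  <channel>
    <title>Example Podcast</title>
    <item>
      <title>Episode 1</title>
      <description>We talk about semantic search.</description>
      <enclosure url="https://example.com/ep1.mp3" type="audio/mpeg" length="123"/>
    </item>
    <item>
      <title>Episode 2</title>
      <description>We talk about speech recognition.</description>
      <enclosure url="https://example.com/ep2.mp3" type="audio/mpeg" length="456"/>
    </item>
  </channel>
</rss>
"""


def parse_feed(xml_text):
    """Return one dict per <item> found in an RSS 2.0 feed."""
    channel = ET.fromstring(xml_text).find("channel")
    podcast = channel.findtext("title", default="")
    rows = []
    for item in channel.findall("item"):
        enclosure = item.find("enclosure")
        rows.append({
            "podcast": podcast,
            "title": item.findtext("title", default=""),
            "description": item.findtext("description", default=""),
            # The enclosure URL points at the episode audio file.
            "audio_url": enclosure.get("url") if enclosure is not None else None,
        })
    return rows


episodes = parse_feed(SAMPLE_FEED)
for row in episodes:
    print(row["podcast"], "|", row["title"], "|", row["audio_url"])
```

A list of rows like this can be handed to a dataframe library for the enrichment and filtering steps.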

We encode the episode descriptions using Sentence-BERT and index them in Elasticsearch.
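To make the idea behind embedding-based search concrete, here is a minimal sketch that ranks documents by cosine similarity to a query vector. The tiny 3-dimensional vectors are made-up stand-ins for real sentence embeddings, and `cosine_similarity` and `rank` are hypothetical helpers; in the actual pipeline the vectors come from the encoder and the ranking is delegated to Elasticsearch.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # a zero vector has no direction to compare
    return dot / (norm_a * norm_b)


def rank(query_vec, docs, top_k=3):
    """Return the titles of the top_k most similar documents, best first."""
    scored = [(cosine_similarity(query_vec, d["embedding"]), d["title"]) for d in docs]
    scored.sort(reverse=True)
    return [title for _, title in scored[:top_k]]


# Toy vectors standing in for real sentence embeddings (assumption).
docs = [
    {"title": "Cooking pasta", "embedding": [0.9, 0.1, 0.0]},
    {"title": "Training neural networks", "embedding": [0.0, 0.2, 0.9]},
    {"title": "Italian recipes", "embedding": [0.8, 0.3, 0.1]},
]
query = [1.0, 0.2, 0.0]  # pretend embedding of a cooking-related query

results = rank(query, docs, top_k=2)
print(results)
```

Descriptions that point in a similar direction to the query end up first, even when they share no keywords with it.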

We then select a subset of podcasts for intra-podcast search, and we use AssemblyAI to transcribe them and retrieve the chapters inside each episode. We index the episode chapters in Elasticsearch, using Cohere to embed them.
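The sketch below shows one way a transcript with chapters could be flattened into per-chapter documents ready for indexing. It assumes the transcript follows the shape of AssemblyAI's auto-chapters output (a `chapters` list whose entries carry `summary`, `start`, and `end`, with times in milliseconds); the sample data, the `chapters_to_docs` helper, and the output field names are hypothetical.

```python
def chapters_to_docs(transcript, podcast_name, audio_url):
    """Turn a transcript's chapters into flat documents, one per chapter."""
    docs = []
    for chapter in transcript.get("chapters") or []:
        docs.append({
            "podcast": podcast_name,
            "description": chapter["summary"],
            # Convert milliseconds to seconds for the player.
            "start_s": chapter["start"] / 1000,
            "end_s": chapter["end"] / 1000,
            "audio_url": audio_url,
        })
    return docs


# Made-up transcript fragment, used only for illustration.
sample_transcript = {
    "chapters": [
        {"summary": "The hosts introduce the topic.", "start": 0, "end": 30000},
        {"summary": "A guest explains semantic search.", "start": 30000, "end": 95500},
    ]
}

chapter_docs = chapters_to_docs(
    sample_transcript, "Example Podcast", "https://example.com/ep1.mp3"
)
for doc in chapter_docs:
    print(doc["start_s"], doc["end_s"], doc["description"])
```

Each document would still need an embedding of its description before it can be searched semantically.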

Finally, we leverage Stable Diffusion models to create an image/GIF for each generated chapter.


🕒 PHASE 1

  1. Ingest podcast RSS feeds
  2. Enrich dataframe with episodes and related metadata
  3. Filter episode descriptions using BERT
  4. Encode episode descriptions using Sentence-BERT
  5. Index episodes in Elasticsearch
  6. TEST: Query Elasticsearch using Sentence-BERT

END OF PHASE 1
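For the indexing and query steps of this phase, an index mapping and a vector query might look like the sketch below. It assumes Elasticsearch 8.x, where `dense_vector` fields can be indexed for approximate kNN search; the field names, the 384-dimension setting, and the `build_knn_query` helper are assumptions, and the project may have used a different mapping or a script-based similarity query instead.

```python
import json

# Assumption: the embedding size depends on the chosen Sentence-BERT model.
EMBEDDING_DIMS = 384

# Index definition with a text field per attribute and one vector field.
index_body = {
    "mappings": {
        "properties": {
            "title": {"type": "text"},
            "description": {"type": "text"},
            "embedding": {
                "type": "dense_vector",
                "dims": EMBEDDING_DIMS,
                "index": True,
                "similarity": "cosine",
            },
        }
    }
}


def build_knn_query(query_vector, k=10, num_candidates=100):
    """Build a search body that asks for the k nearest episodes."""
    if len(query_vector) != EMBEDDING_DIMS:
        raise ValueError("query vector has the wrong number of dimensions")
    return {
        "knn": {
            "field": "embedding",
            "query_vector": list(query_vector),
            "k": k,
            "num_candidates": num_candidates,
        },
        # Return only the fields the search results page needs.
        "_source": ["title", "description"],
    }


search_body = build_knn_query([0.0] * EMBEDDING_DIMS, k=5)
print(json.dumps(index_body, indent=2))
```

Both dictionaries are plain JSON-serializable bodies, so they can be sent with any Elasticsearch client or over HTTP.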


🕕 PHASE 2

  1. Select a subset of podcasts for intra-podcast search (limited by the AssemblyAI API)

  2. Index episode chapters in Elasticsearch

    • Podcast name
    • Chapter description
    • Start & End time
    • Original episode (mp3 link)
  3. TEST: Query Elasticsearch using Cohere

  4. Web app

    • Implement Flask-based web app
    • Implement search bar
    • Implement search results
    • Implement player
    • Implement intra-podcast search

END OF PHASE 2
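Since each indexed chapter stores a start time, an end time, and the original mp3 link, a search result can point the player at the right moment. The sketch below builds a link with a temporal media fragment (`#t=start,end`) and a human-readable timestamp. Both helpers are hypothetical, and whether the project's player seeks this way is an assumption.

```python
def _seconds(value):
    """Format seconds with up to millisecond precision, without trailing zeros."""
    return f"{value:.3f}".rstrip("0").rstrip(".")


def chapter_link(audio_url, start_s, end_s=None):
    """Append a temporal media fragment so playback starts at the chapter."""
    fragment = f"t={_seconds(start_s)}"
    if end_s is not None:
        fragment += f",{_seconds(end_s)}"
    return f"{audio_url}#{fragment}"


def format_timestamp(seconds):
    """Render a position in seconds as H:MM:SS for the results list."""
    minutes, secs = divmod(int(seconds), 60)
    hours, minutes = divmod(minutes, 60)
    return f"{hours:d}:{minutes:02d}:{secs:02d}"


link = chapter_link("https://example.com/ep1.mp3", 95.5, 120)
label = format_timestamp(3725)
print(link, label)
```

A player that does not honor the fragment could instead read the stored start time and seek explicitly after the audio loads.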


🕘 PHASE 3

  1. Generative models (generate GIFs for chapters)
  2. Implement storage mechanism for images/GIFs
  3. Implement image/GIF retrieval in web app
  4. TEST: Verify that the web app behaves correctly in all its parts

END OF PHASE 3


🕛 PHASE 4

  1. Slides & Demo
  2. Add a "search by voice" button to the web app (future work)
  3. Add "fake news" detection to the web app (future work)

END OF PHASE 4


๐Ÿ™๐Ÿป Acknowledgements

  • AssemblyAI for the outstanding experience (and the free credits)
  • Cohere for the amazing API
  • 🎬 plyr.io for the media player design
  • 🤗 Hugging Face for the amazing models
  • ☕️ ☕️ 🧃 for the energy

๐ŸคŒ๐Ÿป About the project

PACE is a project created by Alkis Koudounas, Lorenzo Vaiani, and Moreno La Quatra. It is part of the AssemblyAI 50K Hackathon - Winter 2022.
