Coder Social home page Coder Social logo

trellixvulnteam / jiang-jung-dian_nux8 Goto Github PK

View Code? Open in Web Editor NEW

This project forked from mre500/jiang-jung-dian

0.0 0.0 0.0 63.34 MB

A service utilizing Amazon Transcribe, Amazon Comprehend and S3 in Amazon Web Services (AWS) to help users convert audio to text report with each speaker's name.

Python 75.92% R 23.95% Batchfile 0.13%

jiang-jung-dian_nux8's Introduction

Jiang-Jung-Dian (講重點)

It offers users a convenience way to convert recording audios to transcript with identified speakers. The converting part is done by utilizing Amazon Transcribe and Amazon Comprehend, while the identifying part is achieved by utilizing VGGVox speaker identification model. Here is the link of a simple deom vedio for AWS hackathon. It is demonstrated by using 2016 America presidential debate on youtube.

Prerequisites

  • Python >= 3.7
  • R >= 3.6.1

Setup

  • $ pip install -r requirements.txt to install all dependencies wirtten in python.
  • To install all dependencies wirtten in R:
    • $ install.packages("data.table")
    • $ install.packages("dplyr")
    • $ install.packages("shiny")
    • $ install.packages("DT")
    • $ install.packages("shinydashboard")
    • $ install.packages("stringr")

Execute

  1. First, run $ python ui.py, a GUI will pop up. \

  2. Then do enrollment:

    • Enter "speaker's name" in 使用者名稱
    • Click 開始錄音 to start enroll the speaker's voice.
    • Click 結束錄音 if one finishes recording.
    • Iterate over the first three processes if there are multiple speakers.
    • After all speakers are enrolled, click 開始辨識.
  3. Third, start recoding by clicking 會議錄音, and finish recording by clicking 結束會議.

  4. The result can be seen by cicking the generated shiny.bat file.

Processes behind the scene

Result

Contributors

Acknowledgments

Thanks Hack For Good hold by AWS for providing the AWS Services, and the open source of VGGVox. Special thanks to Chen, Stuart, the Solutions Architect at Amazon Web Services (AWS), for technical supports.

jiang-jung-dian_nux8's People

Contributors

dependabot[bot] avatar gigi2jean avatar mre500 avatar trellixvulnteam avatar watlz1533 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.