Coder Social home page Coder Social logo

pasaopasen / speechlogger Goto Github PK

View Code? Open in Web Editor NEW
7.0 2.0 1.0 21.02 MB

The simple application for logging (and translating) all speech from your microphone and speaker

Jupyter Notebook 7.27% Python 92.73%
speech translation choosen-language microphone logger helper googletrans textblob

speechlogger's Introduction

SpeechLogger

The simple application for logging (and translating) all speech from your microphone and speaker.

I think it would be very useful when you communicate with somebody foreign person by Discord or else app. It can help you to have better understanding your interlocutor if you know his language like I do english)

First result

Scope:

  • Translation messages from any languages supported by Google into certain choosen languages at once

  • Recognition and translation the speech on choosen language from microphone

TODO:

  • recognize all speech from microphone automatically (not only on demand)

  • recognize speech from speakers (it will be hard to do)

  • write program into desktop app

  • add text logging in file

1

How to run

You can download this app as compiled (see releases). Unpack and run .exe file.

Notes

  • Not correct showing of arabic words (reversed and upper-cased) exists because of bags with windows terminal. But if u will copy this text and paste it into text editor / google / messanger, it would be normal.

Second result

What's new:

  • recognition speech from speakers too (if u have a speaker supported callbacks)

see

How to run

I cannot compile it now because of problems with soundcard, but u can run it by Python:

  1. Download Python from official site (choose correct operation system)

  2. Open cmd.exe

  3. Install necessary packages via commands:

pip install numpy
pip install scipy
pip install textblob
pip install pyaudio
pip install soundcard==0.3.3
pip install speechrecognition
pip install termcolor
pip install colorama
  1. download and run this file

Notes

  • if u have some problems with installing pyaudio, try to get solutions here

Third result

What's new:

  • json files with basic settings and supported languages
  • trancription (grapheme to phoneme) by epitran

TODO firstly:

  • transcript persian words better

im

How to use

  1. also install epitran by command (cmd.exe):
pip install epitran
  1. download release

  2. unpuck and run .py file

4th

What's new:

  • TextBlob API was replaced with better googletrans API
  • English is transliterated by pysle, farsi --- by PersianG2p

1

How to use

pip install googletrans
pip install pysle
pip install PersianG2p

Download release, unpuck and run .py file.

News

Since 2 June 2020 I'm working with this Telegram translator bot.

speechlogger's People

Contributors

pasaopasen avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Forkers

denfrost

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.