Coder Social home page Coder Social logo

svenha / py-nltools Goto Github PK

View Code? Open in Web Editor NEW

This project forked from gooofy/py-nltools

0.0 2.0 0.0 376 KB

A collection of basic python modules for spoken natural language processing

License: Apache License 2.0

Makefile 0.10% Python 99.90% Shell 0.01%

py-nltools's Introduction

py-nltools

Highlevel Diagram

A collection of abstraction layers and support functions that form the natural language processing foundation of the Zamia AI project:

  • phonetics: translation functions between various phonetic alphabets (IPA, X-SAMPA, X-ARPABET, …​)

  • tts: abstraction layer towards using eSpeak NG, MaryTTS, SVOX Pico TTS or a remote TTS server and sequitur g2p

  • asr: abstraction layer towards using kaldi-asr and pocketsphinx, models can be found here: http://goofy.zamia.org/voxforge/

  • sequiturclient: g2p using sequitur

  • pulseplayer: audio playback through pulseaudio

  • pulserecorder: audio recording through pulseaudio

  • tokenizer: english and german word tokenizer aimed at spoken language applications

  • threadpool: simple thread pool implementation

  • vad: Voice Activity Detection finite state machine based on webrtc VAD

  • macro_engine: Simple macro engine aimed at generating natural language expansions

I plan to add modules as I need them in the Zamia AI projects. Some modules like phonetics and tokenizer have some overlap with larger projects like NLTK or spaCy - my modules tend to be more hands-on and simple minded than these and therefore are in no way meant to replace them.

Requirements

Note: probably incomplete.

  • Python 2.7

  • for TTS one or more of:

    • MaryTTS, py-marytts

    • espeak-ng, py-espeak-ng

    • SVOX Pico TTS, py-picotts

  • for ASR one or more of:

    • kaldi-asr 5.1, py-kaldi-asr

    • pocketsphinx

  • sequitur

  • pulseaudio

  • webrtc

License

My own code is Apache-2.0 licensed unless otherwise noted in the script’s copyright headers.

Some scripts and files are based on works of others, in those cases it is my intention to keep the original license intact. Please make sure to check the copyright headers inside for more information.

Author

Guenter Bartsch <[email protected]>

py-nltools's People

Contributors

gooofy avatar mpuels avatar svenha avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.