Light

melspectrum007 Goto Github PK

followers: 5.0 following: 7.0 repos: 107.0 gists: 0.0

Type: User

melspectrum007's Projects

kaggle_carvana_segmentation

Code for a 1st place model in Carvana Image Masking Challenge

kaldi

This is now the official location of the Kaldi project.

kaldi-onnx

Kaldi model converter to ONNX

kkp15.github.io

Demo website for the paper MUSIC SOURCE SEPARATION USING GENERATIVE ADVERSARIAL NETWORKS, submitted in ICASSP2018.

librosa

Python library for audio and music analysis

list-of-symbolic-musical-datasets

loop

A method to generate speech across multiple speakers

lpcnet

Efficient neural speech synthesis

lws

Fast spectrogram phase recovery using Local Weighted Sums (C/Python/Matlab)

mad-twinnet

The code for the MaD TwinNet

magenta

Magenta: Music and Art Generation with Machine Intelligence

magnolia

melgan-neurips

melody-rnn

This is an adaption of karpathy/char-rnn for melody onset/offset detection.

mixing_secrets

modeling-plate-spring-reverb

In order to listen to the audio examples, please go the website:

montreal-forced-aligner

Command line utility for forced alignment using Kaldi

ms-snsd

The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) levels desired.

multi-view-neural-acoustic-words-embeddings

mxnet-audio

Implementation of music genre classification, audio-to-vec, song recommender, and music search in mxnet

nara_wpe

Different implementations of "Weighted Prediction Error" for speech dereverberation

nmtgminor

A Neural Machine Translation toolkit for research purpose

onssen

An open-source speech separation and enhancement library

openseq2seq

Toolkit for efficient experimentation with various sequence-to-sequence models

performancenet

PerformanceNet: Score-to-Audio Music Generation with Multi-Band Convolutional Residual Network

phonet

Keras-based python framework to compute phonological posterior probabilities from audio files

pitchshiftpa

Phase-Aligned Pitch and Formant Shifter

praat

Praat: Doing Phonetics By Computer

praatio

A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting features from and making manipulations on audio files given hierarchical time-aligned transcriptions (utterance > word > syllable > phone, etc).

promo

Prososdy Morph: A python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.

1
2
3
4

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.