Coder Social home page Coder Social logo

Oytun Turk's Projects

prodiff icon prodiff

PyTorch Implementation of ProDiff (ACM-MM'22) with a Extremely-Fast diffusion speech synthesis pipeline

pyannote-audio icon pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

pyloudnorm icon pyloudnorm

Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm

pysptk icon pysptk

A python wrapper for Speech Signal Processing Toolkit (SPTK).

quickvc-voiceconversion icon quickvc-voiceconversion

QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion

rad-mmm icon rad-mmm

A TTS model that makes a speaker speak new languages

radtts icon radtts

Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, Diverse Synthesis, and Generative Modeling and Fine-Grained Control over of Low Dimensional (F0 and Energy) Speech Attributes.

s3prl icon s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

samba icon samba

Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"

sc-wavernn icon sc-wavernn

Official PyTorch implementation of Speaker Conditional WaveRNN

sequitur-g2p icon sequitur-g2p

This is a github repository of the abandonware Sequitur G2P by Bisani & Ney

sherpa icon sherpa

Speech-to-text server framework with next-gen Kaldi

sonnet icon sonnet

TensorFlow-based neural network library

soundstream icon soundstream

This repository is an implementation of this article: https://arxiv.org/pdf/2107.03312.pdf

soundstream-pytorch icon soundstream-pytorch

Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint

spear-tts-pytorch icon spear-tts-pytorch

Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch

specdiff-gan icon specdiff-gan

Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS

speech-backbones icon speech-backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

speech-trident icon speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

speecht5 icon speecht5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

ssamba icon ssamba

An official implementation for SSAMBA: Self-Supervised Audio Mamba

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.