Coder Social home page Coder Social logo

ahmeftah's Projects

all-programming-e-books-pdf icon all-programming-e-books-pdf

A Curated List Of Programming Books For C, C++ , Python, JavaScript, NodeJs, ReactJs, Web, JQuery, Flask, Dom, Angular, CSS, HTML for beginners, intermediate, advanced and experts

arabic-asr-system icon arabic-asr-system

automated speech recognition system for arabic language for customers query classification, with adaptive learning and merged learning models trained with weka

arabic-speech-recognition icon arabic-speech-recognition

This repository contains my attempt to use two famous speech recognition frameworks (Kaldi, CMU Sphinx4) for Arabic Language using the publicly-available dataset "Arabic Corpus of Isolated Words"

audio_course_project icon audio_course_project

Project done as part of Audio Processing course at Tampere University. Topic was separation of harmonic and percussive elements according to paper EPARATION OF A MONAURAL AUDIO SIGNAL INTO HARMONIC/PERCUSSIVE COMPONENTS BY COMPLEMENTARY DIFFUSION ON SPECTROGRAM by Nobutaka Ono, Kenichi Miyamoto, Jonathan Le Roux, Hirokazu Kameoka, and Shigeki Sagayama.

audiomentations icon audiomentations

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

cargan icon cargan

Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"

change-emotions icon change-emotions

Nonparallel Emotional Speech Conversion with MUNIT. Introduction: This is a tensorflow implementation of paper(https://arxiv.org/pdf/1811.01174.pdf) Nonparallel Emotional Speech Conversion. It is an end-to-end voice conversion system which can change the speaker's emotion. For example, neutral to angry, sad to happy. The model aims at generating speech with desired emotions while keeping the original linguistic content and speaker identity. It first extracts acoustic features from raw audio, then learn the mapping from source emotion to target emotion in the feature space, and finally put those features together to rebuild the waveform. In our approach, three types of features are considered: Features: Fundamental frequency (log F_0), converted by logarithm Gaussian normalized transformation Power envelope, converted by logarithm Gaussian normalized transformation Mel-cepstral coefficients (MCEPs), a representation of spectral envelope, trained by CycleGAN Aperiodicities (APs), directly used without modification. Dependencies: Python 3.5, Numpy 1.15, TensorFlow 1.8, LibROSA 0.6, FFmpeg 4.0, PyWorld

code icon code

Compilation of R and Python programming codes on the Data Professor YouTube channel.

crank icon crank

A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder

cycle_gan_vc icon cycle_gan_vc

Reproducing PARALLEL-DATA-FREE VOICE CONVERSION USING CYCLE-CONSISTENT ADVERSARIAL NETWORKS (https://arxiv.org/pdf/1711.11293.pdf)

cyclegan icon cyclegan

Software that can generate photos from paintings, turn horses into zebras, perform style transfer, and more.

cyclegan-emovc icon cyclegan-emovc

EMOTIONAL VOICE CONVERSION WITH CYCLE-CONSISTENT ADVERSARIAL NETWORK

cyclegan-tensorflow icon cyclegan-tensorflow

Tensorflow implementation for learning an image-to-image translation without input-output pairs. https://arxiv.org/pdf/1703.10593.pdf

cycletransgan-evc icon cycletransgan-evc

CycleTransGAN-EVC: A CycleGAN-based Emotional Voice Conversion Model with Transformer

deepnude-an-image-to-image-technology icon deepnude-an-image-to-image-technology

DeepNude's algorithm and general image generation theory and practice research, including pix2pix, CycleGAN, UGATIT, DCGAN, SinGAN, ALAE, mGANprior, StarGAN-v2 and VAE models (TensorFlow2 implementation). DeepNude的算法以及通用生成对抗网络(GAN,Generative Adversarial Network)图像生成的理论与实践研究。

dl-for-emo-tts icon dl-for-emo-tts

:computer: :robot: A summary on our attempts at using Deep Learning approaches for Emotional Text to Speech :speaker:

dyganvc icon dyganvc

demo page https://MingjieChen.github.io/dygan-vc

ecapa-tdnn icon ecapa-tdnn

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.