Coder Social home page Coder Social logo

aaronchen's Projects

10997_mwmae icon 10997_mwmae

Repository for MW-MAE paper submitted to NeurIPS 2023

1d-statespace icon 1d-statespace

This repository contains the source code of an efficient 1D probabilistic model for music time analysis proposed in ICASSP2022 venue.

3d-speaker icon 3d-speaker

A repository for single- and multi-modal speaker verification, speaker recognition, and speaker diarization.

3ddfa_v2 icon 3ddfa_v2

The official PyTorch implementation of Towards Fast, Accurate and Stable 3D Dense Face Alignment, ECCV, 2020

academicodec icon academicodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

accomontage icon accomontage

Codes and MIDI demos for paper: Zhao et al., AccoMontage: Accompaniment Arrangement via Phrase Selection and Style Transfer, ISMIR 2021

acoss icon acoss

acoss: Audio Cover Song Suite is a framework for feature extraction and benchmarking for the cover song identification (CSI) task

acoustic-model icon acoustic-model

Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion

adaspeech icon adaspeech

AdaSpeech: Adaptive Text to Speech for Custom Voice

adtof icon adtof

Additional material for the paper ADTOF: A large dataset of non-synthetic music for automatic drum transcription

ahotts icon ahotts

Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its acoustic engine is based on hts_engine and it uses a high quality vocoder called AhoCoder. Developed by Aholab Signal Processing Laboratory,

ai-audio-datasets-list icon ai-audio-datasets-list

This is a datasets of speech, music and sound effects that can provide training data for AIGC, AI model training, intelligent audio tool development, and audio applications. The audio dataset is mainly used in speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, sound synthesis, etc

ai-research icon ai-research

【🔞🔞🔞 内含不适合未成年人阅读的图片】基于我擅长的编程、绘画、写作展开的 AI 探索和总结:StableDiffusion 是一种强大的图像生成模型,能够通过对一张图片进行演化来生成新的图片。ChatGPT 是一个基于 Transformer 的语言生成模型,它能够自动为输入的主题生成合适的文章。而 Github Copilot 是一个智能编程助手,能够加速日常编程活动。

aicity-reid-2020 icon aicity-reid-2020

:red_car: The 1st Place Submission to AICity Challenge 2020 re-id track (Baidu-UTS submission)

allosaurus icon allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

alta icon alta

An Automatic Lyrics Transcription Framework using Dilated Convolutional Neural Networks with Self-Attention based on kaldi

amazon-dsstne icon amazon-dsstne

Deep Scalable Sparse Tensor Network Engine (DSSTNE) is an Amazon developed library for building Deep Learning (DL) machine learning (ML) models

amt-tools icon amt-tools

Machine learning tools and framework for automatic music transcription.

ance icon ance

A novel embedding training algorithm leveraging ANN search and achieved SOTA retrieval on Trec DL 2019 and OpenQA benchmarks

annotated_deep_learning_paper_implementations icon annotated_deep_learning_paper_implementations

🧑‍🏫 50! Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

annoy icon annoy

Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk

artificialsonggenerator icon artificialsonggenerator

The ArtificialSongGenerator automatically composes and compiles the Artifical Audio Multitrack dataset (AAM).

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.