kaiidams

followers: 23 following: 9 repos: 31 gists: 1

Name: Katsuya Iida

Type: User

Bio: Working on smart ML projects on my own.

Location: Tokyo

Blog: https://www.linkedin.com/in/katsuya-iida/

Katsuya Iida's Projects

all-en

Chromium extension that replaces ja-jp with en-us in the URL so that you can easily move from a Japanese page to its English version on multilingual sites.

androidspeechdemo

Sample of Android's automatic speech recognition (ASR)

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

freehand-dataset

Synthesized hand pose images generated by Blender

kokoro-align

Kokoro-Align is a PyTorch speech-transcript alignment tool for LibriVox. It splits audio files at silent positions and finds the CTC best path to align transcript texts with the audio files.

kokoro-speech-dataset

A public domain single speaker Japanese speech dataset

languagedetection

C# port of https://github.com/shuyo/language-detection

machinelearning-samples

Samples for ML.NET, an open source and cross-platform machine learning framework for .NET.

nemoonnxandroidapp

.NET Android sample app for NeMoOnnxSharp

nemoonnxgodot

Neural speech with NVIDIA NeMo and ONNX Runtime

nemoonnxsharp

Text-to-speech, speech recognition, and VAD with NVIDIA NeMo and ONNX Runtime for .NET Core.

pinktrombone_cpp

Neil Thapen's Pinktrombone converted to C++

soundstream-pytorch

Unofficial PyTorch implementation of SoundStream with training code and a 16 kHz pretrained checkpoint

synthetic-recommendation-dataset

Open source synthetic recommendation dataset

tacotron

A TensorFlow implementation of Google's Tacotron speech synthesis with a pre-trained model (unofficial)

torchsharp

A .NET library that provides access to the library that powers PyTorch.

torchsharpexamples

Repository for TorchSharp examples and tutorials.

transferlearningaudio

ML.NET port of https://www.tensorflow.org/tutorials/audio/transfer_learning_audio

tts

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

voice100

Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregression.

voice100-runtime

Voice100 runtime. Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without recursion.

Voice100 Android App is a TTS/ASR sample app that uses ONNX Runtime and Voice100 neural TTS/ASR models on Xamarin. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without recursion.

voice100godot

voice100sharp

Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without recursion.

wav2vec2_ja

wav2vec 2.0 fine-tuned on Common Voice 12.0 Japanese

webgl-python

A Python web server that controls WebGL in the browser

world

A high-quality speech analysis, manipulation and synthesis system
