kaiidams Goto Github PK
Name: Katsuya Iida
Type: User
Bio: Working on smart ML projects on my own.
Location: Tokyo
Name: Katsuya Iida
Type: User
Bio: Working on smart ML projects on my own.
Location: Tokyo
Chromium extension to replace ja-jp with en-us in the URL so that you can easily move from Japanese page to English page if the site is multilingual.
Sample of Android's automatic speech recognition (ASR)
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Synthesized hand pose images generated by Blender
Kokoro-Align is a PyTorch speech-transcript alignment tool for LibriVox. It splits audio files in silent positions and find CTC best path to align transcript texts with the audio files.
A public domain single speaker Japanese speech dataset
C# port of https://github.com/shuyo/language-detection
Samples for ML.NET, an open source and cross-platform machine learning framework for .NET.
NeMo: a toolkit for conversational AI
.NET Android sample app for NeMoOnnxSharp
Neural speech with NVIDIA NeMo and ONNX Runtime
Text-to-speech and speech recognition, VAD with NVIDIA NeMo and ONNX Runtime for .NET Core.
Neil Thapen's Pinktrombone converted to C++
Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint
Open source synthetic recommendation dataset
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
A .NET library that provides access to the library that powers PyTorch.
Repository for TorchSharp examples and tutorials.
ML.NET porting of https://www.tensorflow.org/tutorials/audio/transfer_learning_audio
πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregression.
Voice100 runtime. Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without recursion.
Voice100 Android App is a TTS/ASR sample app that uses ONNX Runtime and Voice100 neural TTS/ASR models on Xamarin. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without recursion.
Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without recursion.
wav2vec 2.0 finetuned with Common Voice 12.0 Japanese
A Python web server that controls WebGL of the browser
A high-quality speech analysis, manipulation and synthesis system
A declarative, efficient, and flexible JavaScript library for building user interfaces.
π Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. πππ
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google β€οΈ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.