freds0 Goto Github PK

followers: 65.0 following: 83.0 repos: 87.0 gists: 0.0

Name: Frederico S. Oliveira

Type: User

Company: UFMT

Bio: Researcher in the area of NLP, Ph.D. student at UFG, focusing on speech synthesis and recognition using deep learning and also professor at UFMT.

Twitter: fred_s0

Location: Cuiabá, Mato Grosso - Brazil

Blog: https://www.fredso.com.br

Greetings! My name is Fred. 👋

I hold the position of a professor at the esteemed Faculty of Engineering within the Federal University of Mato Grosso (UFMT), located in Cuiabá-MT, Brazil. My academic journey has led me to the achievement of a Ph.D. in the field of Artificial Intelligence from the Federal University of Goiás (UFG). My primary area of interest revolves around the realm of NLP.

I've taken the initiative to curate several repositories, which can be accessed here. While they may currently appear somewhat unorganized, I am actively planning to arrange them in a more structured manner in the future. For those seeking additional insights, I kindly invite you to explore further through the following links:

https://www.fredso.com.br: A glimpse into my personal webpage.
https://www.mrfalante.com.br: A collection of projects centered around the fascinating domain of speech processing, particularly in the context of Brazilian Portuguese.

Frederico S. Oliveira's Projects

ai-programming-using-python

This repository contains implementation of different AI algorithms, based on the 4th edition of amazing AI Book, Artificial Intelligence A Modern Approach

algoritmos_e_estrutura_de_dados_material_didatico

Material Didático da disciplina Algoritmos e Estrutura de Dados

alien_invasion

Classic game using PyGame

assem-vc

Official Code for Assem-VC @ICASSP2022

audio-slicer

A simple GUI application that slices audio with silence detection

awesome-singing-voice-synthesis-and-singing-voice-conversion

A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (VC), Singing Voice Conversion (SVC), and related interesting works.

brspeech-dataset

BRSpeech: A Portuguese Dataset for Speech Synthesis

bspeech-mos-prediction

A model for predicting MOS that utilizes embeddings of supervised learning and self-supervised learning models, combined with embeddings of speaker verification models, to predict the MOS metric.

capybara_dataset

This is a dataset composed of images of capybaras to be used for training a model for object detection

capybara_image_segmentation

This repository presents how to train your own Image Segmentator Using TensorFlow Object Detection API.

capybara_object_detection

This repository presents how to train your own Object Detector Using TensorFlow Object Detection API. It also demonstrates how to use the trained model to annotate data (auto-annotate).

cml-tts-dataset

CML-TTS: A Multilingual Dataset for Speech Synthesis

cml-tts-toolkit

CML-TTS Conversion Tools

coqui-tts

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

cs7320-ai

Examples for an AI course following the textbook Artificial Intelligence: A Modern Approach by Russell and Norvig.

data_augmentation_for_asr

A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.

data_augmentation_for_object_detection

scripts to augment labelded images with bounding boxes

deep-speaker

Deep Speaker: an End-to-End Neural Speaker Embedding System.

ensembleobjectdetection

ermis_demo

facebook_denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.

freds0 Goto Github PK

Greetings! My name is Fred. 👋

Frederico S. Oliveira's Projects

Recommend Projects

Recommend Topics

Recommend Org