Coder Social home page Coder Social logo

wav2vec2-sprint's Introduction

This repo is deprecated in favor of https://github.com/jonatasgrosman/huggingsound

Wav2Vec Trainer

This repository is based on https://github.com/jqueguiner/wav2vec2-sprint

Building docker image

Dockerhub available at https://hub.docker.com/r/patilsuraj/hf-wav2vec

to build the docker :

$ docker build -t hf-wav2vec-sprint -f Dockerfile .

to push it to dockerhub First create a repository on dockerhub

$ docker tag hf-wav2vec-sprint your-dockerhub-user/hf-wav2vec-sprint

to push it to dockerhub

$ docker push your-dockerhub-user/hf-wav2vec-sprint

Running WandB sweep

Initialize your sweep from any machine...

$ export WANDB_API_KEY=YOUR_WANDB_API_KEY
$ export WANDB_ENTITY=YOUR_WANDB_ENTITY
$ export WANDB_PROJECT=YOUR_WANDB_PROJECT

$ wandb sweep sweep.yaml

... the execution above will give you a sweep id, save it and on the training machine run:

$ export WANDB_API_KEY=YOUR_WANDB_API_KEY
$ export WANDB_ENTITY=YOUR_WANDB_ENTITY
$ export WANDB_PROJECT=YOUR_WANDB_PROJECT

$ wandb agent YOUR_SWEEP_ID

Uploading model to HF

You need to upload the following files to the HF repository

  • preprocessor_config.json
  • special_tokens_map.json
  • tokenizer_config.json
  • vocab.json
  • config.json
  • pytorch_model.bin
  • README.md (create this file based on the MODEL_CARD.md)
$ git config --global user.email "[email protected]"

$ git config --global user.name "Your name"

$ transformers-cli login

$ transformers-cli repo create your-model-name

$ git clone https://username:[email protected]/username/your-model-name

$ git add .

$ git commit -m "Initial commit"

$ git push

Troubleshooting

  • audioread.exceptions.NoBackendError: $ sudo apt-get install ffmpeg sox libsox-fmt-mp3

Finetuned models

Wav2Vec2-XLSR-53

wav2vec2-sprint's People

Contributors

jonatasgrosman avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.