Coder Social home page Coder Social logo

audio-competition's Introduction

Audio-Competition

Small datasets to test novel audio algorithms, heavily influenced by imagenette

The Datasets

ESC-50: Environmental Sound Classification

The ESC-50 dataset is a labeled collection of 2000 environmental audio recordings suitable for benchmarking methods of environmental sound classification.

The dataset consists of 5-second-long recordings organized into 50 semantical classes (with 40 examples per class) loosely arranged into 5 major categories:

ESC-50 is a really nice starting dataset as it is especially clean (fixed-length, hand-labeled, single sample-rate) and well maintained. Many thanks to Karol Piczak for maintaining a really great Github Repo based around this dataset.

Usage

If you are already using the fastaudio library, you can download and access these quickly with commands like:

path = untar_data(URLs.ESC50)

where path now stores the destination to ESC-50.

Leaderboard

Generally you'll see +/- 1% differences from run to run since it's quite a small validation set. So please only send in contributions that are higher than the reported accuracy >80% of the time. Here's the rules:

  • No inference time tricks, e.g. no: TTA
  • Must be one of the split/#epoch combinations listed in the table
  • If you have the resources to do so, try to get an average of 5 runs, to get a stable comparison. Use the "# Runs" column to include this
  • In the URL column include a link to a notebook, blog post, gist, or similar which explains what you did to get your result, and includes the code you used (or a link to it), including the exact commit, so that others can reproduce your result.

ESC-50 Fold 1 Leaderboard

  • For this leaderbord, use only the data corresponding to Fold 1 to validate, and train on the other 4 folds.
Epochs URL Accuracy # Runs
80 fastaudio baseline w/ mixup 78.25% 1
20 fastaudio baseline 66.64% 5, mean
10 fastaudio baseline 62.69% 5, mean

ESC-50 All Folds Leaderboard

  • For this leaderbord, do 5-fold cross validation using the splits defined in the metadata, and report the mean accuracy.

  • The results here are also comparable with the official dataset leaderbods

Epochs URL Accuracy # Runs
80 no entries yet -- --
20 fastaudio baseline 67.35% 1
10 fastaudio baseline 64.94% 1

audio-competition's People

Contributors

scart97 avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.