Coder Social home page Coder Social logo

hadryan / audiopyle Goto Github PK

View Code? Open in Web Editor NEW

This project forked from emkor/audiopyle

0.0 2.0 0.0 2.64 MB

Audio feature extraction engine based on VAMP plugins - (Usefull Vamp Plugins configurations)

License: GNU General Public License v3.0

Makefile 1.05% Python 84.33% Shell 2.91% JavaScript 2.53% CSS 0.02% HTML 9.17%

audiopyle's Introduction

audiopyle Build Status

audiopyle

Docker-based app extracting features from audio files (preferably, music). Makes use of standard VAMP plugins for feature extraction.

Quick start

To start the app, follow these steps:

  • make sure directories with music files (.mp3, .flac formats) and VAMP plugins (.so files in case of Linux) are mapped correctly in scripts/docker-compose.yml under sections:
    • api/volumes
    • worker/volumes
  • do make run from the directory with Makefile
  • docker should pull the audiopyle images now
  • there's simple UI available on port at http://localhost:8008/

App REST API description

YOu can use REST API of an app that is available at http://localhost:8080/ Hosted audiopyle API documentation is available from GitHub Pages Alternative way is to paste open_api_docs.yml (OpenAPI / Swagger 2.0 compliant spec) file contents into Swagger editor

How it works (mental model)

  • Extraction takes as input:
    • filename of an audio file (those are listed in /audio endpoint)
    • VAMP plugin full key - including plugin output (those are listen under /plugin endpoint)
    • optional: VAMP plugin config (if empty, defaults from /config/plugin for this plugin will be used)
    • optional: Metric config (if empty, defaults from /config/metric for this plugin will be used)
  • Extraction outputs:
    • id3tag of an audio file, if included in the file
    • raw feature data; might be 1-, 2- or 3-dimensional; in constant or variable step format; see see VAMP docs for details
    • Metric values, if included in the request
  • What is Metric?
    • Metric value is a bunch of statistics (mean, min, max, sum, variance etc.) over a vector of data selected from raw feature
    • Metric definition is a description of how to calculate vector mentioned above from raw feature data
    • Default metric definitions are defined in audiopyle/scripts/resources/config/metric.json
    • Example: bbc_intensity_lo_bass
      • it's a Metric telling about sub-bass intensity of the track
      • Metric value is calculated from raw feature data produced by bbc-vamp-plugins:bbc-intensity:intensity-ratio VAMP plugin output
      • the input vector is a first row of a 3-dimensional raw feature produced by the VAMP plugin output
      • standards statistics functions are applied over the vector, producing Metric value
  • Automated extraction
    • /request/automation endpoint is triggered by empty POST request (see OpenAPI documentation for details)
    • the API creates Cartesian product of all available audio files and all available VAMP plugins excluding those blacklisted (/config/blacklist endpoint, audiopyle/scripts/resources/config/blacklist.json file))
    • plugin config and metric config are taken from their respective /config endpoints
    • all the requests are queued for execution, API responds with UUIDs (task_id) for each generated request

Building from source

To build Docker images, follow these steps:

  • do one-time setup: make config
  • then, to trigger full-build and test procedure, do make all
  • take a look at Makefile to see what build steps are doing

audiopyle's People

Contributors

emkor avatar jkorzeniowski avatar mkorman9 avatar urxvtcd avatar ziemnior avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.