Coder Social home page Coder Social logo

alisonbma / aisfx Goto Github PK

View Code? Open in Web Editor NEW
40.0 3.0 4.0 61 KB

Representation Learning for the Automatic Indexing of Sound Effects Libraries (ISMIR 2022): Deep audio embeddings pre-trained on UCS & Non-UCS-compliant datasets.

Home Page: https://aisfx.readthedocs.io/en/latest

License: Creative Commons Attribution 4.0 International

Python 100.00%
deep-learning embedding-models machine-learning music-information-retrieval representation-learning sound-effects-library universal-category-system

aisfx's Introduction

aiSFX

Picture

Representation Learning for the Automatic Indexing of Sound Effects Libraries (ISMIR 2022): Deep audio embeddings pre-trained on UCS & Non-UCS-compliant datasets.

This work was inspired by the creation of the Universal Category System (UCS), an industry-proposed public domain initiative initialized by Tim Nielsen, Justin Drury, Kai Paquin, and others. First launching in the fall of 2020, UCS offers a standardized framework for sound effects library metadata designed by and for sound designers and editors.

How To Use

Please refer to this package's documentation for Installation Instructions and Tutorials of how to extract embeddings.

Click the above to visualize coarse-level "Category" UCS classes in Pro Sound Effects (PSE), Soundly (SDLY), and UCS Mixed (UMIX).

Cite This Work

Please cite the paper below if you use it in your work.

This paper has been accepted at the 23rd International Society for Music Information Retrieval Conference (ISMIR) in Bengaluru, India (December 04-08, 2022). To cite our work, please refer to the following.

[1] Representation Learning for the Automatic Indexing of Sound Effects Libraries

  @inproceedings{ismir_aisfx,
    title={Representation Learning for the Automatic Indexing of Sound Effects Libraries},
    author={Ma, Alison Bernice and Lerch, Alexander},
    booktitle={Proceedings of the 23rd International Society for Music Information Retrieval Conference (ISMIR)},
    year={2022},
    pages={866--875}
  }

Acknowledgements

We would like to thank those who provided the data required to conduct this research as well as those who took the time to share their insights and software licenses for tools regarding sound search, query, and retrieval.

Universal Category System (UCS)Alex LaneAll You Can Eat AudioArticulated Sounds • Audio ShadeaXLSoundBig Sound BankBaseHeadBonsonBOOM LibraryFrick & TraaHzandbitsInspectorJKai PaquinKEDR AudioKrotos AudioNikola SimikicPenguin GrenadePro Sound EffectsRick Allen CreativeSononymSound IdeasSoundlySoundminerStoryblocksTim NielsenThomas Rex BeverlyZapSplat

License: Pre-trained Model & Paper

This pre-trained model and paper [1] is made available under a Creative Commons Attribution 4.0 International License (CC BY 4.0).

aisfx's People

Contributors

alisonbma avatar carlthome avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

aisfx's Issues

the UCS-compliant datasets mentioned in your paper

hi, i'm currently working on a research about SFX. The most troublesome problem is how to get a dataset.
I wonder that if some of the UCS-compliant datasets are open-source? if yes, could you please provide some? or tell me if there is any way to get them? thanks!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.