Coder Social home page Coder Social logo

darad / smc-master-thesis Goto Github PK

View Code? Open in Web Editor NEW

This project forked from lennartnicolas/smc-master-thesis

0.0 0.0 0.0 93.92 MB

A comparative study on improving sound similarity maps with semantic metadata

JavaScript 4.92% Python 1.20% CSS 0.18% HTML 0.70% Jupyter Notebook 93.00%

smc-master-thesis's Introduction

A comparative study on improving sound similarity maps with semantic metadata

This repository contains all data relating to the master thesis on sound similarity maps conducted at University Pompeu Fabra, Barcelona. [Not yet published] The paper is accessible here.

Further you can checkout the interface of the similarity maps on GitHub Pages.

Abstract

Searching and browsing appropriate sounds within large collections of audio samples can be challenging for musicians and sound designers. Most commonly, list-based search approaches are being used for displaying content for music production, however several attempts have been made to improve user experience by projecting sounds in a two dimensional map. These maps usually rely on dimensionality reduction methods like PCA, UMAP or t-SNE to translate an audio embedding or another high dimensional feature representation into a low dimensional latent space, which typically involves a trade-off between the preservation of the global and local structure of the data. Providing metadata or custom distance measures to the algorithms can improve the clustering, which however requires correct labels and a solid feature representation. In this work, we address this issue by including user metadata for classification refinement of the audio to achieve an improved label description and post-process the point positions of the projection with the help of class probabilities. We conducted a comparative study of different map layouts to understand the usefulness of the aforementioned method to improve sound similarity projections. In our study we found that adding semantics in a hierarchical manner and having a more concise local structure assist both sound searching and explorational browsing.

smc-master-thesis's People

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.