Coder Social home page Coder Social logo

rvaughan / stream-audio-fingerprint Goto Github PK

View Code? Open in Web Editor NEW

This project forked from adblockradio/stream-audio-fingerprint

0.0 2.0 0.0 287 KB

Audio landmark fingerprinting as a Node Stream module

License: GNU Affero General Public License v3.0

JavaScript 100.00%

stream-audio-fingerprint's Introduction

Audio landmark fingerprinting as a Node Stream module

This module is a duplex stream (instance of stream.Transform) that converts a PCM audio signal into a series of audio fingerprints. It works with audio tracks as well as with unlimited audio streams, e.g. broadcast radio.

Credits

The acoustic fingerprinting technique used here is the landmark algorithm, as described in the Shazam 2003 paper. The implementation in codegen_landmark.js has been inspired by the MATLAB routine of D. Ellis "Robust Landmark-Based Audio Fingerprinting" (2009). One significant difference with Ellis' implementation is that this module can handle unlimited audio streams, e.g. radio, and not only finished audio tracks.

Description

In a nutshell,

  • a spectrogram is computed from the audio signal
  • significant peaks are chosen in this time-frequency map. a latency of 500ms is used to determine if a peak is not followed by a bigger peak.
  • fingerprints are computed by linking peaks with dt, f1 and f2, ready to be inserted in a database or to be compared with other fingerprints.

Spectrogram, peaks and pairs

In the background, about 12s of musical content is represented as a spectrogram (top frequency is about 5kHz). The blue marks are the chosen spectrogram peaks. Grey lines are peaks pairs that each lead to a fingerprint.

Threshold and peaks

Given the same audio, this figure shows the same peaks and the internal forward threshold that prevent peaks from being too close in time and frequency. The backward threshold selection is not represented here.

Usage

npm install

The algorithm is in codegen_landmark.js.

A demo usage is proposed in codegen_demo.js.

var decoder = require('child_process').spawn('ffmpeg', [
	'-i', 'pipe:0',
	'-acodec', 'pcm_s16le',
	'-ar', 11025,
	'-ac', 1,
	'-f', 'wav',
	'-v', 'fatal',
	'pipe:1'
], { stdio: ['pipe', 'pipe', process.stderr] });
process.stdin.pipe(decoder.stdin);

var Codegen = require("./codegen_landmark.js");
var fingerprinter = new Codegen();
decoder.stdout.pipe(fingerprinter);

fingerprinter.on("data", function(data) {
	for (var i=0; i<data.tcodes.length; i++) {
		console.log("time=" + data.tcodes[i] + " fingerprint=" + data.hcodes[i]);
	}
});

and then we pipe audio data

curl http://radiofg.impek.com/fg | nodejs codegen_demo.js

License

See LICENSE file.

stream-audio-fingerprint's People

Contributors

dest4 avatar

Watchers

R Vaughan avatar James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.