Coder Social home page Coder Social logo

audio-data-augmentation's Introduction

Audio Processing Toolkit

Overview

This repository houses a collection of Bash scripts for audio processing tasks including data augmentation through pitch shifting, time stretching, and format conversion to WAV using rubberband and ffmpeg.

Scripts and Their Functions

  1. pitch_shift.sh - Modifies the pitch of audio files within a given range.
  2. time_stretch.sh - Changes the playback speed of audio files without altering the pitch.
  3. convert_to_wav.sh - Converts audio files to WAV format, ensuring compatibility across different systems and software.

Prerequisites

  • rubberband-cli on Linux or rubberband on macOS.
  • ffmpeg for audio conversion.

Links to resources

  • Rubberband: https://breakfastquay.com/rubberband/ - This is the official website for the Rubberband audio stretching library, where you can find information about the tool, including documentation and download instructions.

  • FFmpeg: https://ffmpeg.org/ - This is the official website for FFmpeg, an open-source project consisting of a vast software suite of libraries and programs for handling video, audio, and other multimedia files and streams.

Installation

The scripts will guide you through the installation process if these tools are not found.

Usage Instructions

  1. Clone the repository to your system.
  2. Run the desired script with bash <script_name>.sh.
  3. Follow the on-screen prompts to input the required paths.

Example Paths

  • Source directory path: /home/user/audio_files/originals
  • Destination directory path: /home/user/audio_files/processed

Note: Always replace placeholders with your specific paths and details.

Windows

  • the power shell scripts are in the windows folder

Installation

ffmpeg rubberband

How to run the scripts

  • Right-click on the saved file and select "Run with PowerShell".
  • Ensure that rubberband is installed and added to your system's PATH before running the script.

Contributing

Feel free to fork, modify, and send pull requests. Bug reports and suggestions for improvements are also welcome.

audio-data-augmentation's People

Contributors

marcoaccardi avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.