Coder Social home page Coder Social logo

priyanshu9898 / audio-accuracy-detection Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 511 KB

Home Page: https://audioaccuracy.vercel.app

JavaScript 80.84% HTML 12.96% CSS 6.20%
audio-analysis file-upload gcp nodejs reactjs audio-accuracy-classification gcp-speech-to-text

audio-accuracy-detection's Introduction

Audio Transcription API using Google Cloud Speech-to-Text

Project Description:

This project is a simple RESTful API that allows users to upload audio files and transcribe them using Google Cloud Speech-to-Text service. The API receives an audio file and returns a transcription of the content. The project is built with Node.js, Express, and Google Cloud Speech-to-Text API.

Table of Contents

1. Requirements
2. Installation
3. Usage
4. API Endpoints
5. Contributing
6. License

Requirements

Node.js v14.x or later
Google Cloud Platform account with Speech-to-Text API enabled
Google Cloud SDK (gcloud) installed and configured

Run Locally

Clone the project

  git clone https://github.com/Priyanshu9898/PlayPower-Labs-Assignment

Go to the project directory

  cd PlayPower-Labs-Assignment

Go to the frontend directory and Install dependencies

  cd frontend
  npm install

Go to the backend directory and Install dependencies

  cd backend
  npm install

Set up Google Cloud authentication:

Download the JSON key file from Google Cloud Console for your service account.

Set the environment variable GOOGLE_APPLICATION_CREDENTIALS to the path of the JSON key file. For example, on Linux or macOS:
export GOOGLE_APPLICATION_CREDENTIALS="/path/to/your/keyfile.json"

Or on Windows (PowerShell):

$env:GOOGLE_APPLICATION_CREDENTIALS="C:\path\to\your\keyfile.json"

Usage

Start the frontend server

  npm start

Start the backend server

  npm run dev

Environment Variables

PORT: PORT number

Tech Stack

Client: React, react-bootstrap, tsparticles

Server: Node, Express

Database: MongoDB

API Reference

Get all items

  POST /api/v1/transcribe

Upload an audio file and receive a transcription of the content.

Request: Content-Type: multipart/form-data

Parameter Type Description
file binary Required. The audio file

Response: Content-Type: application/json

Parameter Type Description
transcription string Required. The transcribed text of the audio

Screenshots

App Screenshot 1

App Screenshot 2

App Screenshot 3

App Screenshot 4

Badges

Add badges from somewhere like: shields.io

MIT License GPLv3 License AGPL License

๐Ÿ”— Links

portfolio linkedin twitter Medium

Demo

Insert gif or link to demo

License

MIT

audio-accuracy-detection's People

Contributors

priyanshu9898 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.