Coder Social home page Coder Social logo

ml_speech_recognition_google_api's Introduction

Google Cloud Speech API Python Samples

for small files run transcribe.py for large files run transcribe_async.py

https://gstatic.com/cloudssh/images/open-btn.png

This directory contains samples for Google Cloud Speech API. The Google Cloud Speech API enables easy integration of Google speech recognition technologies into developer applications. Send audio and receive a text transcription from the Cloud Speech API service.

  • See the migration guide for information about migrating to Python client library v0.27.

Setup

Authentication

This sample requires you to have authentication setup. Refer to the Authentication Getting Started Guide for instructions on setting up credentials for applications.

Install Dependencies

  1. Clone python-docs-samples and change directory to the sample directory you want to use.

    $ git clone https://github.com/GoogleCloudPlatform/python-docs-samples.git
  2. Install pip and virtualenv if you do not already have them. You may want to refer to the Python Development Environment Setup Guide for Google Cloud Platform for instructions.

  3. Create a virtualenv. Samples are compatible with Python 2.7 and 3.4+.

    $ virtualenv env
    $ source env/bin/activate
  4. Install the dependencies needed to run the samples.

    $ pip install -r requirements.txt

Samples

Quickstart

https://gstatic.com/cloudssh/images/open-btn.png

To run this sample:

$ python quickstart.py

Transcribe

https://gstatic.com/cloudssh/images/open-btn.png

To run this sample:

$ python transcribe.py

usage: transcribe.py [-h] path

Google Cloud Speech API sample application using the REST API for batch
processing.

Example usage:
    python transcribe.py resources/audio.raw
    python transcribe.py gs://cloud-samples-tests/speech/brooklyn.flac

positional arguments:
  path        File or GCS path for audio file to be recognized

optional arguments:
  -h, --help  show this help message and exit

Transcribe async

https://gstatic.com/cloudssh/images/open-btn.png

To run this sample:

$ python transcribe_async.py

usage: transcribe_async.py [-h] path

Google Cloud Speech API sample application using the REST API for async
batch processing.

Example usage:
    python transcribe_async.py resources/audio.raw
    python transcribe_async.py gs://cloud-samples-tests/speech/vr.flac

positional arguments:
  path        File or GCS path for audio file to be recognized

optional arguments:
  -h, --help  show this help message and exit

Transcribe with word time offsets

https://gstatic.com/cloudssh/images/open-btn.png

To run this sample:

$ python transcribe_word_time_offsets.py

usage: transcribe_word_time_offsets.py [-h] path

Google Cloud Speech API sample that demonstrates word time offsets.

Example usage:
    python transcribe_word_time_offsets.py resources/audio.raw
    python transcribe_word_time_offsets.py         gs://cloud-samples-tests/speech/vr.flac

positional arguments:
  path        File or GCS path for audio file to be recognized

optional arguments:
  -h, --help  show this help message and exit

Transcribe Streaming

https://gstatic.com/cloudssh/images/open-btn.png

To run this sample:

$ python transcribe_streaming.py

usage: transcribe_streaming.py [-h] stream

Google Cloud Speech API sample application using the streaming API.

Example usage:
    python transcribe_streaming.py resources/audio.raw

positional arguments:
  stream      File to stream to the API

optional arguments:
  -h, --help  show this help message and exit

Beta Samples

https://gstatic.com/cloudssh/images/open-btn.png

To run this sample:

$ python beta_snippets.py

usage: beta_snippets.py [-h] command path

Google Cloud Speech API sample that demonstrates enhanced models
and recognition metadata.

Example usage:
    python beta_snippets.py enhanced-model resources/commercial_mono.wav
    python beta_snippets.py metadata resources/commercial_mono.wav
    python beta_snippets.py punctuation resources/commercial_mono.wav

positional arguments:
  command
  path        File for audio file to be recognized

optional arguments:
  -h, --help  show this help message and exit

The client library

This sample uses the Google Cloud Client Library for Python. You can read the documentation for more details on API usage and use GitHub to browse the source and report issues.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.