Coder Social home page Coder Social logo

transform's Introduction

TensorFlow Transform PyPI

TensorFlow Transform (tf.Transform) is a library for preprocessing data with TensorFlow. tf.Transform is useful for preprocessing that requires a full pass the data, such as:

  • normalizing an input value by mean and stdev
  • integerizing a vocabulary by looking at all input examples for values
  • bucketizing inputs based on the observed data distribution

TensorFlow already supports arbitrary manipulations on a single example or batch of examples. tf.Transform extends the capabilities to support full passes over the example data.

The output of tf.Transform is exported as a TensorFlow graph for incorporation into training and serving. Using the same graph for both training and serving can prevent training/serving skew, because the same transforms are performed in both scenarios.

tf.Transform may introduce backwards incompatible changes before version 1.0.

Installation and Dependencies

The easiest and recommended way to install tf.Transform is with the PyPI package.

pip install tensorflow-transform

Currently tf.Transform requires that TensorFlow be installed but does not have an explicit dependency on TensorFlow as a package. See TensorFlow documentation for more information on installing TensorFlow.

tf.Transform requires Apache Beam to run distributed analysis. Apache Beam runs in local mode by default, and can also run in distributed mode using Google Cloud Dataflow. tf.Transform is designed to be extensible to other Apache Beam runners.

Getting Started

For instructions on using tf.Transform see the getting started guide.

Compatible Versions

This is a table of versions known to be compatible with each other, based on our testing framework. Other combinations may also work, but are untested.

tensorflow-transform tensorflow apache-beam[gcp]
GitHub master nightly (1.x) 2.4.0
0.6.0 1.6 2.4.0
0.5.0 1.5 2.3.0
0.4.0 1.4 2.2.0
0.3.1 1.3 2.1.1
0.3.0 1.3 2.1.1
0.1.10 1.0 2.0.0

transform's People

Contributors

davidsoergel avatar elmer-garduno avatar gweidner avatar kestertong avatar robertwb avatar sfujiwara avatar terrytangyuan avatar zoyahav avatar

Watchers

 avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.