Coder Social home page Coder Social logo

leohmoraes / datalab Goto Github PK

View Code? Open in Web Editor NEW

This project forked from googledatalab/datalab

0.0 3.0 0.0 24.56 MB

Interactive tools and developer experiences for Big Data on Google Cloud Platform.

License: Apache License 2.0

Shell 5.68% Python 0.21% Smarty 0.10% TypeScript 58.68% HTML 20.72% CSS 3.16% Jupyter Notebook 0.19% JavaScript 11.26%

datalab's Introduction

Google Cloud DataLab

Google Cloud DataLab provides a productive, interactive, and integrated tool to explore, visualize, analyze and transform data, bringing together the power of Python, SQL, JavaScript, and the Google Cloud Platform with services such as BigQuery and Storage.

Google Cloud Datalab Beta

DataLab builds on the interactive notebooks, and the foundation of Jupyter (formerly IPython) to enable developers, data scientists and data analysts to easily work with their data from exploration to developing and deploying data pipelines, all within notebooks.

DataLab deeply integrates into Google Cloud Platform to allow users to extract insights and harness the full value of their data. It provides a secure environment for members of a cloud project to effortlessly access data and resources accessible from the project, and to share notebooks via git.

You can see an example of the notebooks by browsing through the samples and documentation, which are themselves written in the form of notebooks.

Using DataLab and Getting Started

DataLab is packaged as a docker container which contains Jupyter/IPython, and a variety of python libraries such as numpy, pandas, scikit-learn and matplotlib, in a ready-to-use form.

You can run the docker container locally or in GCE, as described in the wiki.

Contacting Us

For support or help using DataLab, please submit questions tagged with google-cloud-datalab on StackOverflow.

For any product issues, you can either submit issues here on this project page, or you can submit your feedback using the feedback link available within the product.

Developing DataLab

Building and Running

The wiki describes the process of setting up a local development environment, as well as the steps to build and run, and the developer workflow.

Contributing

Contributions are welcome! Please see our roadmap page. Please check the page on contributing for more details.

You can always contribute even without code submissions by submitting issues and suggestions to help improve DataLab and building and sharing samples and being a member of the community.

Testing

Please take a look at the test directory for instructions on how to run tests locally.

Repository Overview

This is a quick description of the repository structure to help understand and discover the relevant pieces.

All source code corresponding to product functionality that is built exists within /sources. The following is a list of the individual components:

  • /sources/lib - set of python libraries used to implement APIs to access Google Cloud Platform services, and implement the DataLab interactive experience.

    • api: Google Cloud Platform APIs (currently: BigQuery and Cloud Storage).
    • datalab: interactive notebook experience to plug into Jupyter and IPython.
  • /sources/web - the DataLab web server. This is implemented in node.js and serves the DataLab front-end experience - both content and APIs, as well as backend infrastructure such as notebook source control. Some of the requests are proxied to the Jupyter notebook server, which manages notebooks and associated kernel sessions.

  • /sources/tools - miscellaneous other supporting tools.

Source code builds into the /build directory, and the generated build outputs are consumed when building the DataLab docker container.

The build outputs are packaged in the form of a docker container.

  • /containers/datalab - the only container for now. This is the container that is used as the DataLab AppEngine module.

datalab's People

Contributors

gramster avatar drewbryant avatar yebrahim avatar nikhilk avatar ojarjur avatar jimmc avatar amshali avatar qimingj avatar bryantgipson avatar chmeyers avatar parthea avatar di-ku avatar rnabel avatar craigcitro avatar fischman avatar blois avatar ekrogers avatar haavardw avatar brandondutra avatar daweihuang avatar corrieann avatar umang-sh avatar corrieelston avatar aman-ebay avatar tylere avatar neilsh avatar michaelthebrute avatar fhoffa avatar ekuefler avatar alanyee avatar

Watchers

James Cloos avatar Léo Moraes avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.