linearregression Goto Github PK
Name: edwardttril
Type: User
Location: San Francisco Bay Area, CA
Name: edwardttril
Type: User
Location: San Francisco Bay Area, CA
Data pipeline is a tool to run Data loading pipelines. It is an open sourced app engine app that users can extend to suit their own needs. Out of the box it will load files from a source, transform them and then output them (output might be writing to a file or loading them into a data analysis tool). It is designed to be modular and support various sources, transformation technologies and output types. The transformations can be chained together to form complex pipelines.
Data Science Course Materials - Fall 2014
Coursera class in data science using R
A card-based approach to data structures
OLAP cubes R data type
R's data.table package extends data.frame. More info:
Python script to collect social data from various channels, store that to Apache Spark for further processing.
Command line utilities for data analysis
Basho Data Platform Core
A web-based pedagogical tool for exploring data structures.
Source-agnostic distributed change data capture system
Helpers for transparently downloading datasets
Multidimensional data storage with rollups for numerical data
Divide and Recombine
Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Go package for working with Vaultaire dataframes
Estimating Finite State Machine Models from Data
An experimental hosted platform (GitHub-like) for organizing, managing, sharing, collaborating, and making sense of data.
Connect processes into powerful data pipelines with a simple git-like filesystem interface
DataLoader is a generic utility to be used as part of your application's data fetching layer to provide a consistent API over various backends and reduce requests to those backends via batching and caching.
simplified query engine based on logic programming paradigm
The repository contains slides, code and markdown files for the Revolution Analytics Webinar: Data Science with R given 9/25/14
Data team culture
Fast, efficiently stored Trie for Python. Uses libdatrie.
pure functional data structure for Erlang
A repository `similar` to Ecto.Repo that maps to an underlying http client, sending requests to an external rest api instead of a database
A database proxy service for a microservices ecosystem
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.