anjunact / fast-data-processing-with-spark-2 Goto Github PK

View Code? Open in Web Editor NEW

Fast-Data-Processing-with-Spark-2

License: MIT License

Scala 9.82% Java 0.48% Python 1.03% Jupyter Notebook 88.67%

fast-data-processing-with-spark-2's Introduction

Fast Data Processing with Spark 2 - Third Edition

This is the code repository for Fast Data Processing with Spark 2 - Third Edition, published by Packt. It contains all the supporting project files necessary to work through the book from start to finish.

Instructions and Navigations

All of the code is organized into folders. Each folder starts with a number followed by the application name. For example, Chapter02.

The code will look like the following:

<dependency>
<groupId>junit</groupId>
<artifactId>junit</artifactId>
<version>4.11</version>
<scope>test</scope>
</dependency>

Like any development platform, learning to develop systems with Spark takes trial and error. Writing programs, encountering errors, and agonizing over pesky bugs are all part of the process. We assume a basic level of programming – Python or Java and experience in working with operating system commands. We have kept the examples simple and to the point. In terms of resources, we do not assume any esoteric equipment for running the examples and developing code. A normal development machine is enough.

anjunact / fast-data-processing-with-spark-2 Goto Github PK

fast-data-processing-with-spark-2's Introduction

Fast Data Processing with Spark 2 - Third Edition

Instructions and Navigations

Related Products

Suggestions and Feedback

fast-data-processing-with-spark-2's People

Contributors

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent