Coder Social home page Coder Social logo

saurabhkhandebharad / bigdata-sk Goto Github PK

View Code? Open in Web Editor NEW
1.0 1.0 0.0 7 KB

Analyzed a multicategory e-commerce store using big data techniques on a Kaggle dataset with the help of AWS EC2, AWS S3, PySpark, AWS Glue ETL, AWS Athena, AWS CloudFormation, AWS Lambda and Power BI!

Python 100.00%
aws aws-athena aws-cloudformation aws-ec2 aws-glue-crawler aws-lambda aws-s3-bucket aws-services big-data end-to-end-pipeline

bigdata-sk's Introduction

BigData - Saurabh Khandebharad

E - Commerce Analytics and Big Data Processing (End-To-End Group Project)

Guidance by - Pradeep Tripathi

KAGGLE DATASET: https://www.kaggle.com/datasets/mkechinov/ecommerce-behavior-data-from-multi-category-store

File Name: 2019-Nov.csv

File Size: 8 GB

Project Architecture

Architecture

Being excellent at data analysis and visualization, I volunteered to do the data cleaning and preprocessing in pyspark. Head over to PySpark.py and check my code! Handling such a large data was fun and a learning experience!

๐Ÿ‘‰My PySpark Script

PowerBI Visualizations..

Page 1 - Dashboard Page1

Page 2 - Dashboard Page2



Don't forget to leave a star!โญ:

bigdata-sk's People

Contributors

saurabhkhandebharad avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.