Welcome to my small project, the "Stock Market Kafka Project"! In this project, I've crafted a robust data pipeline using Python, Confluent Kafka, AWS S3, IAM, Glue, and Athena. The goal is to efficiently process and analyze stock market data through a well-defined workflow.
Technologies used:

- Programming Language: Python
- Amazon Web Services (AWS):
  - S3 (Simple Storage Service)
  - Athena
  - AWS Glue
  - Glue Crawler
  - Glue Catalog
  - Amazon EC2
- Confluent Kafka

The workflow:

- Read data from CSV:
  - Utilize Python to read the stock market data from a CSV file, preparing it for further processing (see the sketch below).
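
A minimal sketch of this step, assuming `pandas` is installed; the file name and columns are placeholders for whatever your dataset actually uses:

```python
import pandas as pd

# "stock_market_data.csv" is a placeholder; point this at your dataset.
df = pd.read_csv("stock_market_data.csv")

# Turn each row into a plain dict so it can be Avro-serialized and
# produced to Kafka one record at a time.
records = df.to_dict(orient="records")
```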
- Send it to Confluent Kafka with an Avro schema:
  - Leverage Confluent Kafka to stream the data, with an Avro schema providing compact, well-typed serialization (see the producer sketch below).
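
A producer sketch using the `confluent-kafka` Python client with Schema Registry. The topic name, the schema fields, and every angle-bracket placeholder are assumptions; substitute your cluster's actual values:

```python
from confluent_kafka import SerializingProducer
from confluent_kafka.schema_registry import SchemaRegistryClient
from confluent_kafka.schema_registry.avro import AvroSerializer
from confluent_kafka.serialization import StringSerializer

# Hypothetical Avro schema for one stock record; adjust the fields to your CSV.
schema_str = """
{
  "type": "record",
  "name": "StockRecord",
  "fields": [
    {"name": "symbol", "type": "string"},
    {"name": "date",   "type": "string"},
    {"name": "close",  "type": "double"}
  ]
}
"""

schema_registry = SchemaRegistryClient({
    "url": "<SCHEMA_REGISTRY_URL>",
    "basic.auth.user.info": "<SR_API_KEY>:<SR_API_SECRET>",
})

producer = SerializingProducer({
    "bootstrap.servers": "<BOOTSTRAP_SERVERS>",
    "security.protocol": "SASL_SSL",
    "sasl.mechanisms": "PLAIN",
    "sasl.username": "<KAFKA_API_KEY>",
    "sasl.password": "<KAFKA_API_SECRET>",
    "key.serializer": StringSerializer("utf_8"),
    "value.serializer": AvroSerializer(schema_registry, schema_str),
})

# Example record matching the hypothetical schema above.
record = {"symbol": "AAPL", "date": "2023-01-03", "close": 125.07}
producer.produce(topic="stock_data", key=record["symbol"], value=record)
producer.flush()  # block until the message is delivered
```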
- Consume, transform, and store individually in AWS S3 as JSON:
  - Develop a consumer that processes and transforms the data, storing each record individually in AWS S3 in JSON format (see the consumer sketch below).
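
A consumer sketch under the same assumptions (placeholders for the cluster, bucket, and topic names), pairing `confluent-kafka`'s Avro deserializer with `boto3` for the S3 writes:

```python
import json

import boto3
from confluent_kafka import DeserializingConsumer
from confluent_kafka.schema_registry import SchemaRegistryClient
from confluent_kafka.schema_registry.avro import AvroDeserializer

schema_registry = SchemaRegistryClient({
    "url": "<SCHEMA_REGISTRY_URL>",
    "basic.auth.user.info": "<SR_API_KEY>:<SR_API_SECRET>",
})

consumer = DeserializingConsumer({
    "bootstrap.servers": "<BOOTSTRAP_SERVERS>",
    "security.protocol": "SASL_SSL",
    "sasl.mechanisms": "PLAIN",
    "sasl.username": "<KAFKA_API_KEY>",
    "sasl.password": "<KAFKA_API_SECRET>",
    "group.id": "stock-data-consumers",
    "auto.offset.reset": "earliest",
    # The reader schema is fetched from the registry automatically.
    "value.deserializer": AvroDeserializer(schema_registry),
})
consumer.subscribe(["stock_data"])

s3 = boto3.client("s3")

count = 0
while True:
    msg = consumer.poll(1.0)   # wait up to 1 s for the next message
    if msg is None or msg.error():
        continue
    record = msg.value()       # already a dict, thanks to AvroDeserializer
    # One JSON object per file, so the Glue Crawler can infer the schema
    # and Athena can query the prefix as a single table.
    s3.put_object(
        Bucket="<YOUR_BUCKET>",
        Key=f"stock-data/record_{count}.json",
        Body=json.dumps(record),
    )
    count += 1
```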
- Glue Crawler creates a data catalog:
  - Employ an AWS Glue Crawler to automatically discover and catalog the data in the S3 bucket, so no schema has to be maintained by hand (see the sketch below).
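
The crawler can be set up in the AWS console; a programmatic equivalent with `boto3` might look like this, with the crawler name, IAM role ARN, database name, and S3 path all placeholders:

```python
import boto3

glue = boto3.client("glue")

# All names below are placeholders; the role needs S3 read access
# and the AWSGlueServiceRole managed policy.
glue.create_crawler(
    Name="stock-data-crawler",
    Role="arn:aws:iam::<ACCOUNT_ID>:role/<GLUE_SERVICE_ROLE>",
    DatabaseName="stock_market_db",
    Targets={"S3Targets": [{"Path": "s3://<YOUR_BUCKET>/stock-data/"}]},
)
glue.start_crawler(Name="stock-data-crawler")
```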
- Athena steps in, enabling seamless queries:
  - Utilize AWS Athena to run SQL directly against the processed data in S3 and extract insights (see the example below).
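
Queries can be run from the Athena console; here is a `boto3` sketch, assuming the database and table names the hypothetical crawler above would produce:

```python
import boto3

athena = boto3.client("athena")

# Database, table, and output location are placeholders; Athena writes
# query results to the OutputLocation S3 prefix.
response = athena.start_query_execution(
    QueryString="""
        SELECT symbol, date, close
        FROM stock_data
        ORDER BY date DESC
        LIMIT 10
    """,
    QueryExecutionContext={"Database": "stock_market_db"},
    ResultConfiguration={"OutputLocation": "s3://<YOUR_BUCKET>/athena-results/"},
)
print("Started query:", response["QueryExecutionId"])
```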
Before you begin with the setup, make sure you have the following:
- Programming Language: Python 3.6 or higher
- Amazon Web Services (AWS):
  - An AWS account with the following services configured:
    - S3 (Simple Storage Service)
    - Athena
    - AWS Glue, set up with a Glue Crawler and the Glue Catalog
- Confluent Kafka (https://www.confluent.io/):
  - An active Confluent Kafka cluster
  - The necessary credentials for accessing the cluster
For more info, check out this video: https://www.youtube.com/embed/KerNf0NANMo