Coder Social home page Coder Social logo

satyam245 / stock-market-kafka-pipeline Goto Github PK

View Code? Open in Web Editor NEW
1.0 1.0 0.0 2.93 MB

Stock Market Kafka Project: A robust data pipeline leveraging Python, Confluent Kafka, AWS S3, IAM, Glue, and Athena to efficiently process and analyze stock market data. Streamline your workflow from CSV ingestion to insights with this comprehensive solution.

Python 100.00%

stock-market-kafka-pipeline's Introduction

Stock Market Kafka Project ๐Ÿš€๐Ÿ“ˆ

Overview

Welcome to my small project, the "Stock Market Kafka Project"! In this project, I've crafted a robust data pipeline using Python, Confluent Kafka, AWS S3, IAM, Glue, and Athena. The goal is to efficiently process and analyze stock market data through a well-defined workflow.

Technologies Used

  • Programming Language: Python

  • Amazon Web Services (AWS):

    • S3 (Simple Storage Service)
    • Athena
    • AWS Glue
      • Glue Crawler
      • Glue Catalog
    • Amazon EC2
  • Confluent Kafka

Workflow Overview ๐Ÿ“Š

  1. Read data from CSV:

    • Utilize Python to read stock market data from a CSV file, preparing it for further processing.
  2. Send it to Confluent Kafka with Avro schema:

    • Leverage Confluent Kafka to efficiently stream the data with Avro schema for effective serialization.
  3. Consume, transform, and store individually in AWS S3 as JSON:

    • Develop a consumer to process and transform the data, storing it individually in AWS S3 in JSON format.
  4. Glue Crawler does its magic, creating a data catalog:

    • Employ AWS Glue Crawler to automatically discover and catalog the data in the S3 bucket, enhancing data management.
  5. Athena steps in, empowering me to query insights seamlessly:

    • Utilize AWS Athena for effortless querying and gaining valuable insights from the processed data.

Prerequisites

Before you begin with the setup, make sure you have the following:

  • Programming Language: Python 3.6 or higher

  • Amazon Web Services (AWS):

    • An AWS account with the following services configured:
  • Confluent Kafka: (https://www.confluent.io/)

    • An active Confluent Kafka cluster
    • Necessary credentials for accessing the Kafka cluster

For more info checkout this video- Video Link - https://www.youtube.com/embed/KerNf0NANMo

stock-market-kafka-pipeline's People

Contributors

satyam245 avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.