satyam245 / etl-pipeline-bank-transaction Goto Github PK

View Code? Open in Web Editor NEW

Transform daily bank transactions effortlessly with this AWS ETL pipeline. Ingest CSVs to S3, trigger Glue jobs with Lambda, store securely in Parquet, and analyze seamlessly using Athena

Python 100.00%

etl-pipeline-bank-transaction's Introduction

AWS ETL Pipeline for Daily Bank Transactions

🚀 Welcome to my robust ETL (Extract, Transform, Load) pipeline for processing daily bank transactions using Amazon Web Services (AWS).

Overview

This project implements a scalable and serverless ETL pipeline for handling daily bank transactions. The entire process is orchestrated using various AWS services to ensure efficiency, reliability, and security.

Project Breakdown

1️⃣ Data Ingestion

Source Data: Daily CSV files are uploaded to an S3 bucket (s3://your-source-bucket).

2️⃣ Serverless Trigger

AWS Lambda Function: A serverless AWS Lambda function is triggered upon the upload of CSV files. This Lambda function initiates the execution of the AWS Glue job.

3️⃣ Data Transformation

AWS Glue Job: The AWS Glue job is responsible for processing the data. It leverages bookmarking for efficient incremental loads, ensuring that only new data is processed.

4️⃣ Storage

Processed Data: The transformed data is securely stored in another S3 bucket (s3://your-destination-bucket) in the efficient Parquet format.

5️⃣ Analysis

Athena for Analysis: AWS Athena is employed for seamless analysis of the transformed data. Users can run SQL queries on the Parquet data stored in the destination S3 bucket.

Recommend Projects

satyam245 / etl-pipeline-bank-transaction Goto Github PK

etl-pipeline-bank-transaction's Introduction

AWS ETL Pipeline for Daily Bank Transactions

Overview

Project Breakdown

1️⃣ Data Ingestion

2️⃣ Serverless Trigger

3️⃣ Data Transformation

4️⃣ Storage

5️⃣ Analysis

etl-pipeline-bank-transaction's People

Contributors

Stargazers

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent