Coder Social home page Coder Social logo

Satyam's Projects

airflow-logistics-data-pipeline icon airflow-logistics-data-pipeline

Streamline logistics data orchestration with Apache Airflow on Google Cloud Platform. Automate ingestion, transformation, and storage of CSV files in Google Cloud Storage (GCS) into Hive tables on Google Cloud Dataproc. Utilizes dynamic partitioning for scalability and efficiency.

aws-airline-ingestion-pipeline icon aws-airline-ingestion-pipeline

Efficiently ingest daily airline data into AWS using a seamless end-to-end pipeline, integrating S3 uploads, Glue schema discovery, Redshift data transformation, and SNS notifications.

aws-cloud-project icon aws-cloud-project

‘Save To the Cloud’ is a full stack web application that mainly deals with storing and saving files by leveraging cloud infrastructure.

data-science-ipython-notebooks icon data-science-ipython-notebooks

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

ecommerce-integration-pipeline icon ecommerce-integration-pipeline

🌐 Seamless E-commerce Data Integration Pipeline using Python, GCP Pub/Sub, DataStax Cassandra, and Pandas. This repository includes scripts to load, publish, and consume data, along with instructions for setting up the environment. Simplify your data integration process for e-commerce orders with this efficient and scalable solution.

etl-pipeline-bank-transaction icon etl-pipeline-bank-transaction

Transform daily bank transactions effortlessly with this AWS ETL pipeline. Ingest CSVs to S3, trigger Glue jobs with Lambda, store securely in Parquet, and analyze seamlessly using Athena

kafka-mongo-logistics-integration icon kafka-mongo-logistics-integration

A Python application integrating Kafka and MongoDB for efficient logistics data processing, with Avro serialization, Docker scaling, and an API for seamless interaction.

order-tracking-incremental-load-project icon order-tracking-incremental-load-project

The Order Tracking Incremental Load Project automates the integration of order tracking data using Apache Spark in Databricks. Leveraging Google Cloud Storage (GCS) for input, it features efficient stage processing, upserts to a target Delta table, and automated execution.

realtime-orders-data-processing icon realtime-orders-data-processing

"Real-Time Data Processing with GCP Pub/Sub and DataStax Cassandra" is a project demonstrating the integration of Google Cloud Platform's Pub/Sub and DataStax Cassandra for efficient real-time data processing. It handles orders and payments data streams, ingesting them via Pub/Sub and storing them in Cassandra tables.

stock-market-kafka-pipeline icon stock-market-kafka-pipeline

Stock Market Kafka Project: A robust data pipeline leveraging Python, Confluent Kafka, AWS S3, IAM, Glue, and Athena to efficiently process and analyze stock market data. Streamline your workflow from CSV ingestion to insights with this comprehensive solution.

udacity-data-engineering-projects icon udacity-data-engineering-projects

Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.