Data Engineering Zoomcamp 2024

This repository contains the projects, assignments, and code I've worked on as part of the Data Engineering Zoomcamp offered by DataTalks.Club. The Zoomcamp is a comprehensive online program that teaches the skills and knowledge needed to pursue a career in Data Engineering.

Course Overview

The Data Engineering Zoomcamp covers a wide range of topics, including data modeling, data pipelines, batch and stream processing, data warehousing, and various data engineering tools and technologies. Throughout the course, I've gained hands-on experience with industry-standard tools such as Apache Spark, Apache Kafka, Docker, Mage, PostgreSQL, Redpanda, dbt Cloud, and dlt.

Repository Structure

The repository is organized into several folders, each representing a module or workshop covered in the Zoomcamp:

  • Module 1: Containerization and Infrastructure as Code
  • Module 2: Workflow Orchestration with Mage
  • Module 3: Data Warehouse and Big Data
  • Module 4: Analytics Engineering with dbt
  • Module 5: Batch Processing with Apache Spark
  • Module 6: Streaming Data with Apache Spark, Apache Kafka, and Redpanda
  • Workshop 1: Data Ingestion with dlt
  • Workshop 2: Stream Processing with RisingWave

Each folder contains the assignments, code examples, and documentation for its topic.

Learning Outcomes

Through this course, I have acquired a comprehensive understanding of Data Engineering principles and best practices. Some key areas of learning include:

  • Data modeling techniques
  • Building robust and scalable data pipelines
  • Batch and stream processing with Apache Spark and Apache Kafka
  • Data warehousing concepts and implementation with cloud-based solutions
  • Containerization with Docker, workflow orchestration with Mage, and working with PostgreSQL
  • Proficiency in SQL, Python, and related Data Engineering tools

This repository serves as a showcase of my work and demonstrates my proficiency in various Data Engineering tools and technologies.

Contact

If you have any questions or would like to discuss my work further, please feel free to reach out to me on LinkedIn.
