Duc Anh's Projects
Project to get hands on Airflow, and some modern data stack services. (Not finished yet)
This project is designed to provide practical experience with Airflow for workflow management, Soda for data quality checks, and Snowflake for secure and efficient data storage.
A curated list of awesome dbt resources
This project showcases the creation and execution of a comprehensive data pipeline, utilizing Azure services for data manipulation, storage, and visualization within an analytics framework.
The Data Pipeline and Analytics Stack is a comprehensive solution designed for processing, storing, and visualizing data. Explore a complete data pipeline with all components seamlessly set up and ready to use
Free Data Engineering course!
This project provides a practical experience with dbt, encompassing key aspects such as testing, data transformation, and integration with Snowflake. It's an excellent opportunity to dive into the intricacies of data management and enhance your skills in a real-world context.
Multi-container environment with Hadoop, Spark and Hive
My README, copy if you interest
The system processes real-time data and generates analytics, demonstrating the power of these technologies in an industry-grade data pipeline.
EnglishQuiz Project is an interactive JavaFX-based application designed to improve English language skill through quizzes.
Docker multi-container environment with Hadoop, Spark and Hive, Airflow, Superset
Welcome to the "Complete Machine Learning & Data Science Bootcamp 2023" on Udemy! This comprehensive course is designed to take you from a beginner to a proficient Data Scientist and Machine Learning engineer.
Build a movie recommendation data pipeline using Azure services for efficient data ingestion, transformation, and orchestration. Utilize Azure Blob Storage, Azure Databricks, and Azure Data Factory to implement collaborative filtering and PySpark ML for accurate movie recommendations.
This repository contains a simple calculator application built using C# .NET, along with unit tests to ensure the correctness of its functions. The calculator provides basic arithmetic operations and serves as a demonstration of how to apply unit testing to a C# .NET project.
This project demonstrates the implementation of a Change Data Capture (CDC) system using Debezium, Kafka, Postgres, and Docker.
Use machine learning models to predict the value of Walmart Weekly Sales