benuk005's Projects
30 days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days. This challenge may take more than100 days, follow your own pace.
Analytics Challenge
The program can be used to scrape the content from an article from web by an input of a set of URLs in a text file or a URL. This project uses newspaper3k and python-docx libraries. The output of this program will give a neatly modified Word Document in '.docx' format with the contents of the article.
A curated list of awesome things related to Flask
A curated list of awesome Python frameworks, libraries, software and resources
This solution helps you deploy ETL jobs on data lake using CDK Pipelines.
A serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda.
Config files for my GitHub profile.
Big Data Engineering practice project, including ETL with Airflow and Spark using AWS S3 and EMR
AWS CloudFormation template for CI/CD pipeline on AWS for Python
Solution to all projects of Udacity's Data Engineering Nanodegree: Data Modeling with Postgres & Cassandra, Data Warehouse with Redshift, Data Lake with Spark and Data Pipeline with Airflow.
Personal Data Engineering Projects
Example end to end data engineering project.
Teaching Materials for Distributed Statistical Computing (大æ°æŽåå¸åŧ莥įŽæåĻææ)
Docker Apache Airflow
Collection of Pycharm IDE snippets to Flask framework
Learn the entire ETL process based on Spotify API data
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Solutions to regex challenges on HackerRank: https://www.hackerrank.com/domains/regex/re-introduction
Programming Problems that I have solved so far on HackerRank
đ HackerRank SQL track solutions
Notes on Apache Spark (pyspark)
Markdown to Docx converter
Office Automation by Using Pythonf (For Excel, Word, PPT and PDF .....)
Examples of applications and tool usage for Oracle Database
A walkthrough to guide you through the process of migrating from Oracle to Amazon Aurora for Postgresql
:rocket:Parse PDFs, Word and Excel documents. Read, Create, Merge/Combine, Extract data from office documents.