dmschauer Goto Github PK
Name: Dominik
Type: User
Name: Dominik
Type: User
Amazon Redshift Cookbook, Published by Packt
A curated list of data engineering tools for software developers
AWS CDK code written in Python to create an AWS Lambda function (also in Python) that does a basic ETL transformation. The project shows how to add pytest unit tests to the Lambda function within the CDK project.
Use AWS CDK (Python) to periodically call Spotify APIs and store artist data using serverless services (Lambda, API-GW, DynamoDB, S3, EventBridge)
Use AWS CDK (Python) to create a Twitter Bot that tweets a poem once a day. Uses Lambda, API-GW and DynamoDB.
Code for our AWS Certified Developer Associate course
A serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda.
Covers a use-case on how to setup local AWS Glue and Apache Spark environment to perform automated unit testing using localstack. Code+Tests = examples + help on getting started with local setup. The setup script helps setup the required environment that can be used as a base to write codebase and tests required for specific app requirements.
Build, Test and Deploy ETL solutions using AWS Glue and AWS CDK based CI/CD pipelines
Code for a blog article. Main reference is https://docs.aws.amazon.com/glue/latest/dg/interactive-sessions.html
Demo code to illustrate the execution of PyTest unit test cases for AWS Glue jobs in AWS CodePipeline using AWS CodeBuild projects
Code for a blog article. Mainly copied from https://docs.aws.amazon.com/glue/latest/dg/aws-glue-programming-etl-libraries.html#develop-local-python
Code for a blog article. Main reference is: https://medium.com/@dominikschauer/professional-aws-glue-pyspark-development-mocking-aws-services-for-unit-tests-e6c222e95933
Example of AWS Glue Jobs and workflow deployment with terraform in monorepo style. Code here supports the miniseries of articles about AWS Glue and python.
I did a simple test to see how deploying a machine learning model on AWS Sagemaker and thus turning it into an API works. Since scikit-learn models require less dependencies than e.g. TensorFlow models I went with them for this test. To do so I used a tutorial.
A Hugo theme built using Yahoo's Pure CSS
Minimalistic Jekyll theme that automatically supports dark and light modes for blogs
Supplementary Materials for the dbtlearn.com Udemy course
Jekyll Theme
Dynamically generate Apache Airflow DAGs from YAML configuration files
This is a template you can use for your next data engineering portfolio project.
Data Engineering with Spark and Delta Lake
Data Engineering with AWS, Published by Packt
Data Engineering with Python, published by Packt
Code for Data Pipelines with Apache Airflow
Build DataOps platform with Apache Airflow and dbt on AWS
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.