Anuj Chauhan's Projects
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
To explore the functionalities of Airflow
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
A Data Engineering & Machine Learning Knowledge Hub
Trying to integrate async with the Gearman job server.
Azkaban workflow manager.
Fancy stream processing made operationally mundane
A repo exploring C
Distributed Task Queue (development branch)
Tried many features provided by Celery, a distributed task queue
Under heavy development now: Real-time Celery monitoring with ASGI 3.0+ Django 3.1
Exploring the ETL orchestration tool Dagster
Roadmap to becoming a data engineer in 2021
A list of useful resources to learn Data Engineering from scratch
Code for Data Pipelines with Apache Airflow
Examples for running Debezium (Configuration, Docker Compose files etc.)
Diabetic retinopathy (DR) is an eye disease graded on a scale of 0-4. My task was to create an automated analysis system capable of assigning a score on this scale. High-resolution retina images were available; they were cropped, resized, and converted to 50% grayscale using OpenCV and pandas, and data augmentation methods were used to enlarge the training set. Several convolutional neural networks (CNNs) with different numbers of hidden layers were trained using TensorFlow. The best CNN achieved a training accuracy of 83.80% and a test accuracy of 79.34%.
A direct message app
EDA of House Prices dataset
minimo-eng - Minimalist theme for Hugo tuned for engineering content, based on Minimo
An automation that sends a custom email to the Google Form submitter.