Dorian Beganovic's Projects
Maastricht university - semester 1 team project - solving the knapsack problem
Comparing the performance of Apache Spark to Python's Pandas in popular data analysis tasks such as joins, aggregates and where clauses
Demo of running AWS Lambda and AWS Kineses locally using Localstack
Jupyter notebook showing how to use pandas to parse JSON data from REST API into dataframes, clustering based on column names and values within the columns as wells as plotting the resulting heatmaps using Seaborn
Ensemble prediction based submission for a private Kaggle competition. Ranked 7/27.
How to install Cloudera quickstart
Graded lab exercises from the CS110x Big Data Analysis with Apache Spark online course on edx
Lots of code I wrote practicing for the algorithms and data structures course
A Java Swing GUI for building EEG data analysis workflows
Application for analyzing EEG data stored on Hadoop using Apache Spark
A Spring Boot server for managing Apache Spark jobs on a remote server
Applying deep learning techniques to predict prices of houses
Homework assignments for course Data Analysis at Maastricht University. Presented as HTML output of R markdown.
Guide on how to install, configure and administer Kerberos on Cloudera Manager as well as how to setup a client connection using Kerberos
KubeScale: A Hybrid Kubernetes Auto-Scaler
Platform for evaluating auto-scalers deployed on Kubernetes
The kubespray setup for running Kubernetes on Birkbeck VM
Developing a Lambda Architecture pipeline using Apache Kafka, Spark Structured Streaming, Redshift, S3, Python
A 3D mini golf game with an AI and a Physics engine
Applying advanced analytics on NBA shot log datasets using Python
Dark color theme for Pycharm by Sublime Text's Monokai Theme.