miguruiz Goto Github PK
Name: Miguel Ruiz
Type: User
Company: Expedia
Bio: Data Engineer @ Expedia. Scala, Python, Spark
Location: Madrid, Spain
Name: Miguel Ruiz
Type: User
Company: Expedia
Bio: Data Engineer @ Expedia. Scala, Python, Spark
Location: Madrid, Spain
Created an ETL on airflow that extracts files from S3, loads them into staging tables in Redshift, then transforms the data and loads it again to Redshift
Technical exercise proposed for Amadeus interview process.
Technical exercise proposed for Amadeus interview process. (First contact with Scala & Spark)
Playing around with Apache Beam in Python
Assignment for data engineering candidates to create a tool to provide commercial insights based on booking data
Project files for Intro to DevOps class
Containerized frontend - using Docker - that gets tested everytime there is a PR opened/modified, and once the tests pass and the PR is merged to master, deploys the app to AWS Elastic Beanstalk. The automation is done through Github Actions.
Drafting an ETL on jupyter notebooks that cleans and models us immigration data
Add support for Markdown to Atom (including Github flavored, Markdown Extra, CriticMark, YAML/TOML front-matter, and R Markdown), and smart behavior to lists.
Utilizing time-series prediction library prophet, NLP, and other Machine Learning algorithms to predict the demand of a factory.
Custom version of the aliases plugin for enriched, combined with common-aliases
Parser for rdf data to property
Line-follower robot with speech recognition to identify where to stop. Developed for Lego NXT
Testing pyspark on AWS EMR with sparkify dataset
ETL from data sources stored in JSON to AWS Redshift database. Written in Python
ETL from data sources stored in JSON to PostgreSQL database. Written in Python
Presentation about Spark
Apache Spark Connector for SQL Server and Azure SQL
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.