Coder Social home page Coder Social logo

tiao553 / bigdata-k8s-kafka Goto Github PK

View Code? Open in Web Editor NEW
1.0 1.0 0.0 13.06 MB

In this repository i will allocate my architecture project, all on a kubernetes cluster.

HCL 0.85% Shell 4.23% Python 30.44% Mustache 45.55% Makefile 0.07% Smarty 18.87%
kafka kuberbetes big-data ksqldb pinot minio

bigdata-k8s-kafka's Introduction

📌 About me

Well, I'm 22 years old, living in Ipanema, Minas Gerais, Brazil.

I'm graduating in Control and Automation Engineering and I've always been passionate about technology but the career opportunities always took me to another path, until I finally decided to drop everything and go after my dream. I am currently working as a Data Scientist at precato, a fintech company focused on the purchase of precatórios, where I work directly with python projects and test automation and model implementation.

Currently I invest my time in learning about data engineering, an area I fell in love with.

🛠️ Programing languages


Projects about the Data Science

  • Analyzing the Violence in Rio de Janeiro [PT-BR]: link

    A descriptive analysis was made with the objective of understanding which were the main influencers of the high violence observed in Rio de Janeiro.

  • Airbnb Data Analysis, Buenos Aires, Argentina [PT-BR]: link

    Descriptive analysis on the data provided by Airbnb in order to understand which neighborhoods are the best to rent according to price and location.

  • Features engineering, learn interactively [EN-US]: link

    One way to improve the performance of a model is feature enginerring. In this article I show how to apply this technique in a simple interactive way.

Projects about the Data engineer

  • [AWS] Building a lambda pipeline with CDK [PT-BR] : Link

    This pipeline architecture is widely used in cases where the data is integrated all in one zone. In addition, it has 3 storage stages (bronze, silver, gold). Curious? Go to the repositor at link above.

  • [GCP] Hackathon A3data [PT-BR] : link

    A very cool challenge that contemplated the treatment and analysis of a large set of data. In this repository you can see my project carried out in this competition.

  • Data Collection Pipeline in Yahoo Finance [PT-BR] : link

    Back to the origins. I entered the market with many cloud-oriented solutions. With that in mind I worked on this project creating instances of Hadoop, spark, and Hive to get an experience with these tools.


  • Machine Learning and Data Science with Python of A to Z: Certificate

    With this course I was able to learn about a good part of a data scientist's pipeline considering from data preparation to application of supervised to unsupervised algorithms. In the course it was also proposed several use cases.

  • TensorFlow: Machine Learning and Deep Learning with Python: Certificate

    Theory and practice of how to build artificial neural networks to solve real problems of the day with convolutional neural networks, recurrent neural networks, autoencoders, and robust generative adversarial networks using TensorFlow were proposed in the course.

  • SQL and NoSQL Databases from basic to advanced: Certificate

    With course I learned to use different SQL and NoSQL Database Management Systems, model relational databases applying the five normal ways. [MySQL, PostgreSQL, SQLite, MongoDB,Redis, CouchDB,and Firehouse]

  • HOWBootcamps Engenharia de dados: Certificate

    In this bootcamp it was proposed to build a pipeline with lambda architecture. Where we use the infrastructure as AWS CDK code, create a datalake and a datawarehouse, process data with databricks, orchestrate tasks with airflow and automate tests and processes.


✅ Get in contact with me! ✉️

author

bigdata-k8s-kafka's People

Contributors

tiao553 avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.