Coder Social home page Coder Social logo
Labriji Saad photo

labrijisaad Goto Github PK

followers: 41.0 following: 15.0 repos: 47.0 gists: 0.0

Name: Labriji Saad

Type: User

Company: Machine Learning Engineer

Bio: Passionate about developing, modeling πŸ“Š, and understanding the world around us 🌍 through the lens of Data πŸ’» and Machine Learning πŸ€–

Location: Paris, France

This image, generated with DALL-E, depicts a wide Moroccan landscape where ancient ruins and modern AI structures blend, symbolizing the harmony between the past and the future.

πŸ˜„ About Me:

Typing animation showing my roles and certifications

  • 🌱 Hello, I'm Saad, a 23-year-old based in France, with a deep passion for creating projects in the realms of Data and Artificial Intelligence.
  • πŸŽ“ I hold a Data Engineering degree from INPT.
  • πŸ’Ό Currently working as a Machine Learning Engineering Apprentice at AXA - Direct Assurance.
  • πŸ“š I'm also preparing for a Master's degree in Machine Learning and Data Science at Paris CitΓ© University.

πŸ… Certifications: (5x Azure Certified)

  • Azure Data Engineer
  • Azure Data Scientist
  • Azure Data Fundamentals
  • Azure AI Fundamentals
  • Azure Fundamentals

πŸ“š Contributions:

Contributed to repackaging and updating the GIT Clustering algorithm πŸ”„ based on insights from an arXiv paper, with implementation available in the GitHub repository πŸ“‚ and distribution through the TestPyPI Package πŸ“¦.

πŸ’Ό Work Experience:

  • Machine Learning Engineer / Data Scientist Apprenticeship at AXA - Direct Assurance, Paris, France (Ongoing) More details
  • Data Engineer / Data Scientist Internship at Chefclub, Paris, France (6 months) More details
  • Data Engineer Intern at Capgemini Engineering, Casablanca, Morocco (2 months)
  • Data Scientist Intern at AIOX Labs, Rabat, Morocco (2 months)
  • Web/Backend Developer Intern at DXC Technologies, Rabat, Morocco (2 months)

🌟 Top 4 Repositories

1. LLM RAG - Streamlit RAG Language Model App πŸ€–

Description: A Streamlit application leveraging a Retrieval-Augmented Generation (RAG) Language Model (LLM) πŸ€– with FAISS indexing πŸ—ƒοΈ to provide answers from uploaded markdown files. Users can upload documents πŸ“, input queries, and receive contextually relevant answers using Similarity Search πŸ”, showcasing a practical application of NLP technologies πŸ€–. The project is also equipped with a CI/CD pipeline πŸ”„ ensuring code quality & tests and simple deployment, and it supports containerization with Docker 🐳 for easy distribution and deployment.

  • Technologies/Tools: Streamlit, OpenAI API Models (LLMs, Embedding Models), FAISS, Python, Docker, CI/CD (Github Actions), Makefile, venv.

2. Kedro Energy Forecasting Machine Learning Pipeline 🏯

Description: A showcase of MLOps best practices using Kedro πŸ› οΈ, this repository shows the journey of Machine Learning Models from development to deployment πŸš€, utilizing Docker 🐳. Featuring straightforward training, evaluation, and deployment of models such as XGBoost Regressor, LightGBM πŸ’‘ and Random Forest Regeressor 🌳, it integrates built-in visualization πŸ“Š and logging πŸ“ for effective monitoring. Dive into the world of modular and scalable data pipelines with Kedro πŸ“š Kedro Documentation. The integration of an automated CI pipeline πŸ”„ with Github Actions ensures code quality βœ… and reliability πŸ”’.

  • Technologies/Tools: Docker, Kedro, MLOps, CI/CD (Github Actions), Machine Learning (XGBoost, Random Forest, LightGBM), Jupyter Notebook, Makefile, venv, Python.

3. Repackaged GIT Clustering Algorithm 🧩

Description: An upgraded version of the GIT Clustering algorithm πŸ”„, informed by insights from an arXiv paper πŸ“„, with easy deployment via TestPyPI πŸ“¦. Includes benchmarking notebooks πŸ“Š comparing it to state-of-the-art clustering algorithms πŸ”.

  • Technologies/Tools: Benchmarking, Poetry Packaging, PyPI Distributing, Machine Learning (K-means, DBSCAN, AgglomerativeClustering, Gaussian Mixture..), Jupyter Notebook, Makefile, venv, Python.

4. Monthly & Daily Energy Forecasting Docker API ⚑

Description: This repository πŸ“¦ houses an Energy Forecasting API ⚑ that uses Machine Learning to predict daily πŸ“… and monthly πŸ—“ energy consumption from historical data πŸ“Š. It's designed as a practical demonstration of a ML Engeineering/Data Science workflow, from initial analysis to a deployable API packaged with Docker 🐳.

  • Technologies/Tools: MLOps, Docker, API design, Machine Learning (XGBoost, Random Forest), Jupyter Notebook, Makefile, venv, Python.

πŸ™Œ Connect with Me:

LinkedIn Kaggle

Let's make something innovative together! Feel free to reach out for collaborations or discussions in Data & Artificial Intelligence!

πŸ”„ Last Updated:

  • README last updated on 17/04/2024. Regularly updated to reflect current work and interests.

Labriji Saad's Projects

analyzing-and-forecasting-a-time-series-with-python icon analyzing-and-forecasting-a-time-series-with-python

The main objective of this project was to study a time series to approach and manipulate abstract notions that we have seen during the course on time series class. Among the things we did in this project: the decomposition of the series, the study of each component, the stationarity test, the choice of the degree of the model, the learning of the latter and the forecast of future values ​​and much more...

apache-beam-k-means icon apache-beam-k-means

Implementing K-means clustering in sequential, streaming, and distributed formats using Apache Beam.

axa-direct-ml-apprenticeship icon axa-direct-ml-apprenticeship

Repository showcasing my Machine Learning Engineering Apprenticeship at AXA-Direct Assurance, contributing to the development and implementation of Machine Learning solutions.

car_renting_application_with_spring-boot icon car_renting_application_with_spring-boot

The car rental management system can be used by companies that manage a car rental project, note that this project only runs on a local machine, but can be modified to run on multiple machines simultaneously for more high efficiency and to serve the whole company.

chefclub-data-internship icon chefclub-data-internship

Repository showcasing my Data Engineer / Scientist internship at Chefclub, contributing to data infrastructure enhancement and fostering data-driven insights.

data-scientist-tools-pandas icon data-scientist-tools-pandas

In this hands-on training notebook, we'll see all the basics of the Pandas library used for data science/data analysis and machine learning tasks.

data-warehousing-in-azure-postgresql icon data-warehousing-in-azure-postgresql

In this repository, I address missing values in the Prosper dataset using advanced data cleaning techniques. The refined data is then seamlessly uploaded to a pre-configured Azure Postgres Database via a Jupyter Notebook, showcasing efficient data management and cloud database integration.

deepmoji icon deepmoji

State-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm etc.

devenvconfigurations icon devenvconfigurations

πŸš€ Centralized repository for my customized IDE settings, system configurations, and tech stack preferences. πŸ› οΈ

dfinder_console_mode_with_java icon dfinder_console_mode_with_java

Dfinder is a local file browser, which searches through files (according to the user's choice) and then generates a txt file containing the search results.

dimreduce-healthanalytics icon dimreduce-healthanalytics

A project showcasing the application of various dimensionality reduction techniques for visualizing and analyzing simulated health diagnostics data in 2D and 3D.

git icon git

density growing clustering

git-clustering icon git-clustering

Enhanced and Repackaged GIT Clustering: This repository offers an open-source, enhanced version of the GIT (Graph of Intensity Topology) clustering algorithm.

kedro-energy-forecasting-machine-learning-pipeline icon kedro-energy-forecasting-machine-learning-pipeline

This repo showcases a project that transforms ML model training into a simplified, production-ready Kedro Dockerized Pipeline. It emphasizes best MLOps practices, enabling easy training, evaluation, and deployment of models, including XGBoost, LightGBM and Random Forest, with built-in visualization and logging features for effective monitoring.

language-identifier-svm icon language-identifier-svm

Language identification script that can detect the language of a given text. Currently supports Swahili, Wolof, French, English, Arabic, and Dyula. Customizable language support.

llm-rag icon llm-rag

A Dockerized Streamlit app leveraging a RAG LLM with FAISS to offer answers from uploaded markdown files, deployed on GCP Cloud.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.