Coder Social home page Coder Social logo

Hi, I'm Harsh Solanki 👋

Machine Learning Deep Learning MLOps Full Stack Engineering AWS

I'm a Senior ML Engineer at Target Tech India, passionate about deep learning in computer vision and full-stack engineering. My expertise spans Python, AWS, Springboot, Java, Scala, Kafka, PyTorch, TensorFlow, Flask, FastAPI, React, and Docker. A professional introvert, I love to write, learn, and explore new technologies every day.

  • 🔭 I’m currently working on building a MLOps platform from scratch.
  • 🌱 I’m currently learning about advanced machine learning techniques and full-stack development.
  • 👯 I’m looking to collaborate on projects involving computer vision and natural language processing.
  • 🤔 I’m looking for help with deep learning in computer vision.
  • 💬 Ask me about machine learning, AWS, full-stack development, or anything tech-related.
  • 📫 How to reach me: LinkedIn

A Bit More About Me

I'm driven by a mathematical captivation and a curiosity to question and improve existing processes. My work in machine learning covers algorithms like Decision Trees, SVMs, Neural Networks, and much more. I'm also skilled in statistical techniques and experienced with a variety of data applications and databases.

🔍 My core competencies include:

  • Machine Learning and Deep Learning
  • Data Mining and Data Engineering
  • Natural Language Processing and Computer Vision
  • Data Analysis and Visualization

📈 I enjoy turning complex data into compelling stories through visualization, and I'm passionate about empathetic leadership and clear communication within teams.

Some of My Work

Lending Club Insights and Modeling Fraud Detection Don't Overfit Learning Deep Learning Automating Video Editing CoWIN Vaccine Availability

Harsh's GitHub Stats

Outside of work, I value meeting new people, engaging in deep conversations, and building lasting relationships. I believe in the power of empathy, gratitude, and networking to foster meaningful connections and achieve professional success.

Harsh Solanki's Projects

automating-video-editing icon automating-video-editing

Automated video editing to avoid seeing my relatives in a wedding video Are you lazy and self-centered like me to hate to go through hours of video footage just to have a few find yourself in it (or a person of interest)? Perhaps you're preparing a video for a girlfriend's birthday celebration, wanting to find happy memories of you both only. Maybe you are scouring security video footage, looking to see just yourself in case you wish to see what you might have forgotten or something. Or it may be that you want to produce a highlight reel of your favorite footballer's game.

aws_general_snippets icon aws_general_snippets

Some useful AWS libraries that I created or copied from the internet for day to day use.

basics icon basics

📚 Learn ML with clean code, simplified math and illustrative visuals.

cowin_vaccine_availability icon cowin_vaccine_availability

Python script to check the available slots for Covid-19 Vaccination Centers from CoWIN API in India. You can get information filtered by your PIN and you can get slots for today or upcoming 7 days.

dont_overfit icon dont_overfit

Avoid overfitting with a tiny sliver for training data Inspired by the Kaggle Don’t Overfit Challenge: Tiny Training Trial. The challenge; build the best performing model you can with a <5% training vs >95% test split with TF-IDF encodings on an Amazon multi-classification problem. With so many data hungry algorithms out there that take days or more to compute, we thought it’d be refreshing to go the other way and experiment with what can be done with extremely small and noisy datasets! Iterate and experiment with training times on the order of seconds. Our split is: Train: 1244 points Approach overview •Build Ensemble that includes multiple model categories: Logistic Regression, Random Forests, XGBoost, Adaboost, and Neural Networks. •Split the training dataset into K stratified folds. For each fold and model category, train a separate model using Grid Search. •Combine all models into ensemble using Averaging. I experimented with: 1.Which model categories to include in the ensemble 2.How many stratified folds to use: 1, 5, 10, 20, 40 3.How to build the ensemble: Averaging vs. Max voting 4.Oversampling techniques such as SMOTE and ADASYN: including models trained with SMOTE data in the ensemble worked for the Public leaderboad, but not for Private 5.Feature standardization: did not seem to improve anything. Lessons Learned Ensembling is the way to go, of course. Increasing the number of stratified folds improved performance. Improvements in training data accuracy (on validation set) did not necessarrily translate to better accuracies in the Public dataset. A prime example for this was the LR method that did not perform as well in the training validation accuracy compared to other methods such as NN. However, LR was an integral part of the overall Ensemble; whenever we removed it, the Public dataset accuracy ended up much worse. Ensembling using Averaging always worked better than Max voting. We kind of `overfitted' to the Public Leaderboard, i.e., our best performing model in Public was not the best in Private. Adding models trained with oversampled data, using either SMOTE or ADASYN, decreased accuracy in Private dataset. Gini impurity appeared to work better than Entropy for tree-based models.

eks-for-inference icon eks-for-inference

Create a cluster docker build -t eksctl-cli . docker run -it eksctl-cli bash aws configure ./create-cluster.sh

fast-ml-demo icon fast-ml-demo

If you have no time for production level deployment, this could be used for very quick demo purpose

fastapi-mvc icon fastapi-mvc

Developer productivity tool for making high-quality FastAPI production-ready APIs.

fastapi_profiler icon fastapi_profiler

A FastAPI Middleware of joerick/pyinstrument to check your service performance.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.