mlfromscratch's Introduction

Hi there 👋

I'm Shamik and I enjoy building solutions to problems, mostly through programming (and occasionally with WD-40). I work as a Lead Data Scientist building machine learning applications for detecting and anonymizing PII and PHI in data breaches. I am also a part-time contributor to the BigScience Workshop, the BigBIO effort and the BigCode Project from 🤗. In addition, I am working with PIISA, a collection of data scientists, software developers and lawyers, to establish an open standard for PII protection that can be used across the globe. You can follow our efforts here. I also like to cook 👨‍🍳

├── Interests
│   ├── Natural Language Processing
│   ├── Explainable Machine Learning
│   ├── AI Ethics
│   ├── System Design
│   └── PII Anonymization
├── Occupations
│   ├── Software Engineer
│   ├── Graduate Research Assistant
│   ├── Lead Data Scientist
│   └── Senior Researcher
├── Locations
│   ├── Kolkata, India
│   ├── Boston, MA, USA
│   ├── Tallahassee, FL, USA
│   └── Leeds, England
└── Book Suggestions
    ├── Fiction
    │   ├── The Three-Body Problem - Cixin Liu
    │   ├── All the Light We Cannot See - Anthony Doerr
    │   └── Purple Hibiscus - Chimamanda Ngozi Adichie
    ├── Non-Fiction
    │   ├── Algorithms of Oppression - Safiya Umoji Noble
    │   ├── Braiding Sweetgrass - Robin Wall Kimmerer
    │   ├── The Chaos Machine - Max Fisher
    │   ├── Viral Justice - Ruha Benjamin
    │   └── Weapons of Math Destruction - Cathy O'Neil
    └── Cookbooks
        ├── The Food Lab - J. Kenji López-Alt
        ├── Mi Cocina - Rick Martínez
        └── Dessert Person - Claire Saffitz
Projects
  1. Scientific Title Generator
  2. BigBIO dataloaders
  3. MIT 6.006 Solution Notebooks
Publications
  1. Explaining AI for Malware Detection: Analysis of Mechanisms of MalConv
  2. PhD Thesis: Towards Explainability in Machine Learning for Malware Detection
  3. Static Malware Modeling and Detection using Topic Models
  4. BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing
  5. The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset

P.S. The tree was built using Rich
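
For readers curious how a tree like the one above is laid out, here is a minimal sketch in plain Python. It is not the Rich code used for this profile (Rich provides its own `rich.tree.Tree` class for this); the `render_tree` helper and the trimmed `profile` data are hypothetical, written only to illustrate the branch and prefix logic with the standard library.

```python
def render_tree(node, prefix=""):
    """Return the lines of a box-drawing tree for a nested dict/list.

    Hypothetical helper: a dict maps branch labels to subtrees,
    and a list holds leaf labels.
    """
    if isinstance(node, dict):
        items = list(node.items())
    else:
        items = [(label, None) for label in node]

    lines = []
    for index, (label, children) in enumerate(items):
        is_last = index == len(items) - 1
        # The last sibling gets a corner; the others get a tee.
        lines.append(prefix + ("└── " if is_last else "├── ") + label)
        if children:
            # Below a last sibling the vertical guide line stops.
            child_prefix = prefix + ("    " if is_last else "│   ")
            lines.extend(render_tree(children, child_prefix))
    return lines


# A trimmed-down subset of the profile tree, for illustration only.
profile = {
    "Interests": ["Natural Language Processing", "PII Anonymization"],
    "Book Suggestions": {
        "Fiction": ["Purple Hibiscus - Chimamanda Ngozi Adichie"],
        "Cookbooks": ["Dessert Person - Claire Saffitz"],
    },
}

print("\n".join(render_tree(profile)))
```

The only state carried between levels is the prefix string, which is why entries under a last branch are indented with spaces while entries under earlier branches keep the `│` guide.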
