Coder Social home page Coder Social logo

Hi there πŸ‘‹

I'm Shamik and I enjoy building solutions to problems, mostly through programming (and occasionally with WD-40). I work as a Lead Data Scientist building machine learning applications for detecting and anonymizing PII and PHI in data breaches. I am also a part-time contributor to the BigScience Workshop, the BigBIO effort and the BigCode Project from πŸ€—. In addition, I am working with PIISA, a collection of data scientists, software developers and lawyers to establish an open standard for PII protection that can be used across the globe. You can follow our efforts here. I also like to cook πŸ‘¨β€πŸ³

β”œβ”€β”€ Interests
β”‚   β”œβ”€β”€ Natural Language Processing
β”‚   β”œβ”€β”€ Explainable Machine Learning
β”‚   β”œβ”€β”€ AI Ethics
β”‚   β”œβ”€β”€ System Design
β”‚   └── PII Anonymization
β”œβ”€β”€ Occupations
β”‚   β”œβ”€β”€ Software Engineer
β”‚   β”œβ”€β”€ Graduate Research Assistant
β”‚   β”œβ”€β”€ Lead Data Scientist
β”‚   └── Senior Researcher
β”œβ”€β”€ Locations
β”‚   β”œβ”€β”€ Kolkata, India
β”‚   β”œβ”€β”€ Boston, MA, USA
β”‚   β”œβ”€β”€ Tallahassee, FL, USA
β”‚   └── Leeds, England
└── Book Suggestions
    β”œβ”€β”€ Fiction
    β”‚   β”œβ”€β”€ The Three Body Problem - Cixin Liu
    β”‚   β”œβ”€β”€ All the Light we cannot see - Anthony Doerr
    β”‚   └── Purple Hibiscus - Chimamanda Ngozi Adichie
    β”œβ”€β”€ Non-Fiction
    β”‚   β”œβ”€β”€ Algorithms of Oppression - Safiya Umoji Noble
    β”‚   β”œβ”€β”€ Braiding Sweetgrass - Robin Wall Kimmerer
    |   β”œβ”€β”€ Chaos Machine - Max Fisher
    |   β”œβ”€β”€ Viral Justice - Ruha Benjamin
    β”‚   └── Weapons of Math Destruction - Cathy O. Neill
    └── Cookbooks
        β”œβ”€β”€ The Food Lab - J. Kenji Lopez-Alt
        β”œβ”€β”€ Mi Cocina - Rick Martinez
        └── Dessert Person - Claire Saffitz
Projects
  1. Scientific Title Generator
  2. BigBIO dataloaders
  3. MIT 6.006 Solution Notebooks
Publications
  1. Explaining AI for Malware Detection: Analysis of Mechanisms of MalConv
  2. PhD Thesis: Towards Explainability in Machine Learning for Malware Detection
  3. Static Malware Modeling and Detection using Topic Models
  4. BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing
  5. The bigscience roots corpus: A 1.6 tb composite multilingual dataset

P.S. The tree was built using Rich

Shamik Bose's Projects

Shamik Bose doesn’t have any public repositories yet.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.