Coder Social home page Coder Social logo

sajfb / isotopic-profile-deconvolution-chromatogram-ipdc-algorithm Goto Github PK

View Code? Open in Web Editor NEW
1.0 1.0 0.0 1017 KB

The IPDC algorithm was developed to screen for a wide variety of organic compounds in high-resolution mass spectrometry (HRMS) data acquired from instruments with resolution power as low as 22 000 fwhm.

MATLAB 100.00%
environmental-monitoring isotopic-distribution-calculations gc-ms lc-ms mass-spectrometry apgc environmental-toxicology

isotopic-profile-deconvolution-chromatogram-ipdc-algorithm's Introduction

CRAN stats Python PyTorch LinkedIn Google Scholar

I'm an award-winning data scientist bridging cheminformatics and metabolomics focusing on small molecule discovery and mass spectrometry data sciences (see my award news from Metabolomics Association of North America (MANA) and my presentation details here).

I've crafted multiple computational pipelines designed for untargeted mass spectrometry data processing across diverse research domains including metabolomics, lipidomics, exposomics, and environmental studies. My software development philosophy emphasizes on maximal automation, highest precision, multi-platform compatibility, and user-friendly interfaces to minimize lab-based experiments.

Thriving in my current role at Syntekabio, but always driven to pursue cutting-edge research for next-generation AI drug design and discovery.

Completed projects

Mass Spectrometry Data Processing Workflows at the Integrated Data Science Laboratory for Metabolomics and Exposomics

image description

  • IDSL_MINT (Mass INTerpretator) is deep learning framework to further interpret unannoated mass spectrometry data using deep cheminformatics analyses.

  • IDSL.ICA (Integrated Compound Annotation) is a full-scale annotation workflow to facilitate integration of metabolomics data for multi-omics analyses. (pending release ... )

  • IDSL.IPA (Intrinsic Peak Analysis) is a chromatographic peak-picking software package which is able to screen at lowest signal intensities (S/N > 2). IDSL.IPA is able to pair isotopologues with a fixed distance (e.g. ΔC = 13C - 12C = 1.003354835336 Da), to detect chromatographic peaks via geometric analysis, to correct retention time drifts using endogenous index markers, to align peaks (m/z-RT pairs) across population size studies (N > 200), to fill gaps on the aligned peak tables, to annotate peaks, and to visualize extracted ion chromatograms (EICs) and total ion chromatograms (TICs).

  • IDSL.FSA (Fragmentation Spectra Analysis) is a computational fragmentation annotation workflow to annotate .msp (mass spectra format) and .mgf (Mascot generic format) fragmentation data files rapidly via measurement of spectral entropy and/or cosine similarity even when precursor values are not available nor reliable. IDSL.FSA also may be employed to process bottom-up proteomics data.

  • IDSL.CSA (Composite Spectra Analysis) is a pipeline to deconvolute fragmentation spectra from Data Dependent Acquisition (DDA), and various Data-Independent Acquisition (DIA) methods such as SWATH-MS, MSE, and All-Ion Fragmentation (AIF) analyses.

  • IDSL.UFA (United Formula Annotation) is a computationally enhanced pipeline to annotate chromatographic peaks with molecular formula using an isotopic profile matching approach. IDSL.UFA only requires MS1 level data which is especially beneficial when MS/MS data are not available. The IDSL.UFA pipeline can screen for isotopic profiles of up to 108 molecular formulas using a computationally efficient algorithm without any memory complications.

  • IDSL.UFAx (exhaustive UFA) was developed to annotate chromatographic peaks using an exhaustive chemical enumeration-based approach. This package can perform elemental composition calculations using the following 15 elements: C, B, Br, Cl, K, S, Si, N, H, As, F, I, Na, O, and P. IDSL.UFAx is also able to screen for isotopic profiles of 1027 molecular formulas without any memory complications; however, IDSL.UFAx is not computationally as fast as IDSL.UFA.

  • IDSL.SUFA is a simplified version of the IDSL.UFA package to calculate isotopic profiles and adduct formulas from molecular formulas with no dependency on other R packages for online tools such as isotopic profile calculator. The IDSL.SUFA package also provides functions to process user-defined adduct formulas.

  • IDSL.NPA (Nominal Peak Analysis) is a pipeline for processing nominal mass spectrometry data to create and annotate .msp files for untargeted MS/MS workflows.

  • IDSL.MXP (Mass Spectrometry Parser) is a light and fast parser for mzML/mzXML/netCDF mass spectrometry data files. IDSL.MXP is especially a proven tool to read corrupted mass spectrometry files.

Computational mass spectrometry pipelines for environmental cheminformatics projects as part of my doctoral research

  • An IPDC (Isotopic Profile Deconvoluted Chromatogram) algorithm to screen biologically complex environmental matrices for unknown contaminants using chemometric methods. The IPDC algorithm was successfully employed in five different projects during my PhD.

isotopic-profile-deconvolution-chromatogram-ipdc-algorithm's People

Contributors

sajfb avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.