Coder Social home page Coder Social logo

volkamerlab / kinodata-3d-affinity-prediction Goto Github PK

View Code? Open in Web Editor NEW
1.0 1.0 3.0 34.61 MB

DL models to test added value of using generated complex data for affinity prediction

License: MIT License

Shell 0.12% Dockerfile 0.69% Python 80.23% Jupyter Notebook 18.95%

kinodata-3d-affinity-prediction's Introduction

Kinodata-3D dataset and models

This repository contains a pyg-based interface to the Kinodata-3D dataset and the code used to train and evaluate the models presented in the Kinodata-3D publication

Installation

We currently only support installation from source.

(1) Clone this repo

(2) Set up Python environment

Use mamba (or conda) to set up a Python environment,

mamba env create -f environment.yml
mamba activate kinodata

and install this package in editable/develop mode

pip install -e .

(3) Obtain raw data

The raw data, docked poses and kinase pdb files, can be obtained from Zenodo. After downloading the archives, extract them in the root directory of this repository.

cd PATH_TO_REPO
unzip ...

See the Kinodata-3D repo for more information and the code used to generate the raw data.

General usage

Reproducing results

(1) Acquire exact dataset and data split versions

If you intend to reproduce our results, we strongly recommend that you use our preprocessed version of the dataset and corresponding data splits.

(2) Model training and evaluation

You can use the shell script condor/train_generic.sh to train and test a model in one run, on one particular split. Create a file wandb_api_key in the root directory of this repository and paste your wandb API key, if you want to sync results to Weight & Biases. Otherwise, run wandb disable in a terminal with the conda environment activated, before training.

The script requires the following positional arguments

  1. Base python script, one of "train_dti_baseline", "train_sparse_transformer"
  2. Split type, i.e. one of "scaffold-k-fold", "random-k-fold", "pocket-k-fold".
  3. Integer RMSD cutoff for the dataset, e.g. 2, 4, or 6 as used in the publication.
  4. A .yaml file that contains additional configuration parameters, e.g. model hyperparameters.
  5. The integer index of the cross-validation fold used for testing.

For instance,

./condor/train_generic.sh train_dti_baseline scaffold-k-fold 2 dti.yaml 0

trains and tests the DTI baseline on the scaffold-5-fold (default k is 5) split of the dataset containing all complexes with predicted RMSD <= 2 Angstroms. Folds 1-4 are used for training and fold 0 for testing.

kinodata-3d-affinity-prediction's People

Contributors

joschka-gross avatar mbackenkoehler avatar andreavolkamer avatar mikemhenry avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.