Coder Social home page Coder Social logo

kinodata's People

Contributors

corey-taylor avatar ijpulidos avatar jaimergp avatar mbackenkoehler avatar t-kimber avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

kinodata's Issues

Multiple columns per source

Do not hardcode priority of data sources, it'd be something like:

Kinases,KinHub,KLIFS,PKinFam
Kinase A,True,True,True
Kinase B,False,True,False
Kinase C,False,False,True
....

Add entries to csv

List of small things that could be added to the table

  • compound id: allows to refind or compare compounds (not just smiles)
  • max phase: would allow us to generate our own kinase inhibitor set and host it here

ChEMBL29

A new version of ChEMBL is out. Should the zip file be adapted to v.29?

Chembl 33 Update

Much of the necessary code is merged.

The release draft is here.

Remaining tasks

Code

  • update the EGFR notebook (references to the current release; the last release has the non-updated version...)
  • add data/human_kinases_and_chembl_targets.chembl_33.csv
  • Both changes are done with merging #20

Release artifacts

  • package the code as kinodata-0.5
  • create the activities artifacts
  • create the EGFR activities artifacts

Release notes

  • get the numbers for the table in the release post
  • mention removal of kinase.com
  • mention changed Uniprot query

Discrepancies in summary data

The summary data in the last release (https://github.com/openkinome/kinodata/releases/tag/v0.2) differs from what reported by the notebooks and in the data files that are outputted by them:

Dataset Non-curated Curated
ChEMBL 27 182 223 148 836
ChEMBL 28 199 238 159 978

vs the number of unique records in the output .csv's and reported in the notebooks:

Dataset Non-curated Curated
ChEMBL 27 217 612 174 238
ChEMBL 28 237 336 186 972

The notebooks appear to run fine so I have added the data from the notebooks/output files themselves to the latest release (https://github.com/openkinome/kinodata/releases/tag/v0.3). But having looked at the data directly, I can't establish where the data in the first table comes from.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.