Coder Social home page Coder Social logo

stanford-ribonanza-rna-folding's Introduction

Stanford-Ribonanza-RNA-Folding

Description:

Homepage

Ribonucleic acid (RNA) is essential for most biological functions. A better understanding of how to manipulate RNA could help usher in an age of programmable medicine, including first cures for pancreatic cancer and Alzheimer’s disease as well as much-needed antibiotics and new biotechnology approaches for climate change. But first, researchers must better understand each RNA molecule's structure, an ideal problem for data science.

Recent efforts to predict RNA structure have run into a number of challenges: (1) a paucity of training data, (2) lack of intellectual and computational power, and (3) difficulties in rigorously splitting training and test data. Can a Kaggle competition close these gaps?

Towards sourcing a large and diverse collection of RNA molecules for data acquisition, the host Das laboratory brings together scientists and gamers to solve puzzles and invent medicine in the Eterna project. The Eterna community has previously unlocked new scientific principles, designed thermodynamically-optimized riboswitches, and revealed RNA degradation patterns for improving the shelf life of mRNA vaccines, which formed the basis of the Kaggle OpenVaccine challenge. Now the Eterna project is creating diverse sequences that are predicted to form complex structures.

The data for this new Ribonanza competition are experimental measurements of the chemical reactivity at each position of an RNA molecule. These data are exquisitely sensitive to the structure – or multiple structures – that each RNA forms in the test tube. An algorithm that could perfectly predict these chemical reactivities would need to have an implicit ‘understanding’ of RNA structure to do so. Such an oracle could be then utilized to predictively model structures of novel RNA molecules. As in the OpenVaccine competition, the majority of the private leaderboard data will be collected in parallel with your prediction efforts -- no one will know the answers until the competition closes!

To reiterate the potential impact of your participation, an accurate model that solves the RNA structure prediction problem could be a game changer for medical researchers who are trying to identify unique RNA-based drug targets in the many bacterial, viral, neurological, and cancer genes that remain undruggable at the protein level. In addition, accurate RNA structure prediction is needed to predictively design RNA-based medicines such as mRNA vaccines and CRISPR gene therapeutics that promise to treat nearly all human disease. More than its medical implications, RNA molecules underlie and can even dominate core biological processes for all of life, including the very first forms of life on Earth and the plants and marine organisms that fix most of the carbon on our planet now. A full understanding of life requires a full, predictive understanding of RNA.

Data:

In this competition, you will be predicting the reactivity of an RNA sequence to two chemical modifiers DMS and 2A3. These data can be measured efficiently through a mutational profiling (MaP) experiment read out by high-throughput sequencing and positions that are protected from chemical modification are likely to be forming base pairs or other kinds of RNA structure.

Learn more about DMS

image

learn more about 2A3

TFRecords dataset version can be find here

stanford-ribonanza-rna-folding's People

Contributors

jeanjulesbigeard avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.