Coder Social home page Coder Social logo

landrylab / evans-yamamoto_et_al_2021 Goto Github PK

View Code? Open in Web Editor NEW
2.0 2.0 0.0 28.64 MB

Jupyter notebooks for BFG-PCA related simulation and analysis.

License: Other

Jupyter Notebook 23.13% R 27.06% Python 49.81%
synthetic-biology systems-biology protein-protein-interaction dna-sequencing dna-barcode barcode-fusion-genetics

evans-yamamoto_et_al_2021's Introduction

Barcode Fusion Genetics related codes

This page explains how to install and execute the BFG-PCA codes. If you have any questions, please don't hesitate to post a question in the Discussions.

Please make sure you have appropriate Python and pip before starting.

Python version >=3.5
pip    version >= 1.1.0

Dependencies :

numpy  version >=1.19 
tqdm   version >=2.2.4

To install these pakcages, first clone this repository by

git clone https://github.com/DanYamamotoEvans/BFG-PCA.git

Next, go to the location of the BFG-PCA folder in the terminal, and install the dependencies by

pip install .

Other core programs to install:

pip install jupyterlab
  • Commandline BLAST+

Follow the instruction manual for installation. Set the PATH of the binary file.

Execute the following to see if installation is complete.

blastn -help

Overview

This script was built to perform experimental plans and data analysis for Barcode Fusion Genetics screenings. There are four main steps in this suit, which I have prepared jupyter-notebooks for each.

  • Monte-Carlo simulation
  • Barcode calling
  • Normalization
  • Performance measure
  • (Visualization, You will need to install R.)

Monte-carlo simulation of BFG screening proccess

Since BFG screenings have multiple sampling steps while handling a complex pool of strains, we suimulate the sampling process with a Monte-Carlo simulation. This notebook follows the procedures of BFG screenings, and allows the user to estimate the nessesary paramaters for sampling.

Barcode calling

This notebook creates the BLAST databse based on your barcode database file, and performs BLAST on the fastq files you provide. The results will be parsed to combine the count data. Please look at the wiki for input file specifications.

Normalization

This notebook normalizes the count data, and compute raw PPI signals based on the barcode counts in the control condition and auto-activity level. It will also output some csv files for plotting stats.

Performance measure

Based on the normalized scores, this notebook computes the agreement against the BioGRID database. It willcomute the agreement for various percentile values of the PPI scores generated from multiple replicates.

Visualization

This notebook will help you plot the data you obtained from the BFG screening. Please go to the visualization page in the wiki for more details on the plots.

References

Citation

Pleae cite Evans-Yamamamto et al, 2022, Nucleic Acids Research if you use this repository.

evans-yamamoto_et_al_2021's People

Stargazers

 avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.