Coder Social home page Coder Social logo

cdeitrick / analyze_clusters Goto Github PK

View Code? Open in Web Editor NEW

This project forked from camillilab/analyze_clusters

0.0 1.0 0.0 3.21 MB

Camilli Laboratory's Genome-Wide Detection of Conservative Site-Specific Recombination in Bacteria

License: Other

Python 98.38% Shell 1.62%

analyze_clusters's Introduction

README for Camilli Laboratory's Genome-Wide Detection of Conservative Site-Specific Recombination in Bacteria

This is a series of scripts that extracts same-orientation reads (SOR) and 5'-clipped 
reads (sCLIP) from sequencing data and analyzes the patterns for detection of potential 
inversion sequences.

The ideology is described here: https://doi.org/10.1371/journal.pgen.1007332

To analyze your own data, first extract SOR and sCLIP reads using the bash script reads_analysis.sh.

Then, run the python executable prepfile.py to convert to csv and add the proper headers.

> python prepfile.py

Place these output files in the "SOR Data" and "sCLIP Data" folders within the
"analyze_clusters" folder. Name these files according to the reference accession number. For
example, if analyzing data on C.diff, the files may be named "FN545816_sor.csv" and
"FN545816_sclip.csv" in each respective folder. Example data for C.diff has been included.

Edit accession_list.txt to include the accession numbers for which inversion analysis
is to be performed.

Finally, run the detection script using python:
> python detect_inversion_clusters.py

This requires Python 3.6+ and the following packages:

- Matplotlib
- BioPython
- Numpy
- Seaborn
- ReportLab

If you use these scripts in work contributing to a scientific publication, 
we ask that you cite our application. Our work is licensed generously.

If you have any questions, comments, ideas, or cool gumbo recipes, please reach us at:
[email protected]
[email protected]

Thanks and happy hunting!

analyze_clusters's People

Contributors

camillilab avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.