schlosslab / hannigan_crcvirome_mbio_2018

Investigating the gut virus communities associated with colon cancer.

License: MIT License

Shell 0.53% R 1.57% Makefile 0.26% Python 0.05% Perl 0.42% TeX 0.03% PostScript 97.14%
cancer-virome microbiome phages reproducible-paper

hannigan_crcvirome_mbio_2018's People

Contributors

ecogenomix, microbiology, pschloss

hannigan_crcvirome_mbio_2018's Issues

Add mean summary stats

Add simple bar graphs of the mean contamination levels, with standard error bars, so they can be compared to previous studies. Simple as that.
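As a rough sketch of the summary statistics this issue asks for, the mean and standard error of the mean could be computed per group as below. The group names and numbers are made up for illustration; they are not data from this repository.

```python
import math
import statistics


def mean_and_sem(values):
    """Return (mean, standard error of the mean) for a list of numbers."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two values to estimate a standard error")
    mean = statistics.mean(values)
    # Sample standard deviation divided by sqrt(n) gives the standard error.
    sem = statistics.stdev(values) / math.sqrt(n)
    return mean, sem


# Hypothetical contamination levels (illustrative numbers only).
groups = {
    "group_a": [2.0, 4.0, 6.0],
    "group_b": [1.0, 1.0, 4.0],
}

summary = {name: mean_and_sem(vals) for name, vals in groups.items()}
for name, (mean, sem) in summary.items():
    print(f"{name}: mean={mean:.3f} sem={sem:.3f}")
```

The resulting (mean, standard error) pairs are the bar heights and error bar lengths that a plotting library would need.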

Confirm that use of BLAST's `-max_target_seqs` is intentional

Hi there,

This is a semi-automated message from a fellow bioinformatician. Through a GitHub search, I found that the following source files make use of BLAST's -max_target_seqs parameter:

Based on the recently published report, "Misunderstood parameter of NCBI BLAST impacts the correctness of bioinformatics workflows", there is a strong chance that this parameter is misused in your repository.

If the use of this parameter was intentional, please feel free to ignore and close this issue, but I would highly recommend adding a comment to your source code to notify others about this use case. If this is a duplicate issue, please accept my apologies for the redundancy, as this simple automation is not smart enough to identify such issues.

Thank you!
-- Arman (armish/blast-patrol)
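One way to avoid relying on `-max_target_seqs` to pick the best hit is to let BLAST report several hits per query and choose the best one downstream. The sketch below assumes the default 12-column tabular output (`-outfmt 6`: qseqid, sseqid, pident, length, mismatch, gapopen, qstart, qend, sstart, send, evalue, bitscore) and keeps the highest-bitscore hit per query. The function name and the example rows are hypothetical, and this is not necessarily how this repository handles its BLAST output.

```python
def best_hits(tabular_lines):
    """Keep the highest-bitscore hit per query from BLAST tabular (outfmt 6) lines."""
    best = {}
    for line in tabular_lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        query, subject = fields[0], fields[1]
        evalue, bitscore = float(fields[10]), float(fields[11])
        current = best.get(query)
        # Replace the stored hit only when the new one has a higher bitscore.
        if current is None or bitscore > current[2]:
            best[query] = (subject, evalue, bitscore)
    return best


# Made-up rows in the default 12-column layout.
example = [
    "q1\tsA\t90.0\t100\t10\t0\t1\t100\t1\t100\t1e-20\t150",
    "q1\tsB\t99.0\t100\t1\t0\t1\t100\t1\t100\t1e-50\t190",
    "q2\tsC\t95.0\t80\t4\t0\t1\t80\t5\t84\t1e-30\t120",
]
hits = best_hits(example)
print(hits)
```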

Run virome metagenome library through Quant-IT

Quantify the DNA for all of the samples in the virome metagenome library (second library that Charlie prepped) using the Quant-IT system.

Samples need to be run in duplicate to account for potential variation in the system.

Once this is done we can pool the samples and ship them off for sequencing.
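A minimal sketch of how duplicate readings might be summarized: average the two replicates per sample and flag samples whose replicates disagree by more than a chosen fraction. The 10% threshold, the sample names, and the concentrations are assumptions for illustration only.

```python
def summarize_duplicates(readings, max_rel_diff=0.10):
    """Average duplicate concentration readings (ng/uL) and flag noisy samples.

    readings maps a sample name to a (replicate_1, replicate_2) tuple.
    max_rel_diff is an assumed tolerance, not a value from the original issue.
    """
    summary = {}
    for sample, (rep1, rep2) in readings.items():
        mean = (rep1 + rep2) / 2
        # Relative difference between the duplicates, scaled by their mean.
        rel_diff = abs(rep1 - rep2) / mean if mean > 0 else float("inf")
        summary[sample] = {
            "mean": mean,
            "rel_diff": rel_diff,
            "flag": rel_diff > max_rel_diff,
        }
    return summary


# Hypothetical duplicate readings.
readings = {"sample_1": (10.0, 10.5), "sample_2": (4.0, 6.0)}
duplicates = summarize_duplicates(readings)
for sample, stats in duplicates.items():
    print(sample, stats)
```

Flagged samples would be candidates for re-measurement before pooling.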

Evaluate impact of sequencing depth on model performance.

It looks like sequencing depth might have a fairly large impact on model performance. To get at this, I would like to plot curves of model performance against sequencing depth.
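One way to set up such curves is to randomly subsample each sample's counts to a series of depths and re-evaluate the model at each depth. The sketch below covers only the subsampling step (without replacement). The feature names, counts, and depths are hypothetical, and the model evaluation itself is left out because it depends on the project's own pipeline.

```python
import random


def subsample_counts(counts, depth, seed=0):
    """Randomly draw `depth` reads without replacement from a per-feature count table."""
    total = sum(counts.values())
    if depth > total:
        raise ValueError(f"depth {depth} exceeds total reads {total}")
    # Expand the table into one label per read, then sample from that pool.
    pool = [feature for feature, n in counts.items() for _ in range(n)]
    rng = random.Random(seed)
    drawn = rng.sample(pool, depth)
    result = {feature: 0 for feature in counts}
    for feature in drawn:
        result[feature] += 1
    return result


# Hypothetical count table for a single sample.
counts = {"otu_a": 50, "otu_b": 30, "otu_c": 20}
depths = [10, 50, 100]
subsampled = {depth: subsample_counts(counts, depth) for depth in depths}
for depth, table in subsampled.items():
    print(depth, table)
```

Each subsampled table would then be passed to the model, and the chosen performance metric plotted against depth.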

Call ORFs

Extract ORFs from the contigs using Prodigal.
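A sketch of what a Prodigal call for metagenomic contigs might look like, wrapped in a small helper that only builds the command. The file names are hypothetical, and the repository's actual invocation may use different options. `-p meta` selects Prodigal's metagenomic mode.

```python
import shlex


def prodigal_command(contigs_fasta, proteins_out, genes_out, gff_out):
    """Build a Prodigal command line for calling ORFs on metagenomic contigs."""
    return [
        "prodigal",
        "-i", contigs_fasta,  # input contigs (FASTA)
        "-a", proteins_out,   # predicted protein translations
        "-d", genes_out,      # nucleotide sequences of predicted genes
        "-o", gff_out,        # gene coordinates
        "-f", "gff",          # write coordinates in GFF format
        "-p", "meta",         # metagenomic mode
    ]


# Hypothetical file names, for illustration only.
cmd = prodigal_command("contigs.fa", "orfs.faa", "orfs.fna", "orfs.gff")
print(shlex.join(cmd))
# To actually run it (requires Prodigal on the PATH):
# import subprocess; subprocess.run(cmd, check=True)
```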

Rerun files impacted by contig cat error.

The last time I ran the contig cat, a problem caused a bunch of reads to get mixed together, probably because of a parallel run.

I fixed it but the files need to be remade.

Get replacement NexteraXT kit

It seemed like there was something wrong with the first NexteraXT kit that Charlie used (for the whole metagenome prep). We need to contact Illumina and get them to replace that kit, since these kits ain't cheap.

No 16S analysis for Zackular data.

This is a pretty vague issue, but essentially I have the data up on Axiom so now I just have to go through with the main analysis workflow.

I should be able to follow the standard Mothur SOP.

Rerun bacterial metagenome library prep

The original bacterial metagenome prep did not work, which I think might have been due to a faulty kit. In addition to getting a replacement kit from Illumina, we are going to re-prepare this library using a new kit.

This is a rerun of the first kit that Charlie prepped.

Update Sequencing Database

I realized I don't have the sample information officially recorded in my master SQL sequencing database, so I need to go back and get that taken care of.

Pool virome library

After the Quant-IT is done, we can calculate the volumes required to pool the samples. Once this is done we can submit the library.
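The volume calculation could be sketched as below, assuming equal-mass pooling, where each sample contributes the same amount of DNA. Equimolar pooling would also need fragment sizes, which are not given here. The target mass, volume limit, sample names, and concentrations are all hypothetical.

```python
def pooling_volumes(concentrations, target_ng, max_volume_ul):
    """Return the volume (uL) of each sample needed to contribute `target_ng` of DNA.

    concentrations maps a sample name to its concentration in ng/uL.
    """
    volumes = {}
    for sample, conc in concentrations.items():
        if conc <= 0:
            raise ValueError(f"{sample}: concentration must be positive")
        volume = target_ng / conc  # mass divided by concentration gives volume
        if volume > max_volume_ul:
            raise ValueError(
                f"{sample}: needs {volume:.1f} uL, more than the {max_volume_ul} uL available"
            )
        volumes[sample] = volume
    return volumes


# Hypothetical mean concentrations from the duplicate readings.
concentrations = {"sample_1": 10.0, "sample_2": 5.0, "sample_3": 2.5}
volumes = pooling_volumes(concentrations, target_ng=20.0, max_volume_ul=15.0)
for sample, volume in volumes.items():
    print(f"{sample}: {volume:.2f} uL")
```

Raising an error when a sample is too dilute is one design choice; lowering the target mass for the whole pool is another.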

Change repo name

Change the name of the repo to conform to Schloss lab standards.
