Coder Social home page Coder Social logo

sze_obesity_mbio_2016's Introduction

Looking for a Signal in the Noise: Revisiting Obesity and the Microbiome

Two recent studies have re-analyzed published data and found that when datasets are analyzed independently there was limited support for the widely accepted hypothesis that changes in the microbiome are associated with obesity. This hypothesis was reconsidered by increasing the number of datasets and pooling the results across the individual datasets. The Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines were applied to identify 10 studies for an updated and more synthetic analysis. Alpha diversity metrics and the relative risk of obesity based on those metrics were used to identify a limited number of significant associations with obesity; however, when the results of the studies were pooled using a random effects model significant associations were observed between Shannon diversity, number of observed OTUs, and Shannon evenness and obesity status. They were not observed for the ratio of Bacteroidetes and Firmicutes or their individual relative abundances. Although these tests yielded small P-values, the difference between the Shannon diversity index of non-obese and obese individuals was 2.07%. A power analysis demonstrated that only one of the studies had sufficient power to detect a 5% difference in diversity. When Random Forest machine learning models were trained on one dataset and then tested using the other 9 datasets, the median accuracy varied between 33.01 and 64.77% (median=56.68%). Although there was support for a relationship between the microbial communities found in human feces and obesity status, this association was relatively weak and its detection is confounded by large interpersonal variation and insufficient sample sizes.

=======

Extra Information on Data set

  • To analyze full data set will need at least 200 GB of RAM
  • From start to finish it will take at least 2-3 weeks to complete

=======

Feel free to add your own data sets to the pipeline and submit a pull request to incorporate it into the full analysis

Overview

project
|- README          # the top level description of content
|
|- doc/            # documentation for the study
|  |- notebook/    # preliminary analyses (dead branches of analysis)
|  +- paper/       # manuscript(s), whether generated or not
|
|- data            # raw and primary data, are not changed once created
|  |- references/  # reference files to be used in analysis
|  |- raw/         # raw data, will not be altered
|  |- mothur/      # mothur processed data
|  +- process/     # cleaned data, will not be altered once created;
|                  # will be committed to repo
|
|- code/           # any programmatic code
|- results         # all output from workflows and analyses
|  |- tables/      # text version of tables to be rendered with kable in R
|  |- figures/     # graphs, likely designated for manuscript figures
|  +- pictures/    # diagrams, images, and other non-graph graphics
|
|- scratch/        # temporary files that can be safely deleted or lost
|
|- study.Rmd       # executable Rmarkdown for this study, if applicable
|- study.md        # Markdown (GitHub) version of the *Rmd file
|- study.html      # HTML version of *.Rmd file
|
+- Makefile        # executable Makefile for this study, if applicable

sze_obesity_mbio_2016's People

Contributors

marcsze avatar pschloss avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

sze_obesity_mbio_2016's Issues

Obesity in mice literature

@marcsze can you track down the papers that put human stool into mice and found changes in obesity and those that used antibiotics to do the same?

My latest commit

Hey Pat

I changed the strategy denovo R file and deleted the old one (although this shouldn't change anything on your end).

I also uploaded a newer version of the manuscript (with no meta at the end). I plan on editing this one for typos and such we have made.

Marc

One last look

@marcsze could you take a look at the PDF in submission and make sure everything looks good? Once I get word back from you, I'll update the line numbers in the Response to Reviewers document

Cross validation with cross_validate.R

First of all, thanks for having all the scripts here. It helps to understand your approach, and shows your commitment to open science.
I have a question on one of your steps. More specifically, I noticed in your scripts that you used a custom function for cross-validation instead of the one provided in the aucRF package. Could you elaborate on the reason? (You say there is a bug ). I am using it and I am surprised that the average cvAUC I get is way lower than I expected when I compared it to the one obtained from running my model with all the datapoints.

figure 1

@marcsze can you redo the pull request to upload the figure as results/figures/flow_chart.png? The image you uploaded was of pretty poor quality and didn't replace the exiting Figure 1.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.