Coder Social home page Coder Social logo

gtex_variation_atlas's Introduction

GTEx_Variation_Atlas

Overall goal of project

Determining not only what variation exists in GTEx, but what drives said variation.

Data acquistion:

  • gtex_read_counts.gct* The version 8 readcounts, are downloaded through R scripts, for most recent download see gtex_read_counts.gct last date modified in Setup Code/downloaded_input/gtex_read_counts.gct
  • gtex_phenotypes_v8: A file containing deeper phenotype data on GTEx participants was originally acquired through dbGAP on 2019-10-21
  • GTEx_Analysis_v8_Annotations_SampleAttributesDS.txt: a file containing sample data on GTEx samples was last download 2019-9-30
  • GTEx_Analysis_v8_Annotations_SubjectAttributesDS.txt: A file containing subject data on GTEx subjects was last downloaded 2019-8-26.

File descriptions

gen_script_gtex_v8_1:

  1. Separates tissues readcount matrices from one another.

gen_script_gtex_v8_2:

  1. Normalizes the data using DESEq2's VST() function before filtering genes on a threshold that rowMeans() should be >5 normalized counts.

gen_script_gtex_v8_3_v2:

mccall_analysis_corr_step.R:

  1. The gene data is further filtered to only genes in the .98 percentile of high variance genes for each tissue.
  2. Then using a Kendall's tau correlation the high variance genes generate a correlation matrix.

mccall_cluster_generator.R:

  1. This correlation matrix is used to create euclidean distances to cluster the data on absolute values. Generate actual clusters through a kendall's tau critical value.
  2. Further break up clusters based on how tightly they correlate using a quantile threshold of correlation, and new agglometric clustering method.

sub_cluster_maker.R:

  1. Break the clusters up on if they positively or negatively correlate with one another.

mccall_profile_maker.R:

  1. Using the genes from each cluster, generate a Z-score of samples association with each cluster using the normalized count data.

gtex_variation_atlas's People

Contributors

tnieuwe avatar

Stargazers

Truman avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.