Coder Social home page Coder Social logo

siblinggwas's Introduction

SiblingGWAS

Scripts for running GWAS using siblings to estimate Within-Family (WF) and Between-Family (BF) effects of genetic variants on continuous traits. Allows the inclusion of more than two siblings from one family.

Basic Requirements

  1. Siblings. The analysis pipeline requires data on siblings. We suggest including all siblings from families with one or more pairs of genotyped dizygotic siblings. For example, in a family with a pair of monozygotic twins and an additional sibling, include both MZ twins and the sibling. The inclusion of both MZ twins should (very) modestly improve power by accounting for variation in the phenotypic outcome. If siblings have not been previously identified in the dataset, we suggest using KING (http://people.virginia.edu/~wc9c/KING/manual.html) to infer siblings.
  2. Imputed genotype data. The analysis scripts use best guess genotype data in PLINK binary format. We have provided scripts to convert different file formats (e.g. vcf, bgen) to PLINK binary best guess format satisfying pipeline input requirements.
  3. Phenotypes. Phenotype data for siblings on outcomes of interest (e.g. height and body mass index).

For more details on the prerequisites and inputs required for the pipeline, please consult the wiki
https://github.com/LaurenceHowe/SiblingGWAS/wiki/

Downloading and running the pipeline

Navigate to the directory where you want to download the repository. The repository can then be downloaded using git:

git clone https://github.com/LaurenceHowe/SiblingGWAS/


Once the repository is downloaded, run the following command to check that files have downloaded properly:

head ./SiblingGWAS/resources/parameters


SCRIPTS:

config file

File to be edited with paths to relevant input files.
Note that only this file should be edited.

1.0_setup

The set-up script runs checks to ensure that the input files are in the correct format and checks the installation of R packages.

2.0_summary

This script extracts summary data on available phenotypes.

3.0_partitions

This script partitions the genetic data into smaller lists of SNPs to be run in batches.

4.0_unified_regression

This script runs the regressions in R. The script fits two models: a conventional regression of genotype on phenotype and a model including the family mean for each genotype to generate Within-Family and Between-Family estimates. Standard errors are adjusted to account for family structure.

5.0_tidy

This script compiles the output into a final summary statistics file.

Any queries to Laurence Howe [email protected]

Note scripts were adapted from scripts by GoDMC (Gibran Hemani et al) and the SSGAC (Sean Lee/Patrik Turley et al). See the Wiki for more information!

siblinggwas's People

Contributors

laurencehowe avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.