Coder Social home page Coder Social logo

circulatoryhealth / charge_1000g_cac Goto Github PK

View Code? Open in Web Editor NEW
3.0 2.0 0.0 953.03 MB

CHARGE Consortium: collaboration for mapping targets from 1000G GWAS on coronary artery calcification (CAC) with Maryam Kavousi, Maxime Bos, Clint L. Miller, and Pat Peyser.

License: MIT License

HTML 79.30% PostScript 20.25% R 0.31% Shell 0.13% Python 0.01%
genome-wide-association-studies gwas coronary-artery-calcification scrna-seq rnaseq regional-association-plots

charge_1000g_cac's Introduction

Multi-ancestry genome-wide analysis of coronary artery calcification identifies novel loci, functional pathways, and druggable targets

Background

Collaboration for mapping targets from the CHARGE Consortium 1000G GWAS on coronary artery calcification (CAC) to the carotid plaque data in the Athero-Express Biobank Study with Maryam Kavousi, Maxime Bos, Clint L. Miller, and Pat Peyser.

Athero-Express Biobank Study

We have bulk RNAseq (n = 635 samples) and single-cell RNAseq data, genome-wide methylation (Illumina 450K) in n ± 600, as well as overlapping genetic data for ±2,100 individuals with extensive histological plaque characterisation. We will address the following questions:

  • Gene expression correlated to characteristics of plaques?
  • Where target genes expressed, which cell types?
  • Are any of the variants associated to plaque characteristics?

Methods

Mapping genes to single cells

We will use the last dataset from the scRNAseq data, including 35 individuals, to project target genes.

Genetic analyses

For the genetic analyses we will perform regression analyses adjusted for age, sex (where applicable) and principal components. So, we will apply the following model:

model 1: phenotype ~ age + sex + chip-used + PC1 + PC2 + year-of-surgery

phenotypes are:

  • calcification, coded Calc.bin no/minor vs. moderate/heavy staining
  • collagen, coded Collagen.bin no/minor vs. moderate/heavy staining
  • fat10, coded Fat.bin_10 no/<10% fat vs. >10% fat
  • fat40, coded Fat.bin_40 no/<40% fat vs. >40% fat
  • intraplaque hemorrhage, coded IPH.bin no vs. yes
  • macrophages (CD68), coded macmean0 mean of computer-assisted calculation CD68+ region of interest
  • smooth muscle cells (alpha-actin), coded smcmean0 mean of computer-assisted calculation SMA+ region of interest
  • intraplaque vessel density (CD34), coded vessel_density manually counted CD34+ cells per 3-4 hotspots
  • mast cells, coded Mast_cells_plaque manually counted mast cell tryptase+ cells (https://academic.oup.com/eurheartj/article/34/48/3699/484981)
  • neutrophils (CD66b), coded neutrophils manually counted CD66b+ cells (https://pubmed.ncbi.nlm.nih.gov/20595650/)

Continuous variables were inverse-rank normal transformated, indicated by _rankNorm.

Figure 1: Genotyped individuals in the Athero-Express Biobank Study Genotyped individuals in the Athero-Express Biobank Study

Gene expression

Whole-plaque RNAseq

For the expression analysis we used carotid plaque-derived bulk RNAseq data and queried it for the gene list. Below a graph showing the overall expression of the genes (not all are in the data) compared to the mean expression of 1,000 randomly picked genes.

Figure 2: Overall expression of target genes in carotid plaques from the Athero-Express Biobank Study Overall expression of target genes in carotid plaques from the Athero-Express Biobank Study

Where do I start?

You can load this project in RStudio by opening the file called 'CHARGE_1000G_CAC.Rproj'.

Project structure

  1. CHARGE_1000G_CAC.Rmd
  2. bulkRNAseq.Rmd
  3. scRNAseq.Rmd
  4. Parsing_GWASSumStats.Rmd
  5. RegionalAssociationPlots.Rmd
  6. PolarMorphism.Rmd
File Description Usage
README.md Description of project Human editable
LICENSE User permissions Read only
.worcs WORCS metadata YAML Read only
renv.lock Reproducible R environment Read only
renv Reproducible R directory Read only
BASELINE Baseline tables directory Human editable
COLOC_EA_vs_AA Colocalization results directory Human editable
CredibleSets Credible sets as calculated and used Human editable
images Images directory Human editable
OUTPUT Generic output directory Human editable
PLOTS Generic plot directory Human editable
PolarMorphism PolarMorphism results directory Human editable
RACER Racer output directory Human editable
RACER_AA Racer African-American (AA) only output directory Human editable
RACER_EA_vs_AA Racer European-AA output directory Human editable
scripts Script to process some things Human editable
SNP SNP analyses scripts and results Human editable
targets Variant and gene target directory Human editable
"1. CHARGE_1000G_CAC.Rmd" Setting up work, including baseline, for AE Human editable
"2. bulkRNAseq.Rmd" Bulk RNAseq analyses in AE (not used) Human editable
"3. scRNAseq.Rmd" Single-cell RNAseq analyses in AE (not used) Human editable
"4. Parsing_GWASSumStats.Rmd" Parsing GWAS summary statistics for other analyses Human editable
"5. RegionalAssociationPlots.Rmd" Regional association plots, mirror and scatter, colocalization plots Human editable
"6. PolarMorphism.Rmd" Execute genome-wide overlap-analysis using PolarMorphism Human editable
CHARGE_1000G_CAC Script to process raw data Human editable
CHARGE_1000G_CAC.Rproj Project file Loads project

Reproducibility

This project uses the Workflow for Open Reproducible Code in Science (WORCS) to ensure transparency and reproducibility. The workflow is designed to meet the principles of Open Science throughout a research project.

To learn how WORCS helps researchers meet the TOP-guidelines and FAIR principles, read the preprint at https://osf.io/zcvbs/

WORCS: Advice for authors

WORCS: Advice for readers

Please refer to the vignette on reproducing a WORCS project for step by step advice.

Acknowledgements

Dr. Sander W. van der Laan is funded through grants from the Netherlands CardioVascular Research Initiative of the Netherlands Heart Foundation (CVON 2011/B019 and CVON 2017-20: Generating the best evidence-based pharmaceutical targets for atherosclerosis [GENIUS I&II]). We are thankful for the support of the ERA-CVD program ‘druggable-MI-targets’ (grant number: 01KL1802), the EU H2020 TO_AITION (grant number: 848146), and the Leducq Fondation ‘PlaqOmics’.

Plaque samples are derived from carotid endarterectomies as part of the Athero-Express Biobank Study which is an ongoing study in the UMC Utrecht.

The framework was based on the WORCS package.

Changes log

Version:      v1.3
Last update:  2023-05-05
Written by:   Sander W. van der Laan (s.w.vanderlaan-2[at]umcutrecht.nl).
Description:  Script to get some Athero-Express Biobank Study baseline characteristics.
Minimum requirements: R version 3.4.3 (2017-06-30) -- 'Single Candle', Mac OS X El Capitan

Changes log
* v1.3.0 Fixed an issue with LD r2 plotting. Plus cleaned up project.
* v1.2.0 Cleanup of project. Re-initialize and snapshot renv. Also added colocalization between ancestries. 
* v1.1.0 Rename. Cleaning up of project (archiving/removing unused code and results).
* v1.0.7 Update to the count data.
* v1.0.6 Update to WORCS framework.
* v1.0.6 Update code and fix coding of plaque vulnerability index (PVI) to work with SNPTEST.
* v1.0.5 Updates to the gene list.
* v1.0.4 Updates to the gene list.
* v1.0.3 Update to the gene list.
* v1.0.2 Updated baseline characteristics information. Update GWASToolKit preparation. Added more candidate SNPs to look at. Updated some variable names. 
* v1.0.1 Add an additional phenotype.
* v1.0.0 Initial version. Fixed baseline table, added codes, and results. Created sample-files.

The MIT License (MIT)

Copyright (c) 1979-2022 Sander W. van der Laan | s.w.vanderlaan [at] gmail [dot] com.

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

Reference: http://opensource.org.

charge_1000g_cac's People

Contributors

llandsmeer avatar swvanderlaan avatar

Stargazers

 avatar  avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.