Coder Social home page Coder Social logo

jinbinchan / exposomedatachallenge2021 Goto Github PK

View Code? Open in Web Editor NEW

This project forked from isglobal-exposomehub/exposomedatachallenge2021

0.0 0.0 0.0 83.89 MB

License: GNU General Public License v3.0

Python 0.21% R 3.24% MATLAB 0.04% HTML 5.05% Jupyter Notebook 91.46%

exposomedatachallenge2021's Introduction

NOTE for publications using data from this challenge

Overall, users of data are strongly encouraged to publish their results in peer-reviewed journals and to present research findings at scientific meetings, etc. Investigators planning to conduct analyses similar to those described at https://www.projecthelix.eu/ may contact Consortium members at [email protected] to discuss collaborations, if so desired. The raw data supporting the current study are available on request subject to ethical and legislative review, see details here: https://www.projecthelix.eu/index.php/es/data-inventory These data may also be used for educational purposes.

The following text should be added to any publications based on this data: This data were created as part of the ISGlobal Exposome data challenge 2021, presented in this publication (preprint: https://arxiv.org/abs/2202.01680 - under review in Env. Int.). The HELIX study [Vrijheid, Slama, et al. EHP 2014; Maitre et al. 2018 BMJ Open] represents a collaborative project across six established and ongoing longitudinal population-based birth cohort studies in six European countries (France, Greece, Lithuania, Norway, Spain, and the United Kingdom). The research leading to these results has received funding from the European Community’s Seventh Framework Programme (FP7/2007-2013) under grant agreement no 308333 – the HELIX project and the H2020-EU.3.1.2. - Preventing Disease Programme under grant agreement no 874583 (ATHLETE project). The data used for the analyses described in this manuscript were obtained from: Figshare https://figshare.com/account/home#/projects/98813 (project number 98813 accesed on MM/DD/YYYY) and github https://github.com/isglobal-exposomeHub/ExposomeDataChallenge2021/.

Exposome Data Challenge 2021

The exposome, described as "the totality of human environmental exposures from conception onwards", recognizes that individuals are exposed simultaneously to a multitude of different environmental factors and takes a holistic approach to the discovery of etiological factors for disease. The exposome’s main advantage over traditional ‘one-exposure-one-disease’ study approaches is that it provides an unprecedented conceptual framework for the study of multiple environmental hazards (urban, chemical, lifestyle, social) and their combined effects.

The objective of this event (described here) is to promote innovative statistical, data science, or other quantitative approaches to studying the health effects of complex high-throughput measurement of exposure indicators (exposome). Detailed challenge examples are given on this link.

These are the availalbe datasets to propose data analyses to address any challenge:

  • Exposome data (n=1301): Rdata file without missings and with missings containing three objects:
    • 1 object for exposures: exposome
    • 1 object for covariates: covariates
    • 1 object for outcomes: phenotype

The three tables can be linked using ID variable. See the codebook for variable description (variable name, domain, type of variable, transformation, ...)

  • omic data: Exposome and omic data can be linked using ID variable.
    • Proteome: ExpressionSet called metabol_serum of 1170 individuals and 39 proteins (log-transformed) that are annotated in the ExpressionSet object (use fData(proteome) after loading Biobase Bioconductor package).
    • Serum Metabolome: ExpressionSet called metabol_serum of 1198 individuals and 177 metabolites (log-transformed) (see here for a descripton).
    • Urine Metabolome: ExpressionSet called metabol_urine of 1192 individuals and 44 metabolites (see here for a descripton).
    • Gene expression: ExpressionSet called genexpr (see here what an ExpressionSet is) of 1007 individuals and 28,738 transcripts with annotated gene symbols.
    • Methylation: GenomicRatioSet called methy (see here what a GenomicRatioSet is) of 918 individuals and 386,518 CpGs

The variables that are available in the metadata are:

  1. ID: identification number
  2. e3_sex: gender (male, female)
  3. age_sample_years: age (in years)
  4. h_ethnicity_cauc: caucasic? (yes, no)
  5. ethn_PC1: first PCA to address population stratification
  6. ethn_PC2: second PCA to address population stratification
  7. Cell-type estimates (only for methylation): NK_6, Bcell_6, CD4T_6, CD8T_6, Gran_6, Mono_6

exposomedatachallenge2021's People

Contributors

isglobal-exposomehub avatar qwu1221 avatar isglobal-brge avatar shounakch avatar vishalmidya avatar hiyer09 avatar parasitetwin avatar congrongwang avatar wangziyue57 avatar mmcarli avatar yinqi93 avatar chiaramoccia avatar jaime-benavides avatar mkln avatar yufree avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.