Comprehensive multiple eQTL detection and its application to GWAS interpretation

Biao Zeng, Luke R. Lloyd-Jones, Grant W. Montgomery, Andres Metspalu, Tonu Esko, Lude Franke, Urmo Vosa, Annique Claringbould, Kenneth L. Brigham, Arshed A. Quyyumi, Youssef Idaghdour, Jian Yang, Peter M. Visscher, Joseph E. Powell, Greg Gibson

Research output: Contribution to journalArticlepeer-review

Abstract

Expression QTL (eQTL) detection has emerged as an important tool for unraveling the relationship between genetic risk factors and disease or clinical phenotypes. Most studies are predicated on the assumption that only a single causal variant explains the association signal in each interval. This greatly simplifies the statistical modeling, but is liable to biases in scenarios where multiple local causal-variants are responsible. Here, our primary goal was to address the prevalence of secondary cis-eQTL signals regulating peripheral blood gene expression locally, utilizing two large human cohort studies, each >2500 samples with accompanying whole genome genotypes. The CAGE (Consortium for the Architecture of Gene Expression) dataset is a compendium of Illumina microarray studies, and the Framingham Heart Study is a two-generation Affymetrix dataset. We also describe Bayesian colocalization analysis of the extent of sharing of cis-eQTL detected in both studies as well as with the BIOS RNAseq dataset. Stepwise conditional modeling demonstrates that multiple eQTL signals are present for ~40% of over 3500 eGenes in both microarray datasets, and that the number of loci with additional signals reduces by approximately two-thirds with each conditioning step. Although <20% of the peak signals across platforms fine map to the same credible interval, the colocalization analysis finds that as many as 50-60% of the primary eQTL are actually shared. Subsequently, colocalization of eQTL signals with GWAS hits detected 1349 genes whose expression in peripheral blood is associated with 591 human phenotype traits or diseases, including enrichment for genes with regulatory functions. At least 10%, and possibly as many as 40%, of eQTL-trait colocalized signals are due to nonprimary cis-eQTL peaks, but just one-quarter of these colocalization signals replicated across the gene expression datasets. Our results are provided as a web-based resource for visualization of multi-site regulation of gene expression and its association with human complex traits and disease states.

Original languageEnglish (US)
Pages (from-to)905-918
Number of pages14
JournalGenetics
Volume212
Issue number3
DOIs
StatePublished - Jul 2019

Keywords

  • Colocalization
  • Conditional association
  • Fine mapping
  • Gene regulation
  • Linkage disequilibrium
  • PolyQTL

ASJC Scopus subject areas

  • Genetics

Fingerprint Dive into the research topics of 'Comprehensive multiple eQTL detection and its application to GWAS interpretation'. Together they form a unique fingerprint.

Cite this