Symmetric directional false discovery rate control

Sarah E. Holte, Eva K. Lee, Yajun Mei

Research output: Contribution to journalArticlepeer-review

Abstract

This research is motivated from the analysis of a real gene expression data that aims to identify a subset of “interesting” or “significant” genes for further studies. When we blindly applied the standard false discovery rate (FDR) methods, our biology collaborators were suspicious or confused, as the selected list of significant genes was highly unbalanced: there were ten times more under-expressed genes than the over-expressed genes. Their concerns led us to realize that the observed two-sample t-statistics were highly skewed and asymmetric, and thus the standard FDR methods might be inappropriate. To tackle this case, we propose a symmetric directional FDR control method that categorizes the genes into “over-expressed” and “under-expressed” genes, pairs “over-expressed” and “under-expressed” genes, defines the p-values for gene pairs via column permutations, and then applies the standard FDR method to select “significant” gene pairs instead of “significant” individual genes. We compare our proposed symmetric directional FDR method with the standard FDR method by applying them to simulated data and several well-known real data sets.

Original languageEnglish (US)
Pages (from-to)71-82
Number of pages12
JournalStatistical Methodology
Volume33
DOIs
StatePublished - Dec 1 2016

Keywords

  • Column permutation
  • Directional FDR
  • False discovery rate
  • Multiple testing
  • Symmetric decision
  • Three-decisions

ASJC Scopus subject areas

  • Statistics and Probability

Fingerprint

Dive into the research topics of 'Symmetric directional false discovery rate control'. Together they form a unique fingerprint.

Cite this