Multiset multicover methods for discriminative marker selection

Euxhen Hasanaj, Amir Alavi, Anupam Gupta, Barnabás Póczos, Ziv Bar-Joseph

Research output: Contribution to journalArticlepeer-review

Abstract

Markers are increasingly being used for several high-throughput data analysis and experimental design tasks. Examples include the use of markers for assigning cell types in scRNA-seq studies, for deconvolving bulk gene expression data, and for selecting marker proteins in single-cell spatial proteomics studies. Most marker selection methods focus on differential expression (DE) analysis. Although such methods work well for data with a few non-overlapping marker sets, they are not appropriate for large atlas-size datasets where several cell types and tissues are considered. To address this, we define the phenotype cover (PC) problem for marker selection and present algorithms that can improve the discriminative power of marker sets. Analysis of these sets on several marker-selection tasks suggests that these methods can lead to solutions that accurately distinguish different phenotypes in the data.

Original languageEnglish (US)
Article number100332
JournalCell Reports Methods
Volume2
Issue number11
DOIs
StatePublished - Nov 21 2022

Keywords

  • algorithm
  • biomarker
  • cross-entropy method
  • gene sets
  • marker discovery
  • multiset multicover
  • phenotype cover
  • scRNA-seq
  • set cover

ASJC Scopus subject areas

  • Biotechnology
  • Biochemistry
  • Biochemistry, Genetics and Molecular Biology (miscellaneous)
  • Genetics
  • Radiology Nuclear Medicine and imaging
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Multiset multicover methods for discriminative marker selection'. Together they form a unique fingerprint.

Cite this