Abstract
A unified variational methodology is developed for classification and clustering problems and is tested in the classification of tumors from gene expression data. It is based on fluid-like flows in feature space that cluster a set of observations by transforming them into likely samples from p isotropic Gaussians, where p is the number of classes sought. The methodology blurs the distinction between training and testing populations through the soft assignment of both to classes. The observations act as Lagrangian markers for the flows, comparatively active or passive depending on the current strength of the assignment to the corresponding class.
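To make the notion of soft assignment concrete, the following minimal sketch computes, for each observation, the posterior probability of belonging to each of p unit-variance isotropic Gaussians. It illustrates only the assignment step in the style of expectation maximization; it is not the flow-based method of the paper. The function name `soft_assign`, the fixed class centers, the equal prior weights, and the sample points are all assumptions introduced here for illustration.

```python
import math

def soft_assign(points, centers, weights=None):
    """Posterior class probabilities under unit-variance isotropic Gaussians.

    Hypothetical helper for illustration; centers and weights are assumed given.
    """
    p = len(centers)
    if weights is None:
        weights = [1.0 / p] * p  # assumption: equal prior weight per class
    result = []
    for x in points:
        # log(weight_k * N(x; center_k, I)), dropping constants shared by all classes
        logs = [
            math.log(w) - 0.5 * sum((xi - ci) ** 2 for xi, ci in zip(x, c))
            for w, c in zip(weights, centers)
        ]
        m = max(logs)  # subtract the maximum for numerical stability
        exps = [math.exp(v - m) for v in logs]
        total = sum(exps)
        result.append([e / total for e in exps])
    return result

# Made-up 2D observations and p = 2 class centers
points = [(0.1, -0.2), (2.9, 3.1), (1.5, 1.5)]
centers = [(0.0, 0.0), (3.0, 3.0)]
resp = soft_assign(points, centers)
```

Each row of `resp` sums to one. An observation close to a center is assigned to that class with probability near one, while the third point, equidistant from both centers, is split evenly between them. In the sense of the abstract, such weights would indicate how strongly an observation currently belongs to each class.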
Original language | English (US) |
---|---|
Pages (from-to) | 1784-1802 |
Number of pages | 19 |
Journal | Multiscale Modeling and Simulation |
Volume | 8 |
Issue number | 5 |
State | Published - 2010 |
Keywords
- Density estimation
- Expectation maximization
- Gaussianization
- Inference
- Machine learning
- Maximum likelihood
ASJC Scopus subject areas
- General Chemistry
- Modeling and Simulation
- Ecological Modeling
- General Physics and Astronomy
- Computer Science Applications