TY - GEN
T1 - Workload-driven learning of mallows mixtures with pairwise preference data
AU - Stoyanovich, Julia
AU - Ilijasic, Lovro
AU - Ping, Haoyue
N1 - Funding Information:
This work was supported in part by NSF Grants No. 1464327 and 1539856, and BSF Grant No. 2014391.
Publisher Copyright:
© 2016 ACM.
PY - 2016/6/26
Y1 - 2016/6/26
N2 - In this paper we present a framework for learning mixtures of Mallows models from large samples of incomplete preferences. The problem we address is of significant practical importance in social choice, recommender systems, and other domains where it is required to aggregate, or otherwise analyze, preferences of a heterogeneous user base. We improve on state-of-the-art methods for learning mixtures of Mallows models with pairwise preference data. Exact sampling from the Mallows posterior in presence of arbitrary pairwise evidence is known to be intractable even for a single Mallows. This motivated to the development of an approximate sampler called AMP. In this paper we propose AMPx, an ensemble method for approximate sampling from the Mallows posterior that combines AMP with frequency-based estimation of posterior probabilities. We experimentally demonstrate that AMPx achieves faster convergence and higher accuracy than AMP alone. We also adapt stateof-the-art clustering techniques that have not been used in this setting, for learning parameters of the Mallows mixture, and show experimentally that mixture parameters can be learned accurately and efficiently.
AB - In this paper we present a framework for learning mixtures of Mallows models from large samples of incomplete preferences. The problem we address is of significant practical importance in social choice, recommender systems, and other domains where it is required to aggregate, or otherwise analyze, preferences of a heterogeneous user base. We improve on state-of-the-art methods for learning mixtures of Mallows models with pairwise preference data. Exact sampling from the Mallows posterior in presence of arbitrary pairwise evidence is known to be intractable even for a single Mallows. This motivated to the development of an approximate sampler called AMP. In this paper we propose AMPx, an ensemble method for approximate sampling from the Mallows posterior that combines AMP with frequency-based estimation of posterior probabilities. We experimentally demonstrate that AMPx achieves faster convergence and higher accuracy than AMP alone. We also adapt stateof-the-art clustering techniques that have not been used in this setting, for learning parameters of the Mallows mixture, and show experimentally that mixture parameters can be learned accurately and efficiently.
UR - http://www.scopus.com/inward/record.url?scp=84979775163&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84979775163&partnerID=8YFLogxK
U2 - 10.1145/2932194.2932202
DO - 10.1145/2932194.2932202
M3 - Conference contribution
AN - SCOPUS:84979775163
T3 - Proceedings of the 19th International Workshop on Web and Databases, WebDB 2016
BT - Proceedings of the 19th International Workshop on Web and Databases, WebDB 2016
PB - Association for Computing Machinery, Inc
T2 - 19th International Workshop on Web and Databases, WebDB 2016
Y2 - 26 June 2016 through 1 July 2016
ER -