TY - GEN
T1 - Analyzing crowd rankings
AU - Stoyanovich, Julia
AU - Jacob, Marie
AU - Gong, Xuemei
N1 - Publisher Copyright:
Copyright ©2010 by the Association for Computing Machinery.
PY - 2015/5/31
Y1 - 2015/5/31
N2 - Ranked data is ubiquitous in real-world applications, arising naturally when users express preferences about products and services, when voters cast ballots in elections, and when funding proposals are evaluated based on their merits or university departments based on their reputation. This paper focuses on crowd-sourcing and novel analysis of ranked data. We describe the design of a data collection task in which Amazon MT workers were asked to rank movies. We present results of data analysis, correlating our ranked dataset with IMDb, where movies are rated on a discrete scale rather than ranked. We develop an intuitive measure of worker quality appropriate for this task, where no gold standard answer exists. We propose a model of local structure in ranked datasets, reflecting that subsets of the workers agree in their ranking over subsets of the items, develop a data mining algorithm that identifies such structure, and evalu- Ate in on our dataset. Our dataset is publicly available at https://github.com/stoyanovich/CrowdRank.
AB - Ranked data is ubiquitous in real-world applications, arising naturally when users express preferences about products and services, when voters cast ballots in elections, and when funding proposals are evaluated based on their merits or university departments based on their reputation. This paper focuses on crowd-sourcing and novel analysis of ranked data. We describe the design of a data collection task in which Amazon MT workers were asked to rank movies. We present results of data analysis, correlating our ranked dataset with IMDb, where movies are rated on a discrete scale rather than ranked. We develop an intuitive measure of worker quality appropriate for this task, where no gold standard answer exists. We propose a model of local structure in ranked datasets, reflecting that subsets of the workers agree in their ranking over subsets of the items, develop a data mining algorithm that identifies such structure, and evalu- Ate in on our dataset. Our dataset is publicly available at https://github.com/stoyanovich/CrowdRank.
UR - http://www.scopus.com/inward/record.url?scp=84960440036&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84960440036&partnerID=8YFLogxK
U2 - 10.1145/2767109.2767110
DO - 10.1145/2767109.2767110
M3 - Conference contribution
AN - SCOPUS:84960440036
T3 - 18th International Workshop on the Web and Databases, WebDB 2015: Freshness, Correctness, Quality of Information and Knowledge on the Web - Proceedings
SP - 41
EP - 47
BT - 18th International Workshop on the Web and Databases, WebDB 2015
A2 - Stoyanovich, Julia
A2 - Suchanek, Fabian M.
PB - Association for Computing Machinery, Inc
T2 - 18th International Workshop on the Web and Databases, WebDB 2015
Y2 - 31 May 2015
ER -