TY - GEN
T1 - Scheduling human intelligence tasks in multi-Tenant crowd-powered systems
AU - Difallah, Djellel Eddine
AU - Demartini, Gianluca
AU - Cudré-Mauroux, Philippe
N1 - Funding Information:
National Science Foundation under grant number PP00P2 153023
PY - 2016
Y1 - 2016
N2 - Micro-Task crowdsourcing has become a popular approach to efiectively tackle complex data management problems such as data linkage, missing values, or schema matching. However, the backend crowdsourced operators of crowd-powered systems typically yield higher latencies than the machineprocessable operators, this is mainly due to inherent efficiency difierences between humans and machines. This problem can be further exacerbated by the lack of workers on the target crowdsourcing platform, or when the workers are shared unequally among a number of competing requesters; including the concurrent users from the same organization who execute crowdsourced queries with difierent types, priorities and prices. Under such conditions, a crowd-powered system acts mostly as a proxy to the crowdsourcing platform, and hence it is very difficult to provide effiency guarantees to its end-users. Scheduling is the traditional way of tackling such problems in computer science, by prioritizing access to shared resources. In this paper, we propose a new crowdsourcing system architecture that leverages scheduling algorithms to optimize task execution in a shared resources environment, in this case a crowdsourcing platform. Our study aims at assessing the efficiency of the crowd in settings where multiple types of tasks are run concurrently. We present extensive experimental results comparing i) difierent multi-Tenant crowdsourcing jobs, including a workload derived from real traces, and ii) difierent scheduling techniques tested with real crowd workers. Our experimental results show that task scheduling can be leveraged to achieve fairness and reduce query latency in multi-Tenant crowd-powered systems, although with very different tradeoffs compared to traditional settings not including human factors.
AB - Micro-Task crowdsourcing has become a popular approach to efiectively tackle complex data management problems such as data linkage, missing values, or schema matching. However, the backend crowdsourced operators of crowd-powered systems typically yield higher latencies than the machineprocessable operators, this is mainly due to inherent efficiency difierences between humans and machines. This problem can be further exacerbated by the lack of workers on the target crowdsourcing platform, or when the workers are shared unequally among a number of competing requesters; including the concurrent users from the same organization who execute crowdsourced queries with difierent types, priorities and prices. Under such conditions, a crowd-powered system acts mostly as a proxy to the crowdsourcing platform, and hence it is very difficult to provide effiency guarantees to its end-users. Scheduling is the traditional way of tackling such problems in computer science, by prioritizing access to shared resources. In this paper, we propose a new crowdsourcing system architecture that leverages scheduling algorithms to optimize task execution in a shared resources environment, in this case a crowdsourcing platform. Our study aims at assessing the efficiency of the crowd in settings where multiple types of tasks are run concurrently. We present extensive experimental results comparing i) difierent multi-Tenant crowdsourcing jobs, including a workload derived from real traces, and ii) difierent scheduling techniques tested with real crowd workers. Our experimental results show that task scheduling can be leveraged to achieve fairness and reduce query latency in multi-Tenant crowd-powered systems, although with very different tradeoffs compared to traditional settings not including human factors.
KW - Crowd-Powered System
KW - Crowdsourcing
KW - Scheduling
UR - http://www.scopus.com/inward/record.url?scp=85011386598&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85011386598&partnerID=8YFLogxK
U2 - 10.1145/2872427.2883030
DO - 10.1145/2872427.2883030
M3 - Conference contribution
AN - SCOPUS:85011386598
T3 - 25th International World Wide Web Conference, WWW 2016
SP - 855
EP - 865
BT - 25th International World Wide Web Conference, WWW 2016
PB - International World Wide Web Conferences Steering Committee
T2 - 25th International World Wide Web Conference, WWW 2016
Y2 - 11 April 2016 through 15 April 2016
ER -