TY - GEN
T1 - Design and implementation of contextual information portals
AU - Chen, Jay
AU - Power, Russell
AU - Subramanian, Lakshminarayanan
AU - Ledlie, Jonathan
N1 - Copyright 2011 Elsevier B.V., All rights reserved.
PY - 2011
Y1 - 2011
AB - This paper presents a system for enabling offline web use to satisfy the information needs of disconnected communities. We describe the design, implementation, evaluation, and pilot deployment of an automated mechanism to construct Contextual Information Portals (CIPs). CIPs are large searchable information repositories of web pages tailored to the information needs of a target population. We combine an efficient classifier with a focused crawler to gather the web pages for the portal for any given topic. Given a set of topics of interest, our system constructs a CIP containing the most relevant pages from the web across these topics. Using several secondary school course syllabi, we demonstrate the effectiveness of our system for constructing CIPs for use as an education resource. We evaluate our system across several metrics: classification accuracy, crawl scalability, crawl accuracy and harvest rate. We describe the utility and usability of our system based on a preliminary deployment study at an after-school program in India, and also outline our ongoing larger-scale pilot deployment at five schools in Kenya.
KW - document classification
KW - focused crawling
KW - offline
KW - web portal
UR - http://www.scopus.com/inward/record.url?scp=79955153090&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79955153090&partnerID=8YFLogxK
U2 - 10.1145/1963192.1963359
DO - 10.1145/1963192.1963359
M3 - Conference contribution
AN - SCOPUS:79955153090
SN - 9781450305181
T3 - Proceedings of the 20th International Conference Companion on World Wide Web, WWW 2011
SP - 453
EP - 462
BT - Proceedings of the 20th International Conference Companion on World Wide Web, WWW 2011
T2 - 20th International Conference Companion on World Wide Web, WWW 2011
Y2 - 28 March 2011 through 1 April 2011
ER -