TY - GEN
T1 - Text vs. space
T2 - 20th ACM Conference on Information and Knowledge Management, CIKM'11
AU - Christoforaki, Maria
AU - He, Jinru
AU - Dimopoulos, Constantinos
AU - Markowetz, Alexander
AU - Suel, Torsten
PY - 2011
Y1 - 2011
N2 - Many web search services allow users to constrain text queries to a geographic location (e.g., yoga classes near Santa Monica). Important examples include local search engines such as Google Local and location-based search services for smart phones. Several research groups have studied the efficient execution of queries mixing text and geography; their approaches usually combine inverted lists with a spatial access method such as an R-tree or space-filling curve. In this paper, we take a fresh look at this problem. We feel that previous work has often focused on the spatial aspect at the expense of performance considerations in text processing, such as inverted index access, compression, and caching. We describe new and existing approaches and discuss their different perspectives. We then compare their performance in extensive experiments on large document collections. Our results indicate that a query processor that combines state-of-the-art text processing techniques with a simple coarse-grained spatial structure can outperform existing approaches by up to two orders of magnitude. In fact, even a naive approach that first uses a simple inverted index and then filters out any documents outside the query range outperforms many previous methods.
AB - Many web search services allow users to constrain text queries to a geographic location (e.g., yoga classes near Santa Monica). Important examples include local search engines such as Google Local and location-based search services for smart phones. Several research groups have studied the efficient execution of queries mixing text and geography; their approaches usually combine inverted lists with a spatial access method such as an R-tree or space-filling curve. In this paper, we take a fresh look at this problem. We feel that previous work has often focused on the spatial aspect at the expense of performance considerations in text processing, such as inverted index access, compression, and caching. We describe new and existing approaches and discuss their different perspectives. We then compare their performance in extensive experiments on large document collections. Our results indicate that a query processor that combines state-of-the-art text processing techniques with a simple coarse-grained spatial structure can outperform existing approaches by up to two orders of magnitude. In fact, even a naive approach that first uses a simple inverted index and then filters out any documents outside the query range outperforms many previous methods.
KW - efficient query processing
KW - geographic web search engines
UR - http://www.scopus.com/inward/record.url?scp=83055191910&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=83055191910&partnerID=8YFLogxK
U2 - 10.1145/2063576.2063641
DO - 10.1145/2063576.2063641
M3 - Conference contribution
AN - SCOPUS:83055191910
SN - 9781450307178
T3 - International Conference on Information and Knowledge Management, Proceedings
SP - 423
EP - 432
BT - CIKM'11 - Proceedings of the 2011 ACM International Conference on Information and Knowledge Management
Y2 - 24 October 2011 through 28 October 2011
ER -