TY - GEN
T1 - Search and result presentation in scientific workflow repositories
AU - Davidson, Susan B.
AU - Huang, Xiaocheng
AU - Stoyanovich, Julia
AU - Yuan, Xiaojie
N1 - Copyright:
Copyright 2013 Elsevier B.V., All rights reserved.
PY - 2013
Y1 - 2013
N2 - We study the problem of searching a repository of complex hierarchical workflows whose component modules, both composite and atomic, have been annotated with keywords. Since keyword search does not use the graph structure of a workflow, we develop a model of workflows using context- free bag grammars. We then give efficient polynomial-time algorithms that, given a workflow and a keyword query, de- termine whether some execution of the workflow matches the query. Based on these algorithms we develop a search and ranking solution that efficiently retrieves the top-k gram- mars from a repository. Finally, we propose a novel result presentation method for grammars matching a keyword query, based on representative parse-trees. The effectiveness of our approach is validated through an extensive experimental evaluation.
AB - We study the problem of searching a repository of complex hierarchical workflows whose component modules, both composite and atomic, have been annotated with keywords. Since keyword search does not use the graph structure of a workflow, we develop a model of workflows using context- free bag grammars. We then give efficient polynomial-time algorithms that, given a workflow and a keyword query, de- termine whether some execution of the workflow matches the query. Based on these algorithms we develop a search and ranking solution that efficiently retrieves the top-k gram- mars from a repository. Finally, we propose a novel result presentation method for grammars matching a keyword query, based on representative parse-trees. The effectiveness of our approach is validated through an extensive experimental evaluation.
UR - http://www.scopus.com/inward/record.url?scp=84883042043&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84883042043&partnerID=8YFLogxK
U2 - 10.1145/2484838.2484847
DO - 10.1145/2484838.2484847
M3 - Conference contribution
AN - SCOPUS:84883042043
SN - 9781450319218
T3 - ACM International Conference Proceeding Series
BT - SSDBM 2013 - Proceedings of the 25th International Conference on Scientific and Statistical Database Management
T2 - 25th International Conference on Scientific and Statistical Database Management, SSDBM 2013
Y2 - 29 July 2013 through 31 July 2013
ER -