TY - GEN
T1 - A first study on clustering collections of workflow graphs
AU - Santos, Emanuele
AU - Lins, Lauro
AU - Ahrens, James P.
AU - Freire, Juliana
AU - Silva, Cláudio T.
N1 - Funding Information:
Our research has been funded by the Department of Energy SciDAC (VACET and SDM centers), the National Science Foundation (grants IIS-0746500, CNS- 0751152, IIS-0713637, OCE-0424602, IIS-0534628, CNS-0514485, IIS-0513692, CNS-0524096, CCF-0401498, OISE-0405402, CCF-0528201, CNS-0551724), and IBM Faculty Awards (2005, 2006, 2007, and 2008). E. Santos is partially supported by a CAPES/Fulbright fellowship
Publisher Copyright:
© Springer-Verlag Berlin Heidelberg 2008.
PY - 2008
Y1 - 2008
N2 - As workflow systems get more widely used, the number of workflows and the volume of provenance they generate has grown considerably. New tools and infrastructure are needed to allow users to interact with, reason about, and re-use this information. In this paper, we explore the use of clustering techniques to organize large collections of workflow and provenance graphs. We propose two different representations for these graphs and present an experimental evaluation, using a collection of 1,700 workflow graphs, where we study the trade-offs of these representations and the effectiveness of alternative clustering techniques.
AB - As workflow systems get more widely used, the number of workflows and the volume of provenance they generate has grown considerably. New tools and infrastructure are needed to allow users to interact with, reason about, and re-use this information. In this paper, we explore the use of clustering techniques to organize large collections of workflow and provenance graphs. We propose two different representations for these graphs and present an experimental evaluation, using a collection of 1,700 workflow graphs, where we study the trade-offs of these representations and the effectiveness of alternative clustering techniques.
UR - http://www.scopus.com/inward/record.url?scp=84961839654&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84961839654&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-89965-5_18
DO - 10.1007/978-3-540-89965-5_18
M3 - Conference contribution
AN - SCOPUS:84961839654
SN - 9783540899648
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 160
EP - 173
BT - Provenance and Annotation of Data and Processes - 2nd International Provenance and Annotation Workshop, IPAW 2008, Revised Selected Papers
A2 - Freire, Juliana
A2 - Koop, David
A2 - Freire, Juliana
A2 - Freire, Juliana
A2 - Moreau, Luc
PB - Springer Verlag
T2 - 2nd International Provenance and Annotation Workshop, IPAW 2008
Y2 - 17 June 2008 through 18 June 2008
ER -