TY - GEN
T1 - Provenance and scientific workflows
T2 - 2008 ACM SIGMOD International Conference on Management of Data 2008, SIGMOD'08
AU - Davidson, Susan B.
AU - Freire, Juliana
PY - 2008
Y1 - 2008
N2 - Provenance in the context of workflows, both for the data they derive and for their specification, is an essential component to allow for result reproducibility, sharing, and knowledge re-use in the scientific community. Several workshops have been held on the topic, and it has been the focus of many research projects and prototype systems. This tutorial provides an overview of research issues in provenance for scientific workflows, with a focus on recent literature and technology in this area. It is aimed at a general database research audience and at people who work with scientific data and workflows. We will (1) provide a general overview of scientific workflows, (2) describe research on provenance for scientific workflows and show in detail how provenance is supported in existing systems; (3) discuss emerging applications that are enabled by provenance; and (4) outline open problems and new directions for database-related research.
AB - Provenance in the context of workflows, both for the data they derive and for their specification, is an essential component to allow for result reproducibility, sharing, and knowledge re-use in the scientific community. Several workshops have been held on the topic, and it has been the focus of many research projects and prototype systems. This tutorial provides an overview of research issues in provenance for scientific workflows, with a focus on recent literature and technology in this area. It is aimed at a general database research audience and at people who work with scientific data and workflows. We will (1) provide a general overview of scientific workflows, (2) describe research on provenance for scientific workflows and show in detail how provenance is supported in existing systems; (3) discuss emerging applications that are enabled by provenance; and (4) outline open problems and new directions for database-related research.
KW - Provenance
KW - Scientific workflows
UR - http://www.scopus.com/inward/record.url?scp=57149126952&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=57149126952&partnerID=8YFLogxK
U2 - 10.1145/1376616.1376772
DO - 10.1145/1376616.1376772
M3 - Conference contribution
AN - SCOPUS:57149126952
SN - 9781605581026
T3 - Proceedings of the ACM SIGMOD International Conference on Management of Data
SP - 1345
EP - 1350
BT - SIGMOD 2008
Y2 - 9 June 2008 through 12 June 2008
ER -