TY - GEN
T1 - Fine-grained provenance collection over scripts through program slicing
AU - Pimentel, João Felipe
AU - Freire, Juliana
AU - Murta, Leonardo
AU - Braganholo, Vanessa
N1 - Publisher Copyright:
© Springer International Publishing Switzerland 2016.
PY - 2016
Y1 - 2016
N2 - Collecting provenance from scripts is often useful for scientists to explain and reproduce their scientific experiments. However, most existing automatic approaches capture provenance at coarse-grain, for example, the trace of user-defined functions. These approaches lack information of variable dependencies. Without this information, users may struggle to identify which functions really influenced the results, leading to the creation of false-positive provenance links. To address this problem, we propose an approach that uses dynamic program slicing for gathering provenance of Python scripts. By capturing dependencies among variables, it is possible to expose execution paths inside functions and, consequently, to create a provenance graph that accurately represents the function activations and the results they affect.
AB - Collecting provenance from scripts is often useful for scientists to explain and reproduce their scientific experiments. However, most existing automatic approaches capture provenance at coarse-grain, for example, the trace of user-defined functions. These approaches lack information of variable dependencies. Without this information, users may struggle to identify which functions really influenced the results, leading to the creation of false-positive provenance links. To address this problem, we propose an approach that uses dynamic program slicing for gathering provenance of Python scripts. By capturing dependencies among variables, it is possible to expose execution paths inside functions and, consequently, to create a provenance graph that accurately represents the function activations and the results they affect.
UR - http://www.scopus.com/inward/record.url?scp=84976619648&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84976619648&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-40593-3_21
DO - 10.1007/978-3-319-40593-3_21
M3 - Conference contribution
AN - SCOPUS:84976619648
SN - 9783319405926
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 199
EP - 203
BT - Provenance and Annotation of Data and Processes - 6th International Provenance and Annotation Workshop, IPAW 2016, Proceedings
A2 - Glavic, Boris
A2 - Mattoso, Marta
PB - Springer Verlag
T2 - 6th International Provenance and Annotation Workshop, IPAW 2016
Y2 - 7 June 2016 through 8 June 2016
ER -