TY - JOUR
T1 - Explicit state representation and the ATLAS event data model
T2 - Theory and practice
AU - Nowak, Marcin
AU - Malon, David
AU - Gemmeren, Peter
AU - Schaffer, Arthur
AU - Snyder, Scott
AU - Binet, Sebastien
AU - Cranmer, Kyle
PY - 2008/7/1
Y1 - 2008/7/1
N2 - In anticipation of data taking, ATLAS has undertaken a program of work to develop an explicit state representation of the experiment's complex transient event data model. This effort has provided both an opportunity to consider explicitly the structure, organization, and content of the ATLAS persistent event store before writing tens of petabytes of data (replacing simple streaming, which uses the persistent store as a core dump of transient memory), and a locus for support of event data model evolution, including significant refactoring, beyond the automatic schema evolution capabilities of underlying persistence technologies. ATLAS has encountered the need for such non-trivial schema evolution on several occasions already. This paper describes the state representation strategy (transient/persistent separation) and its implementation, including both the payoffs that ATLAS has seen (significant and sometimes surprising space and performance improvements, the extra layer notwithstanding, and extremely general schema evolution support) and the costs (additional and relatively pervasive additional infrastructure development and maintenance). The paper further discusses how these costs are mitigated, and how ATLAS is able to implement this strategy without losing the ability to take advantage of the (improving!) automatic schema evolution capabilities of underlying technology layers when appropriate. Implications of state representations for direct ROOT browsability, and current strategies for associating physics analysis views with such state representations, are also described.
AB - In anticipation of data taking, ATLAS has undertaken a program of work to develop an explicit state representation of the experiment's complex transient event data model. This effort has provided both an opportunity to consider explicitly the structure, organization, and content of the ATLAS persistent event store before writing tens of petabytes of data (replacing simple streaming, which uses the persistent store as a core dump of transient memory), and a locus for support of event data model evolution, including significant refactoring, beyond the automatic schema evolution capabilities of underlying persistence technologies. ATLAS has encountered the need for such non-trivial schema evolution on several occasions already. This paper describes the state representation strategy (transient/persistent separation) and its implementation, including both the payoffs that ATLAS has seen (significant and sometimes surprising space and performance improvements, the extra layer notwithstanding, and extremely general schema evolution support) and the costs (additional and relatively pervasive additional infrastructure development and maintenance). The paper further discusses how these costs are mitigated, and how ATLAS is able to implement this strategy without losing the ability to take advantage of the (improving!) automatic schema evolution capabilities of underlying technology layers when appropriate. Implications of state representations for direct ROOT browsability, and current strategies for associating physics analysis views with such state representations, are also described.
UR - http://www.scopus.com/inward/record.url?scp=50849113759&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=50849113759&partnerID=8YFLogxK
U2 - 10.1088/1742-6596/119/4/042024
DO - 10.1088/1742-6596/119/4/042024
M3 - Article
AN - SCOPUS:50849113759
SN - 1742-6588
VL - 119
JO - Journal of Physics: Conference Series
JF - Journal of Physics: Conference Series
IS - 4
M1 - 042024
ER -