TY - JOUR
T1 - Making snapshot isolation serializable
AU - Fekete, Alan
AU - Liarokapis, Dimitrios
AU - O'Neil, Elizabeth
AU - O'Neil, Patrick
AU - Shasha, Dennis
PY - 2005/6
Y1 - 2005/6
N2 - Snapshot Isolation (SI) is a multiversion concurrency control algorithm, first described in Berenson et al. [1995]. SI is attractive because it provides an isolation level that avoids many of the common concurrency anomalies, and has been implemented by Oracle and Microsoft SQL Server (with certain minor variations). SI does not guarantee serializability in all cases, but the TPC-C benchmark application [TPC-C], for example, executes under SI without serialization anomalies. All major database system products are delivered with default nonserializable isolation levels, often ones that encounter serialization anomalies more commonly than SI, and we suspect that numerous isolation errors occur each day at many large sites because of this, leading to corrupt data sometimes noted in data warehouse applications. The classical justification for lower isolation levels is that applications can be run under such levels to improve efficiency when they can be shown not to result in serious errors, but little or no guidance has been offered to application programmers and DBAs by vendors as to how to avoid such errors. This article develops a theory that characterizes when nonserializable executions of applications can occur under SI. Near the end of the article, we apply this theory to demonstrate that the TPC-C benchmark application has no serialization anomalies under SI, and then discuss how this demonstration can be generalized to other applications. We also present a discussion on how to modify the program logic of applications that are nonserializable under SI so that serializability will be guaranteed.
AB - Snapshot Isolation (SI) is a multiversion concurrency control algorithm, first described in Berenson et al. [1995]. SI is attractive because it provides an isolation level that avoids many of the common concurrency anomalies, and has been implemented by Oracle and Microsoft SQL Server (with certain minor variations). SI does not guarantee serializability in all cases, but the TPC-C benchmark application [TPC-C], for example, executes under SI without serialization anomalies. All major database system products are delivered with default nonserializable isolation levels, often ones that encounter serialization anomalies more commonly than SI, and we suspect that numerous isolation errors occur each day at many large sites because of this, leading to corrupt data sometimes noted in data warehouse applications. The classical justification for lower isolation levels is that applications can be run under such levels to improve efficiency when they can be shown not to result in serious errors, but little or no guidance has been offered to application programmers and DBAs by vendors as to how to avoid such errors. This article develops a theory that characterizes when nonserializable executions of applications can occur under SI. Near the end of the article, we apply this theory to demonstrate that the TPC-C benchmark application has no serialization anomalies under SI, and then discuss how this demonstration can be generalized to other applications. We also present a discussion on how to modify the program logic of applications that are nonserializable under SI so that serializability will be guaranteed.
KW - Anomaly
KW - Concurrency control
KW - Consistency
KW - Multiversion concurrency
KW - Serializability
KW - Snapshot isolation
KW - Weak isolation
UR - http://www.scopus.com/inward/record.url?scp=23944513290&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=23944513290&partnerID=8YFLogxK
U2 - 10.1145/1071610.1071615
DO - 10.1145/1071610.1071615
M3 - Review article
AN - SCOPUS:23944513290
SN - 0362-5915
VL - 30
SP - 492
EP - 528
JO - ACM Transactions on Database Systems
JF - ACM Transactions on Database Systems
IS - 2
ER -