Minimizing faulty executions of distributed systems

Colin Scott, Aurojit Panda, Vjekoslav Brajkovic, George Necula, Arvind Krishnamurthy, Scott Shenker

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

When troubleshooting buggy executions of distributed systems, developers typically start by manually separating out events that are responsible for triggering the bug (signal) from those that are extraneous (noise). We present DEMi, a tool for automatically performing this minimization. We apply DEMi to buggy executions of two very different distributed systems, Raft and Spark, and find that it produces minimized executions that are between 1X and 4.6X the size of optimal executions.

Original languageEnglish (US)
Title of host publicationProceedings of the 13th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2016
PublisherUSENIX Association
Pages291-309
Number of pages19
ISBN (Electronic)9781931971294
StatePublished - Jan 1 2016
Event13th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2016 - Santa Clara, United States
Duration: Mar 16 2016Mar 18 2016

Publication series

NameProceedings of the 13th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2016

Conference

Conference13th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2016
Country/TerritoryUnited States
CitySanta Clara
Period3/16/163/18/16

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Control and Systems Engineering

Fingerprint

Dive into the research topics of 'Minimizing faulty executions of distributed systems'. Together they form a unique fingerprint.

Cite this