Flushing without cascades

Michael A. Bender, Rathish Das, Martín Farach-Colton, Rob Johnson, William Kuszmaul

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Abstract

    Buffer-and-flush is a technique for transforming standard external-memory search trees into write-optimized search trees. In exchange for faster amortized insertions, buffer-and-flush can sometimes significantly increase the latency of operations by causing cascades of flushes. In this paper, we show that flushing cascades are not a fundamental consequence of the buffer-flushing technique, and can be removed entirely using randomization techniques. The underlying implementation of buffer flushing relies on a buffer-eviction strategy at each node in the tree. The ability for the user to select the buffer eviction strategy based on the workload has been shown to be important for performance, both in theory and in practice. In order to support arbitrary buffer-eviction strategies, we introduce the notion of a universal flush, which uses a universal eviction policy that can simulate any other eviction policy. This abstracts away the underlying eviction strategy, even allowing for workload-specific strategies that change dynamically. Our deamortization preserves the amortized throughput of the underlying flushing strategy on all workloads. In particular, with our deamortization and a node cache of size poly-logarithmic in the number of insertions performed on the tree, the amortized insertion cost matches the lower bound of Brodal and Fagerberg. For typical parameters, the lower bound is less than 1 I/O per insertion. For such parameters, our worst-case insertion cost is O(1) I/Os.

    Original languageEnglish (US)
    Title of host publication31st Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2020
    EditorsShuchi Chawla
    PublisherAssociation for Computing Machinery
    Pages650-669
    Number of pages20
    ISBN (Electronic)9781611975994
    StatePublished - 2020
    Event31st Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2020 - Salt Lake City, United States
    Duration: Jan 5 2020Jan 8 2020

    Publication series

    NameProceedings of the Annual ACM-SIAM Symposium on Discrete Algorithms
    Volume2020-January

    Conference

    Conference31st Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2020
    Country/TerritoryUnited States
    CitySalt Lake City
    Period1/5/201/8/20

    ASJC Scopus subject areas

    • Software
    • General Mathematics

    Fingerprint

    Dive into the research topics of 'Flushing without cascades'. Together they form a unique fingerprint.

    Cite this