Skip to main content

Tolerating Slowdowns in Replicated State Machines using Copilots

Author(s): Ngo, Khiem; Sen, Siddhartha; Lloyd, Wyatt

Download
To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1sz75
Full metadata record
DC FieldValueLanguage
dc.contributor.authorNgo, Khiem-
dc.contributor.authorSen, Siddhartha-
dc.contributor.authorLloyd, Wyatt-
dc.date.accessioned2021-10-08T19:50:17Z-
dc.date.available2021-10-08T19:50:17Z-
dc.date.issued2020en_US
dc.identifier.citationNgo, Khiem, Siddhartha Sen, and Wyatt Lloyd. "Tolerating Slowdowns in Replicated State Machines using Copilots." In 14th USENIX Symposium on Operating Systems Design and Implementation (2020): pp. 583-598.en_US
dc.identifier.urihttps://www.usenix.org/system/files/osdi20-ngo.pdf-
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/pr1sz75-
dc.description.abstractReplicated state machines are linearizable, fault-tolerant groups of replicas that are coordinated using a consensus algorithm. Copilot replication is the first 1-slowdown-tolerant consensus protocol: it delivers normal latency despite the slowdown of any 1 replica. Copilot uses two distinguished replicas—the pilot and copilot—to proactively add redundancy to all stages of processing a client’s command. Copilot uses dependencies and deduplication to resolve potentially differing orderings proposed by the pilots. To avoid dependencies leading to either pilot being able to slow down the group, Copilot uses fast takeovers that allow a fast pilot to complete the ongoing work of a slow pilot. Copilot includes two optimizations—ping-pong batching and null dependency elimination—that improve its performance when there are 0 and 1 slow pilots respectively. Our evaluation of Copilot shows its performance is lower but competitive with Multi-Paxos and EPaxos when no replicas are slow. When a replica is slow, Copilot is the only protocol that avoids high latencies.en_US
dc.format.extent583 - 598en_US
dc.language.isoen_USen_US
dc.relation.ispartof14th USENIX Symposium on Operating Systems Design and Implementationen_US
dc.rightsFinal published version. This is an open access article.en_US
dc.titleTolerating Slowdowns in Replicated State Machines using Copilotsen_US
dc.typeConference Articleen_US
pu.type.symplectichttp://www.symplectic.co.uk/publications/atom-terms/1.0/conference-proceedingen_US

Files in This Item:
File Description SizeFormat 
SlowdownCopilot.pdf992.13 kBAdobe PDFView/Download


Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.