Table of Contents
Fetching ...

A Lightweight Approach for State Machine Replication

Christian Cachin, Jinfeng Dou, Christian Scheideler, Philipp Schneider

TL;DR

This work tackles state machine replication under an adaptive blocking adversary by marrying a leaderless median-rule consensus with lightweight, client-held commitment certificates. It introduces a compact, log-based approach where servers store only the shared state and uncommitted commands, while clients maintain short certificates (via a Merkle forest) proving that their commands were committed. The methodology includes an extended median-rule SMR with a $(k,\ell)$-Gossip broadcasting mechanism, a compact rule to trim logs, and a recovery protocol that preserves monotonicity and enables fast restoration after surges, all with logarithmic communication, latency, and per-client storage. The framework provides liveness under $1$-late, $\Theta(1)$-blocking adversaries and strong safety under arbitrary blocking, offering a practical, scalable, and partition-tolerant SMR solution with potential extensions to higher blocking or churn by adjusting parameters. Overall, the approach advances scalable, leaderless SMR with provable guarantees and minimal server-side state, suitable for lightweight, verifiable transaction histories where clients shoulder commitment proofs.

Abstract

We present a lightweight solution for state machine replication with commitment certificates. Specifically, we adapt and analyze a median rule for the stabilizing consensus problem [Doerr11] to operate in a client-server setting where arbitrary servers may be blocked adaptively based on past system information. We further extend our protocol by compressing information about committed commands, thus keeping the protocol lightweight, while still enabling clients to easily prove that their commands have indeed been committed on the shared state. Our approach guarantees liveness as long as at most a constant fraction of servers are blocked, ensures safety under any number of blocked servers, and supports fast recovery even after all servers are blocked. In addition to offering near-optimal asymptotic performance in several respects, our method is fully decentralized, unlike other near-optimal solutions that rely on leaders. In particular, our solution is robust against adversaries that target key servers (which captures insider-based denial-of-service attacks), whereas leader-based approaches fail under such a blocking model.

A Lightweight Approach for State Machine Replication

TL;DR

This work tackles state machine replication under an adaptive blocking adversary by marrying a leaderless median-rule consensus with lightweight, client-held commitment certificates. It introduces a compact, log-based approach where servers store only the shared state and uncommitted commands, while clients maintain short certificates (via a Merkle forest) proving that their commands were committed. The methodology includes an extended median-rule SMR with a -Gossip broadcasting mechanism, a compact rule to trim logs, and a recovery protocol that preserves monotonicity and enables fast restoration after surges, all with logarithmic communication, latency, and per-client storage. The framework provides liveness under -late, -blocking adversaries and strong safety under arbitrary blocking, offering a practical, scalable, and partition-tolerant SMR solution with potential extensions to higher blocking or churn by adjusting parameters. Overall, the approach advances scalable, leaderless SMR with provable guarantees and minimal server-side state, suitable for lightweight, verifiable transaction histories where clients shoulder commitment proofs.

Abstract

We present a lightweight solution for state machine replication with commitment certificates. Specifically, we adapt and analyze a median rule for the stabilizing consensus problem [Doerr11] to operate in a client-server setting where arbitrary servers may be blocked adaptively based on past system information. We further extend our protocol by compressing information about committed commands, thus keeping the protocol lightweight, while still enabling clients to easily prove that their commands have indeed been committed on the shared state. Our approach guarantees liveness as long as at most a constant fraction of servers are blocked, ensures safety under any number of blocked servers, and supports fast recovery even after all servers are blocked. In addition to offering near-optimal asymptotic performance in several respects, our method is fully decentralized, unlike other near-optimal solutions that rely on leaders. In particular, our solution is robust against adversaries that target key servers (which captures insider-based denial-of-service attacks), whereas leader-based approaches fail under such a blocking model.

Paper Structure

This paper contains 22 sections, 40 theorems, 32 equations, 3 figures, 6 algorithms.

Key Result

Lemma 2.1

If the initial fraction of useful servers is at most $1/3-\varepsilon$ for any constant $\varepsilon>0$ then even if no server is blocked, within $O(\log \log n)$ rounds no server will be useful anymore, w.h.p.

Figures (3)

  • Figure 1: $g(x)=-10x^6+36x^5-45x^4+20x^3-3x^2$
  • Figure 2: $g'(x)=-8x^6+ \frac{144}{5}x^5-36x^4+16x^3-\frac{41}{40}x$
  • Figure 3: $g"(x)=-7x^6+ \frac{126}{5}x^5-\frac{63}{2}x^4+14x^3-\frac{39}{40}x$

Theorems & Definitions (74)

  • Definition 1.2: Stabilizing Consensus Problem
  • Definition 1.3: Stabilizing SMR Problem
  • Definition 1.4: Commitment Problem
  • Definition 1.5: Recovery Problem
  • Lemma 2.1
  • proof
  • Lemma 2.2
  • proof
  • Lemma 2.3
  • proof
  • ...and 64 more