Table of Contents
Fetching ...

A Formula for the I/O Cost of Linear Repair Schemes and Application to Reed-Solomon Codes

Zhongyan Liu, Zhifang Zhang

TL;DR

The paper addresses the I/O cost of linear repair schemes for Reed-Solomon codes in erasure-coded distributed storage, introducing a general formula that ties the I/O cost to the Hamming weight of a related $B$-linear space. This enables tractable lower bounds for full-length RS codes with two or three parities and guides the construction of repair schemes with reduced I/O, while preserving repair bandwidth. A key contribution is a scheme for full-length RS codes using affine $q$-polynomials that achieves an optimal or near-optimal I/O cost in several regimes, and demonstrates that reducing I/O cost can be achieved with limited or zero trade-offs in repair bandwidth. The results have practical impact for implementing RS codes in large-scale storage systems by lowering the data read during node repair and improving overall repair efficiency.

Abstract

Node repair is a crucial problem in erasure-code-based distributed storage systems. An important metric for repair efficiency is the I/O cost which equals the total amount of data accessed at helper nodes to repair a failed node. In this work, a general formula for computing the I/O cost of linear repair schemes is derived from a new perspective, i.e., by investigating the Hamming weight of a related linear space. Applying the formula to Reed-Solomon (RS) codes, we obtain lower bounds on the I/O cost for full-length RS codes with two and three parities. Furthermore, we build linear repair schemes for the RS codes with improved I/O cost. For full-length RS codes with two parities, our scheme meets the lower bound on the I/O cost.

A Formula for the I/O Cost of Linear Repair Schemes and Application to Reed-Solomon Codes

TL;DR

The paper addresses the I/O cost of linear repair schemes for Reed-Solomon codes in erasure-coded distributed storage, introducing a general formula that ties the I/O cost to the Hamming weight of a related -linear space. This enables tractable lower bounds for full-length RS codes with two or three parities and guides the construction of repair schemes with reduced I/O, while preserving repair bandwidth. A key contribution is a scheme for full-length RS codes using affine -polynomials that achieves an optimal or near-optimal I/O cost in several regimes, and demonstrates that reducing I/O cost can be achieved with limited or zero trade-offs in repair bandwidth. The results have practical impact for implementing RS codes in large-scale storage systems by lowering the data read during node repair and improving overall repair efficiency.

Abstract

Node repair is a crucial problem in erasure-code-based distributed storage systems. An important metric for repair efficiency is the I/O cost which equals the total amount of data accessed at helper nodes to repair a failed node. In this work, a general formula for computing the I/O cost of linear repair schemes is derived from a new perspective, i.e., by investigating the Hamming weight of a related linear space. Applying the formula to Reed-Solomon (RS) codes, we obtain lower bounds on the I/O cost for full-length RS codes with two and three parities. Furthermore, we build linear repair schemes for the RS codes with improved I/O cost. For full-length RS codes with two parities, our scheme meets the lower bound on the I/O cost.
Paper Structure (10 sections, 12 theorems, 30 equations)

This paper contains 10 sections, 12 theorems, 30 equations.

Key Result

Lemma 1

Let $\mathcal{C}$ be an $[n,k]$ MDS code over $F$, and $B\subseteq F$ be a subfield of $F$ with $[F:B]=\ell$. Then for each node $i^*\in[n]$, the following are equivalent.

Theorems & Definitions (21)

  • Lemma 1: Guruswami-Wootters RSrepair
  • Definition 1: IOfulllength
  • Lemma 2: IOfulllength
  • Lemma 3: shortr=2
  • Lemma 4
  • proof
  • Theorem 5
  • proof
  • Remark 1
  • Theorem 6
  • ...and 11 more