Optimal ancilla-free Clifford+T synthesis for general single-qubit unitaries
Hayata Morisaki, Kaoru Sano, Seiseki Akibue
TL;DR
This work addresses the problem of ancilla-free Clifford+$T$ synthesis for arbitrary single-qubit unitaries, optimizing the $T$-count while achieving a target accuracy under the diamond distance $d_onterges{\diamond}$. It introduces two complementary approaches: a deterministic algorithm that attains $T$-optimal circuits with runtimes scaling as $\varepsilon^{-1/2 - o(1)}$ and a $T$-count of $3\log_2(1/\varepsilon) + o(\log_2(1/\varepsilon))$, and a probabilistic (mixed-unitary) algorithm with runtimes $\varepsilon^{-1/4 - o(1)}$ and a $T$-count of $1.5\log_2(1/\varepsilon) + o(\log_2(1/\varepsilon))$, both demonstrated to be practical up to $\varepsilon \approx 10^{-15}$ and $10^{-22}$ respectively. The methods rely on number-theoretic properties of Clifford+$T$, lattice enumeration via integer-point techniques, Matsumoto-Amano normal form, and SDP-based optimization to compute optimal mixtures, all without conjectures. The results provide rigorous, conjecture-free guarantees and offer concrete resource estimates for fault-tolerant quantum computation using Clifford+$T$ gates, with potential extensions to other gate sets and questions about polynomial-time solvability under relaxed optimality.
Abstract
We propose two Clifford+$T$ synthesis algorithms that are optimal with respect to $T$-count. The first algorithm, called deterministic synthesis, approximates any single-qubit unitary by a single-qubit Clifford+$T$ circuit with the minimum $T$-count. The second algorithm, called probabilistic synthesis, approximates any single-qubit unitary by a probabilistic mixture of single-qubit Clifford+$T$ circuits with the minimum $T$-count. For most of single-qubit unitaries, the runtimes of deterministic synthesis and probabilistic synthesis are $\varepsilon^{-1/2 - o(1)}$ and $\varepsilon^{-1/4 - o(1)}$, respectively, for an approximation error $\varepsilon$. Although this complexity is exponential in the input size, we demonstrate that our algorithms run in practical time at $\varepsilon \approx 10^{-15}$ and $\varepsilon \approx 10^{-22}$, respectively. Furthermore, we show that, for most single-qubit unitaries, the deterministic synthesis algorithm requires at most $3\log_2(1/\varepsilon) + o(\log_2(1/\varepsilon))$ $T$-gates, and the probabilistic synthesis algorithm requires at most $1.5\log_2(1/\varepsilon) + o(\log_2(1/\varepsilon))$ $T$-gates. Remarkably, complexity analyses in this work do not rely on any numerical or number-theoretic conjectures.
