Unitary synthesis with fewer T gates
Xinyu Tan
TL;DR
This paper tackles the problem of compiling an arbitrary $n$-qubit unitary into a Clifford+T circuit with minimal $T$-gates. It introduces a recursive cosine-sine decomposition to express any unitary as a product of $2^n-1$ multi-controlled $k$-qubit unitaries and then groups blocks that share target qubits to reduce cost. By extending the diagonal-unitary synthesis method of Gosset, Kothari, and Wu to multi-controlled unitaries and carefully balancing ancilla and gate counts, the authors achieve a breakthrough upper bound of $O\left(2^{4n/3} \, L^{2/3} + 2^n L\right)$ $T$-gates, where $L = n + \log(1/\varepsilon)$. They also show a complementary bound with a tradeoff parameter $k$ giving $O\left(2^{(3n-k)/2} \sqrt{L}\right)$ $T$-gates and $O\left(2^{(n+k)/2} \sqrt{L}\right)$ ancillae for suitable choices of $k$, highlighting that large ancilla resources could approach the conjectured $2^n$ scaling. Together, these results provide evidence that the optimal $T$-gate count for arbitrary unitary synthesis could be as low as $2^n$ under favorable ancilla budgets and offer a concrete path toward more practical fault-tolerant quantum compilation. The work has implications for quantum simulation and state-preparation primitives, where reducing $T$-gates is particularly impactful on hardware platforms with costly magic-state distillation.
Abstract
We present a simple algorithm that implements an arbitrary $n$-qubit unitary operator using a Clifford+T circuit with T-count $O(2^{4n/3} n^{2/3})$. This improves upon the previous best known upper bound of $O(2^{3n/2} n)$, while the best known lower bound remains $Ω(2^n)$. Our construction is based on a recursive application of the cosine-sine decomposition, together with a generalization of the optimal diagonal unitary synthesis method by Gosset, Kothari, and Wu to multi-controlled $k$-qubit unitaries.
