DCC: Differentiable Cardinality Constraints for Partial Index Tracking
Wooyeon Jo, Hyunsouk Cho
TL;DR
The paper addresses the NP-hard challenge of partial replication in index tracking by introducing the Differentiable Cardinality Constraint (DCC) and its floating-point precision-aware variant, DCC_fpp. By replacing the non-differentiable cardinality indicator with differentiable surrogates such as $\widetilde{b}(w_i)=1-\frac{1}{a w_i+1}$ and $\tilde{b}_{fpp}(w_i)=\frac{1}{1+e^{-a(w_i-\epsilon)}}$, the authors integrate the constraint into standard differentiable optimization frameworks, notably SLSQP, while proving conditions under which actual cardinality is enforced. They establish accuracy and assurance criteria (C0–C2) and prove that, with a sufficiently large $a$ (e.g., $a \ge 1.38\times10^5$ in 64-bit precision), the surrogate preserves correct cardinality and constraint satisfaction, achieving polynomial-time complexity. Empirical results across S&P 100/500 and KOSPI 100 show that DCC_fpp yields competitive tracking performance—often outperforming heuristic baselines—while offering improved efficiency and interpretability. The work provides practical hyperparameter guidance and demonstrates the approach’s viability for real-world portfolio optimization under cardinality constraints.
Abstract
Index tracking is a popular passive investment strategy aimed at optimizing portfolios, but fully replicating an index can lead to high transaction costs. To address this, partial replication have been proposed. However, the cardinality constraint renders the problem non-convex, non-differentiable, and often NP-hard, leading to the use of heuristic or neural network-based methods, which can be non-interpretable or have NP-hard complexity. To overcome these limitations, we propose a Differentiable Cardinality Constraint ($\textbf{DCC}$) for index tracking and introduce a floating-point precision-aware method ($\textbf{DCC}_{fpp}$) to address implementation issues. We theoretically prove our methods calculate cardinality accurately and enforce actual cardinality with polynomial time complexity. We propose the range of the hyperparameter $a$ ensures that $\textbf{DCC}_{fpp}$ has no error in real implementations, based on theoretical proof and experiment. Our method applied to mathematical method outperforms baseline methods across various datasets, demonstrating the effectiveness of the identified hyperparameter $a$.
