Table of Contents
Fetching ...

A Risk-Neutral Neural Operator for Arbitrage-Free SPX-VIX Term Structures

Jian'an Zhang

TL;DR

ARBITER reframes risk‑neutral pricing of SPX–VIX term structures as a structured neural operator tuned to arbitrage‑free and martingale constraints. It combines a risk‑neutral Green operator with a Lipschitz‑controlled Q‑Align layer and a convex–monotone SPX–VIX decoder, trained via saddle‑point optimization with explicit stopping criteria. The framework delivers dimensionless metrics (NAS/CNAS/NI/DualGap/Stability/SW/GenGap) with HAC‑CI and Holm‑Bonferroni corrections, demonstrating strong out‑of‑sample performance and robustness under ablations. Theoretical results guarantee approximation, identifiability, and convergence within a safety‑certified loop, while empirical analyses on a high‑fidelity synthetic SPX–VIX generator illustrate coherent pricing, stable calibration, and interpretable diagnostics. The work provides a reproducible blueprint for safety‑first operator learning in finance, with potential extensions to multi‑market coupling and more realistic market frictions.

Abstract

We propose ARBITER, a risk-neutral neural operator for learning joint SPX-VIX term structures under no-arbitrage constraints. ARBITER maps market states to an operator that outputs implied volatility and variance curves while enforcing static arbitrage (calendar, vertical, butterfly), Lipschitz bounds, and monotonicity. The model couples operator learning with constrained decoders and is trained with extragradient-style updates plus projection. We introduce evaluation metrics for derivatives term structures (NAS, CNAS, NI, Dual-Gap, Stability Rate) and show gains over Fourier Neural Operator, DeepONet, and state-space sequence models on historical SPX and VIX data. Ablation studies indicate that tying the SPX and VIX legs reduces Dual-Gap and improves NI, Lipschitz projection stabilizes calibration, and selective state updates improve long-horizon generalization. We provide identifiability and approximation results and describe practical recipes for arbitrage-free interpolation and extrapolation across maturities and strikes.

A Risk-Neutral Neural Operator for Arbitrage-Free SPX-VIX Term Structures

TL;DR

ARBITER reframes risk‑neutral pricing of SPX–VIX term structures as a structured neural operator tuned to arbitrage‑free and martingale constraints. It combines a risk‑neutral Green operator with a Lipschitz‑controlled Q‑Align layer and a convex–monotone SPX–VIX decoder, trained via saddle‑point optimization with explicit stopping criteria. The framework delivers dimensionless metrics (NAS/CNAS/NI/DualGap/Stability/SW/GenGap) with HAC‑CI and Holm‑Bonferroni corrections, demonstrating strong out‑of‑sample performance and robustness under ablations. Theoretical results guarantee approximation, identifiability, and convergence within a safety‑certified loop, while empirical analyses on a high‑fidelity synthetic SPX–VIX generator illustrate coherent pricing, stable calibration, and interpretable diagnostics. The work provides a reproducible blueprint for safety‑first operator learning in finance, with potential extensions to multi‑market coupling and more realistic market frictions.

Abstract

We propose ARBITER, a risk-neutral neural operator for learning joint SPX-VIX term structures under no-arbitrage constraints. ARBITER maps market states to an operator that outputs implied volatility and variance curves while enforcing static arbitrage (calendar, vertical, butterfly), Lipschitz bounds, and monotonicity. The model couples operator learning with constrained decoders and is trained with extragradient-style updates plus projection. We introduce evaluation metrics for derivatives term structures (NAS, CNAS, NI, Dual-Gap, Stability Rate) and show gains over Fourier Neural Operator, DeepONet, and state-space sequence models on historical SPX and VIX data. Ablation studies indicate that tying the SPX and VIX legs reduces Dual-Gap and improves NI, Lipschitz projection stabilizes calibration, and selective state updates improve long-horizon generalization. We provide identifiability and approximation results and describe practical recipes for arbitrage-free interpolation and extrapolation across maturities and strikes.

Paper Structure

This paper contains 195 sections, 21 theorems, 214 equations, 9 figures, 2 tables.

Key Result

Lemma 1

Assume $\rho\!\left(A_\theta(T_\ell)\right)\,\Delta t_\ell \le 1-\varepsilon$ for all $\ell$ with some $\varepsilon\in(0,1)$, and that $\|B_s\|\le b\,\Delta t_s$ for a constant $b>0$ under an operator norm subordinate to a vector norm. Then there exists $C=C(\varepsilon,b,\overline{\Delta t})<\infty

Figures (9)

  • Figure 1: Core metrics with 95% HAC-CI. NAS, CNAS, and NI are reported as point estimates with HAC-CI bands. The dashed line at $1.0$ highlights normalization for NAS/CNAS.
  • Figure 2: Pricing curves across maturities. Three legs (legend) exhibit smooth-in-$T$ behavior with monotone structure consistent with the convex--monotone decoder.
  • Figure 3: Implied-volatility (IV) contours (multi-view). Top-left: filled contours in $(T,K)$; top-right: line contours with labeled levels; bottom-left: filled contours in $(T,\log(K/S_0))$; bottom-right: IV slices $\sigma(K)$ at selected maturities. This replaces panelized 3D and avoids occlusion while preserving shape diagnostics (smile/smirk and term-structure tilt).
  • Figure 4: Model-implied volatility surface (3D). A complementary view to Fig. \ref{['fig:iv-4panel']} confirming smoothness across $(T,K)$ and the absence of butterfly/time-arbitrage artifacts on the synthetic generator.
  • Figure 5: Spectral Guard & projection effect. Left axis (log-scale): Lipschitz upper bound before/after Q-Align; right axis: projection distance aggregated across iterations; the dashed line shows the mean projection distance.
  • ...and 4 more figures

Theorems & Definitions (30)

  • Lemma 1: Green kernel bound
  • proof : Proof sketch
  • Proposition 1: RN-operator stability under Q-Align
  • proof : Proof sketch
  • Proposition 2: Consistency of discretized VIX replication
  • Proposition 3: Variance-swap identifiability via replication
  • Proposition 4: Static no-arbitrage and replication consistency
  • Theorem 1: Extragradient convergence to a noise ball
  • Theorem 2: Approximation rate and conditioning
  • Theorem 3: Local identifiability and information bound
  • ...and 20 more