NVRNet: Deep Learning Model for Fast Nitrogen Vacancy Characterization under Room Temperature

Chao Shang; Gregory D. Fuchs

NVRNet: Deep Learning Model for Fast Nitrogen Vacancy Characterization under Room Temperature

Chao Shang, Gregory D. Fuchs

Abstract

Characterization of the local spin environment of single diamond nitrogen-vacancy centers is a critical task for quantum sensing, quantum networking, and diamond materials optimization. We introduce NVRNet, a physics-informed simulation-to-reality pipeline that maps a fast acquisition, noisy Ramsey photoluminescence (PL) trace to a denoised waveform as well as outputting a direct estimate of hyperfine coupling to ${}^{13}\mathrm{C}$ spins in the environment. The denoiser is a two-stage time-frequency UNet followed by an attention-augmented time-domain UNet, pretrained on Hamiltonian-based simulations with experimentally calibrated noise. The simulation-pretrained, experimentally fine-tuned denoiser reduces the median reconstruction error on held-out few-sweep experimental traces to $0.44$-$0.67\times$ that of the raw experimental noisy traces across the three NV centers. A transformer-based estimator trained on simulation labels then predicts hyperfine parameters, and forward reconstruction from the inferred parameters reproduces the dominant experimental time- and frequency-domain features, with representative normalized FFT reconstruction errors of 0.10-0.19. These results establish NVRNet as a fast, hardware-compatible route to hyperfine inference from minimal Ramsey data.

NVRNet: Deep Learning Model for Fast Nitrogen Vacancy Characterization under Room Temperature

Abstract

spins in the environment. The denoiser is a two-stage time-frequency UNet followed by an attention-augmented time-domain UNet, pretrained on Hamiltonian-based simulations with experimentally calibrated noise. The simulation-pretrained, experimentally fine-tuned denoiser reduces the median reconstruction error on held-out few-sweep experimental traces to

that of the raw experimental noisy traces across the three NV centers. A transformer-based estimator trained on simulation labels then predicts hyperfine parameters, and forward reconstruction from the inferred parameters reproduces the dominant experimental time- and frequency-domain features, with representative normalized FFT reconstruction errors of 0.10-0.19. These results establish NVRNet as a fast, hardware-compatible route to hyperfine inference from minimal Ramsey data.

Paper Structure (31 sections, 65 equations, 14 figures, 3 tables)

This paper contains 31 sections, 65 equations, 14 figures, 3 tables.

Introduction
Physical Modeling and Data Generation
Lattice Structure Generation
Ramsey Signal Simulation
Noise Modeling
NVRNet
Denoising Network
Model structure
Training objective and optimization
Uncertainty adapters for simulation-to-reality fine-tuning
Denoising Result
Hyperfine Parameter Estimator
Model structure and training
Parameter Estimation Results
Evaluation on Auxiliary Encoders
...and 16 more sections

Figures (14)

Figure 1: Workflow of NVRNet. A Ramsey PL trace is collected under room temprature with single photon detector. Then it is first processed by the denoising module, which comprises a simulation-pretrained core augmented with uncertainty-aware adapter layers, fine-tuned on experimental traces. The resulting denoised trace is then passed to the parameter-estimation network, which predicts the ${}^{13}\mathrm{C}$ count (up to $n_{\max}=9$) and the associated parallel hyperfine couplings $A_{\parallel}$.
Figure 2: Comparison between experimental and simulated noise statistics and principal components. (a) Distributions in PL(%) for experimental (blue) and synthetic (pink) residuals. (b) PCA embedding of full traces: the simulated dataset (pink) spans the region occupied by the experimental data (blue) in the leading two principal components, indicating that simulation covers the dominant experimental variability.
Figure 3: Core of the denoising network. Stage 1 denoises the Fourier-domain representation and returns to the time domain. Stage 2 refines in the time domain using a UNet with a self-attention bottleneck to preserve long-range phase coherence and weak beating.
Figure 4: Adapters for experimental fine-tuning. A front adapter ingests the noisy PL trace and an uncertainty channel and produces the core input. The core, composed of a pretrained UNet, is frozen during fine-tuning. A back adapter maps the output to the experimental trace format.
Figure 5: Experimental denoising via simulation pretraining and adapter fine-tuning.(a--c) Example Ramsey PL(%) traces from three NV centers. Black: raw experimental input. Orange: 200-sweep fitted reference ($+10$ PL(%) offset). Green: simulation-trained core-only output ($-10$ PL(%) offset). Blue: core+adapter output after experimental fine-tuning. Offsets are for visualization only. (d) RMSE distributions (in PL percentage points) over all held-out few sweep test traces ($K\le 10$) for each NV, comparing the raw experimental baseline (orange), the simulation-trained core-only model (green), and the experimentally fine-tuned core+adapter model (blue). (e) RMSE versus number of averaged sweeps $K$ per inference.
...and 9 more figures

NVRNet: Deep Learning Model for Fast Nitrogen Vacancy Characterization under Room Temperature

Abstract

NVRNet: Deep Learning Model for Fast Nitrogen Vacancy Characterization under Room Temperature

Authors

Abstract

Table of Contents

Figures (14)