GPU-Accelerated X-ray Pulse Profile Modeling
Tianzhe Zhou, Chun Huang
TL;DR
This work presents a public GPU-accelerated X-ray pulse-profile modeling framework that overcomes the long-standing accuracy-speed bottleneck in neutron-star forward modeling. By combining an oblate--Schwarzschild spacetime treatment, atmosphere-based beaming from NSX-H tables, full-surface temperature maps, and NICER-style instrument response, the authors achieve $\sim 2$--$5~\mathrm{ms}$ per evaluation with $10^{3}$--$10^{4}\times$ speedups on a modern GPU while maintaining $\sim 10^{-3}$ relative waveform accuracy against benchmarks. A key contribution is the identification and mitigation of a bias from atmosphere-table interpolation near grid boundaries, via a mixed-order interpolation that enforces linearity at grazing angles. The GPU framework enables high-resolution, comprehensive hotspot modeling and robust Bayesian inference on accessible hardware, opening the door to reanalyzing NICER targets with physically motivated temperature maps and broader parameter exploration. Overall, this work significantly enhances the practicality and reliability of PPM for current and future X-ray missions.
Abstract
Pulse-profile modeling (PPM) of thermal X-ray emission from rotation-powered millisecond pulsars enables simultaneous constraints on the mass $M$, radius $R$, and hence the equation of state of cold, dense matter. However, Bayesian PPM has faced a hard accuracy-speed bottleneck: current production resolutions used to keep inference tractable can under-resolve extreme hotspot geometries and bias the waveform computation, whereas the higher resolutions that remove this bias push forward models to minutes per evaluation, making inference impractical. We break this trade-off with, to our knowledge, the first public GPU-accelerated X-ray PPM framework that matches established benchmarks to within $\sim10^{-3}$ relative accuracy even for extreme geometries, while collapsing minutes-long high-fidelity computations to $2$--$5$ ms on an RTX 4080 ($10^{3}$--$10^{4}\times$ speedups), enabling posterior exploration at resolutions and complexities previously out of reach. We further uncover a bias near the interpolation boundaries of atmosphere lookup tables, demonstrate it with two diagnostic tests, and counter it with a mixed-order interpolator. Together, these advances enlarge the feasible hotspot model space and reduce key systematics in PPM, strengthening inferences for current and future X-ray missions.
