From Narrow to Wide: Autoencoding Transformers for Ultrasound Bandwidth Recovery
Sepideh KhakzadGharamaleki, Hassan Rivaz, Brandon Helfield
TL;DR
This work tackles the limited axial resolution of narrow‑band ultrasound by learning a spectral super‑resolution mapping from bandlimited RF data to broadband content. A Tiny Vision Transformer–based auto‑encoder operates on STFT spectrograms of RF lines, trained on synthetic Field II data with a curriculum‑weighted loss to recover missing high‑frequency components while preserving phase. The model achieves substantial gains (e.g., PSNR gains up to +6.7 dB and SSIM near 0.996) on speckle phantoms and generalizes to unseen resolution phantoms, suggesting it can retrofit existing narrow‑band probes with broadband‑like performance without hardware changes. This software‑driven approach holds promise for higher‑resolution ultrasound in resource‑constrained settings, with caveats related to amplitude estimation and computational memory that guide future improvements.
Abstract
Conventional pulse-echo ultrasound suffers when low-cost probes deliver only narrow fractional bandwidths, elongating pulses and erasing high-frequency detail. We address this limitation by learning a data-driven mapping from band-limited to broadband spectrogram of radio-frequency (RF) lines. To this end, a variation of Tiny Vision Transform (ViT) auto-encoder is trained on simulation data using a curriculum-weighted loss. On heterogeneous speckle-cyst phantoms, the network reduces image-domain MSE by 90 percent, boosts PSNR by 6.7 dB, and raises SSIM to 0.965 compared with the narrow-band input. It also sharpens point-target rows in a completely unseen resolution phantom, demonstrating strong out-of-distribution generalisation without sacrificing frame rate or phase information. These results indicate that a purely software upgrade can endow installed narrow-band probes with broadband-like performance, potentially widening access to high-resolution ultrasound in resource-constrained settings.
