Optimization of array encoding for ultrasound imaging
Jacob Spainhour, Korben Smart, Stephen Becker, Nick Bottenus
TL;DR
The paper addresses the challenge of improving ultrasound image quality in synthetic aperture imaging by learning transmit encoding sequences that are parameterized by time delays $t_{MT}$ and apodization weights $w_{MT}$ within the REFoCUS framework. It introduces a differentiable end-to-end pipeline where a frequency-domain encoding matrix $\mathbf{H}(\omega)$ links the ground-truth multistatic data $\mathbf{U}(\omega)$ to the transmitted responses $\mathbf{S}(\omega)$, and a Tikhonov decoder reconstructs $\widehat{\boldsymbol{\mathcal{U}}}$ prior to delay-and-sum beamforming to form image $|\mathcal{B}(\boldsymbol{\mathcal{H}}_{\boldsymbol{\sigma}}^\dagger \boldsymbol{\mathcal{S}})|_{\text{Im}}$. By backpropagating through a differentiable DAS beamformer, the model learns encodings that minimize an image-based loss $\mathcal{L}$, demonstrated on Field II simulations and validated experimentally with a tissue-mimicking phantom and a wire target. Results show significant improvements in image quality metrics (e.g., $\mathcal{L}$ against ground-truth contrast, gCNR, and cystic resolution) over conventional planewave and Hadamard encodings, with robustness to noise when trained accordingly. The work reveals that common encoding schemes occupy only a subset of possible sequences and that jointly optimized delays and apodizations can generalize across imaging targets, offering a path toward higher SNR and more uniform PSFs in clinical ultrasound.
Abstract
Objective: The transmit encoding model for synthetic aperture imaging is a robust and flexible framework for understanding the effects of acoustic transmission on ultrasound image reconstruction. Our objective is to use machine learning (ML) to construct scanning sequences, parameterized by time delays and apodization weights, that produce high-quality B-mode images. Approach: We use a custom ML model in PyTorch with simulated RF data from Field II to probe the space of possible encoding sequences for those that minimize a loss function that describes image quality. This approach is made computationally feasible by a novel formulation of the derivative for delay-and-sum beamforming. Main Results: When trained for a specified experimental setting (imaging domain, hardware restrictions, etc.), our ML model produces optimized encoding sequences that, when deployed in the REFoCUS imaging framework, improve a number of standard quality metrics over conventional sequences including resolution, field of view, and contrast. We demonstrate these results experimentally on both wire targets and a tissue-mimicking phantom. Significance: This work demonstrates that the set of commonly used encoding schemes represent only a narrow subset of those available. Additionally, it demonstrates the value for ML tasks in synthetic transmit aperture imaging to consider the beamformer within the model, instead of purely as a post-processing step.
