ShadowBinding: Realizing Effective Microarchitectures for In-Core Secure Speculation Schemes

Amund Bergland Kvalsvik; Magnus Själander

ShadowBinding: Realizing Effective Microarchitectures for In-Core Secure Speculation Schemes

Amund Bergland Kvalsvik, Magnus Själander

TL;DR

ShadowBinding provides RTL-based microarchitectural designs for two in-core secure speculation schemes, NDA and STT, and demonstrates that in-core security incurs substantial, architecture-dependent costs. It shows that STA-Rename introduces a single-cycle YRoT dependency chain, while STT-Issue delays tainting to the issue stage to mitigate this, and that NDA offers a simpler, more timing-friendly alternative. RTL experiments on the RISC-V BOOM and comparisons with gem5 reveal IPC losses of approximately $18.1 ext{ extpercent}$ (STT-Rename), $15.5 ext{ extpercent}$ (STT-Issue), and $26.4 ext{ extpercent}$ (NDA), with overall performance slowdowns up to $34.5 ext{ extpercent}$ for STT-Rename and $26.8 ext{ extpercent}$ (STT-Issue) and $21.5 ext{ extpercent}$ (NDA) on the highest-performance core; extrapolation suggests even larger costs for leading processors. The findings challenge prior simulator-based estimates and highlight the need for careful microarchitectural design and evaluation when adopting secure speculation schemes. The work argues that NDA may currently offer the best practical balance among in-core defenses and emphasizes the importance of detailed hardware evaluation for industry adoption and sustainability considerations.

Abstract

Secure speculation schemes have shown great promise in the war against speculative side-channel attacks, and will be a key building block for developing secure, high-performance architectures moving forward. As the field matures, the need for rigorous microarchitectures, and corresponding performance and cost analysis, become critical for evaluating secure schemes and for enabling their future adoption. In ShadowBinding, we present effective microarchitectures for two state-of-the-art secure schemes, uncovering and mitigating fundamental microarchitectural limitations within the analyzed schemes, and provide important design characteristics. We uncover that Speculative Taint Tracking's (STT's) rename-based taint computation must be completed in a single cycle, creating an expensive dependency chain which greatly limits performance for wider processor cores. We also introduce a novel michroarchitectural approach for STT, named STT-Issue, which, by delaying the taint computation to the issue stage, eliminates the dependency chain, achieving better instructions per cycle (IPC), timing, area, and performance results. Through a comprehensive evaluation of our STT and Non-Speculative Data Access (NDA) microarchitectural designs on the RISC-V Berkeley Out-of-Order Machine, we find that the IPC impact of in-core secure schemes is higher than previously estimated, close to 20% for the highest performance core. With insights into timing from our RTL evaluation, the performance loss, created by the combined impact of IPC and timing, becomes even greater, at 35%, 27%, and 22% for STT-Rename, STT-Issue, and NDA, respectively. If these trends were to hold for leading processor core designs, the performance impact would be well over 30%, even for the best-performing scheme.

ShadowBinding: Realizing Effective Microarchitectures for In-Core Secure Speculation Schemes

TL;DR

Abstract

ShadowBinding: Realizing Effective Microarchitectures for In-Core Secure Speculation Schemes

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (10)