Table of Contents
Fetching ...

Safe Bayesian Optimization for the Control of High-Dimensional Embodied Systems

Yunyue Wei, Zeji Yi, Hongda Li, Saraswati Soedarmadji, Yanan Sui

TL;DR

HdSafeBO tackles safe optimization of high-dimensional embodied-control policies by marrying isometric dimension reduction with a local, optimistic safety strategy and a trust-region search. It provides probabilistic safety guarantees and a bound on cumulative safety violations while enabling efficient learning even when the input space spans hundreds to thousands of dimensions. The approach is validated on synthetic benchmarks, a high-dimensional musculoskeletal control task, and neural-stimulation induced human motion, where it outperforms strong baselines in both objective quality and safety metrics. This framework advances practical online safe optimization for complex, high-dimensional robotic systems and human-robot interaction scenarios.

Abstract

Learning to move is a primary goal for animals and robots, where ensuring safety is often important when optimizing control policies on the embodied systems. For complex tasks such as the control of human or humanoid control, the high-dimensional parameter space adds complexity to the safe optimization effort. Current safe exploration algorithms exhibit inefficiency and may even become infeasible with large high-dimensional input spaces. Furthermore, existing high-dimensional constrained optimization methods neglect safety in the search process. In this paper, we propose High-dimensional Safe Bayesian Optimization with local optimistic exploration (HdSafeBO), a novel approach designed to handle high-dimensional sampling problems under probabilistic safety constraints. We introduce a local optimistic strategy to efficiently and safely optimize the objective function, providing a probabilistic safety guarantee and a cumulative safety violation bound. Through the use of isometric embedding, HdSafeBO addresses problems ranging from a few hundred to several thousand dimensions while maintaining safety guarantees. To our knowledge, HdSafeBO is the first algorithm capable of optimizing the control of high-dimensional musculoskeletal systems with high safety probability. We also demonstrate the real-world applicability of HdSafeBO through its use in the safe online optimization of neural stimulation induced human motion control.

Safe Bayesian Optimization for the Control of High-Dimensional Embodied Systems

TL;DR

HdSafeBO tackles safe optimization of high-dimensional embodied-control policies by marrying isometric dimension reduction with a local, optimistic safety strategy and a trust-region search. It provides probabilistic safety guarantees and a bound on cumulative safety violations while enabling efficient learning even when the input space spans hundreds to thousands of dimensions. The approach is validated on synthetic benchmarks, a high-dimensional musculoskeletal control task, and neural-stimulation induced human motion, where it outperforms strong baselines in both objective quality and safety metrics. This framework advances practical online safe optimization for complex, high-dimensional robotic systems and human-robot interaction scenarios.

Abstract

Learning to move is a primary goal for animals and robots, where ensuring safety is often important when optimizing control policies on the embodied systems. For complex tasks such as the control of human or humanoid control, the high-dimensional parameter space adds complexity to the safe optimization effort. Current safe exploration algorithms exhibit inefficiency and may even become infeasible with large high-dimensional input spaces. Furthermore, existing high-dimensional constrained optimization methods neglect safety in the search process. In this paper, we propose High-dimensional Safe Bayesian Optimization with local optimistic exploration (HdSafeBO), a novel approach designed to handle high-dimensional sampling problems under probabilistic safety constraints. We introduce a local optimistic strategy to efficiently and safely optimize the objective function, providing a probabilistic safety guarantee and a cumulative safety violation bound. Through the use of isometric embedding, HdSafeBO addresses problems ranging from a few hundred to several thousand dimensions while maintaining safety guarantees. To our knowledge, HdSafeBO is the first algorithm capable of optimizing the control of high-dimensional musculoskeletal systems with high safety probability. We also demonstrate the real-world applicability of HdSafeBO through its use in the safe online optimization of neural stimulation induced human motion control.
Paper Structure (35 sections, 6 theorems, 17 equations, 8 figures, 7 tables, 2 algorithms)

This paper contains 35 sections, 6 theorems, 17 equations, 8 figures, 7 tables, 2 algorithms.

Key Result

Proposition 4.1

Let Assumptions assum: rkhs holds for the latent safety function $g$, and set $\beta_t$ satisfying $\Phi(\beta_t)\le 1-\alpha$. Then at every time step $t$: where $\Phi(\cdot)$ is the cumulative distribution function (CDF) of the standard normal distribution $\mathcal{N}(0, 1)$.

Figures (8)

  • Figure 1: Workflow of HdSafeBO. Local optimistic safe optimization is employed to efficiently optimize the objective function while guaranteeing probabilistic safety. We utilize isometric embedding to reduce the problem dimension to deal with high-dimensional inputs.
  • Figure 2: Optimization for the control of a musculoskeletal system. (a) Task illustration. (b) Optimization performance averaged over 50 independent runs. Arrows indicate the better direction.
  • Figure 3: Optimization for the control of neural stimulation induced human motion. (a) Task illustration. IL, RF, TA, BF, ST, GA are different group of target muscle on the lower limb. (b) Optimization performance on the control of neuromuscular model for semitendinosus (ST), and gastrocnemius (GA), averaged over 10 independent runs. Arrows indicate the better direction.
  • Figure 4: Distribution of SI for six muscle groups of different configurations used in SCS simulation experiment
  • Figure 5: 2d electrical map computation.
  • ...and 3 more figures

Theorems & Definitions (10)

  • Proposition 4.1
  • Theorem 4.2
  • Proposition 4.3
  • proof
  • Lemma A.1
  • Lemma A.2
  • proof
  • Lemma A.3
  • proof
  • proof