Stick-Breaking Mixture Normalizing Flows with Component-Wise Tail Adaptation for Variational Inference
Seungsu Han, Juyoung Hwang, Won Chang
TL;DR
The paper tackles posterior inference for highly multimodal and heavy-tailed distributions by replacing the standard Gaussian base in normalizing-flow variational inference with a stick-breaking mixture base, thereby reducing mode-seeking bias. It introduces a Monte Carlo tail-index estimator to guide per-component tail adaptivity and develops component-wise Tail Transform Flows to calibrate tails while preserving exact density evaluation. Empirical results on synthetic targets and real wind-speed data show that the resulting method, StiCTAF, achieves near-MCMC accuracy in both bulk structure and tails, with lower forward KL and better tail calibration than the baselines. This approach enables more faithful and efficient posterior inference in complex Bayesian models where multimodality and heavy tails are prominent.
Abstract
Normalizing flows with a Gaussian base provide a computationally efficient way to approximate posterior distributions in Bayesian inference, but they often struggle to capture complex posteriors with multimodality and heavy tails. We propose a stick-breaking mixture base with component-wise tail adaptation (StiCTAF) for posterior approximation. The method first learns a flexible mixture base to mitigate the mode-seeking bias of reverse KL divergence through a weighted average of component-wise ELBOs. It then estimates local tail indices of unnormalized densities and finally refines each mixture component using a shared backbone combined with component-specific tail transforms calibrated by the estimated indices. This design enables accurate mode coverage and anisotropic tail modeling while retaining exact density evaluation and stable optimization. Experiments on synthetic posteriors demonstrate improved tail recovery and better coverage of multiple modes compared to benchmark models. We also present a real-data analysis illustrating the practical benefits of our approach for posterior inference.
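To make the idea of a stick-breaking mixture base concrete, the following minimal sketch shows the standard stick-breaking construction of mixture weights and an exact log-density evaluation for the resulting mixture. It is an illustration under stated assumptions, not the paper's implementation: the one-dimensional Gaussian components, the fixed break fractions, and the function names (stick_breaking_weights, sample_mixture_base, mixture_log_density) are all hypothetical choices made for this example. In the method itself the base is learned and each component is further refined by a flow.

```python
import math
import random


def stick_breaking_weights(fractions):
    """Map K-1 break fractions in (0, 1) to K mixture weights that sum to 1."""
    weights = []
    remaining = 1.0  # length of the stick not yet assigned
    for v in fractions:
        weights.append(v * remaining)  # break off a fraction v of what is left
        remaining *= 1.0 - v
    weights.append(remaining)  # the last component takes the remainder
    return weights


def sample_mixture_base(weights, means, scales, rng):
    """Draw (component index, sample) from a 1-D Gaussian mixture.

    Gaussian components are an assumption made for this illustration.
    """
    k = rng.choices(range(len(weights)), weights=weights)[0]
    return k, rng.gauss(means[k], scales[k])


def mixture_log_density(x, weights, means, scales):
    """Exact log density of the mixture, computed with log-sum-exp."""
    terms = [
        math.log(w) - math.log(s) - 0.5 * math.log(2.0 * math.pi)
        - 0.5 * ((x - m) / s) ** 2
        for w, m, s in zip(weights, means, scales)
    ]
    top = max(terms)
    return top + math.log(sum(math.exp(t - top) for t in terms))


# Three break fractions yield four components.
weights = stick_breaking_weights([0.5, 0.5, 0.5])
means = [-6.0, -2.0, 2.0, 6.0]   # hypothetical mode locations
scales = [1.0, 1.0, 1.0, 1.0]    # hypothetical component scales

rng = random.Random(0)
component, draw = sample_mixture_base(weights, means, scales, rng)
log_q = mixture_log_density(draw, weights, means, scales)
```

Because every weight is a product of fractions in (0, 1), the weights are positive and sum to one by construction, so the fractions can be optimized without an explicit simplex constraint. The mixture density remains available in closed form, which is consistent with the abstract's emphasis on retaining exact density evaluation.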
