Representation and De-interleaving of Mixtures of Hidden Markov Processes
Jiadi Bao, Mengtao Zhu, Yunjie Li, Shafei Wang
TL;DR
This work introduces Interleaved Hidden Markov Processes (IHMP) to robustly represent mixtures of hidden Markov sequences observed in interleaved fashion. It develops an exact EM framework and two scalable variational inference methods (MFVI and SVI) to perform posterior inference and parameter learning under noisy and incomplete data, while also deriving a likelihood-ratio based error bound to gauge optimality. The approach reduces the alphabet-partition search space compared to prior IMP models and demonstrates strong de-interleaving performance on synthetic data and real-world tasks such as radar pulse sequence analysis and human motion separation. The results show that SVI often delivers the best accuracy-efficiency trade-off, with strong robustness to missing observations and non-ideal conditions, highlighting practical impact for time-series demixing in complex sensing systems.
Abstract
De-interleaving of the mixtures of Hidden Markov Processes (HMPs) generally depends on its representation model. Existing representation models consider Markov chain mixtures rather than hidden Markov, resulting in the lack of robustness to non-ideal situations such as observation noise or missing observations. Besides, de-interleaving methods utilize a search-based strategy, which is time-consuming. To address these issues, this paper proposes a novel representation model and corresponding de-interleaving methods for the mixtures of HMPs. At first, a generative model for representing the mixtures of HMPs is designed. Subsequently, the de-interleaving process is formulated as a posterior inference for the generative model. Secondly, an exact inference method is developed to maximize the likelihood of the complete data, and two approximate inference methods are developed to maximize the evidence lower bound by creating tractable structures. Then, a theoretical error probability lower bound is derived using the likelihood ratio test, and the algorithms are shown to get reasonably close to the bound. Finally, simulation results demonstrate that the proposed methods are highly effective and robust for non-ideal situations, outperforming baseline methods on simulated and real-life data.
