H.265/HEVC Video Steganalysis Based on CU Block Structure Gradients and IPM Mapping
Xiang Zhang, Haiyang Xia, Ziwen He, Wenbin Huang, Fei Peng, Zhangjie Fu
TL;DR
The paper tackles the challenge of detecting CU-block level steganography in H.265/HEVC videos by shifting focus from pixel-domain statistics to coding-structure perturbations. It proposes a joint representation built from CU block-structure gradients and intra prediction mode (IPM) mappings, fused and processed by GradIPMFormer, a Transformer-based detector. The approach achieves superior detection across multiple CU-based steganographic methods, resolutions, and quality settings, and demonstrates robustness to mixed conditions and cross-domain scenarios. This work provides a new CU-level analysis paradigm with practical implications for video security and covert-communication detection.
Abstract
Existing H.265/HEVC video steganalysis research mainly focuses on statistical feature modeling at the levels of motion vectors (MV), intra prediction modes (IPM), or transform coefficients. In contrast, studies targeting the coding-structure level - especially the analysis of block-level steganographic behaviors in Coding Units (CUs) - remain at an early stage. As a core component of H.265/HEVC coding decisions, the CU partition structure often exhibits steganographic perturbations in the form of structural changes and reorganization of prediction relationships, which are difficult to characterize effectively using traditional pixel-domain features or mode statistics. To address this issue, this paper, for the first time from the perspective of CU block-level steganalysis, proposes an H.265/HEVC video steganalysis method based on CU block-structure gradients and intra prediction mode mapping. The proposed method constructs a CU block-structure gradient map to explicitly describe changes in coding-unit partitioning, and combines it with a block-level mapping representation of IPM to jointly model the structural perturbations introduced by CU-level steganographic embedding. On this basis, we design a Transformer network, GradIPMFormer, tailored for CU-block steganalysis, thereby effectively enhancing the capability to perceive CU-level steganographic behaviors. Experimental results show that under different quantization parameters and resolution settings, the proposed method consistently achieves superior detection performance across multiple H.265/HEVC steganographic algorithms, validating the feasibility and effectiveness of conducting video steganalysis from the coding-structure perspective. This study provides a new CU block-level analysis paradigm for H.265/HEVC video steganalysis and has significant research value for covert communication security detection.
