Table of Contents
Fetching ...

MDM: Advancing Multi-Domain Distribution Matching for Automatic Modulation Recognition Dataset Synthesis

Dongwei Xu, Jiajun Chen, Yao Lu, Tianhao Xia, Qi Xuan, Wei Wang, Yun Lin, Xiaoniu Yang

TL;DR

This work tackles the data scarcity challenge in automatic modulation recognition (AMR) by introducing Multi-domain Distribution Matching (MDM), a dataset distillation method that jointly leverages time-domain and frequency-domain information. By transforming signals with the discrete Fourier transform and applying distribution matching losses in both domains, MDM updates a small synthetic dataset using the combined objective $L=L^{TD}+\alpha L^{FD}$. Experiments on three AMR datasets show that MDM outperforms baseline condensation methods and generalizes well across unseen architectures, highlighting the benefit of incorporating frequency-domain structure for AMR data synthesis. The approach holds practical significance for reducing storage and training costs while maintaining high recognition performance across diverse models and datasets.

Abstract

Recently, deep learning technology has been successfully introduced into Automatic Modulation Recognition (AMR) tasks. However, the success of deep learning is all attributed to the training on large-scale datasets. Such a large amount of data brings huge pressure on storage, transmission and model training. In order to solve the problem of large amount of data, some researchers put forward the method of data distillation, which aims to compress large training data into smaller synthetic datasets to maintain its performance. While numerous data distillation techniques have been developed within the realm of image processing, the unique characteristics of signals set them apart. Signals exhibit distinct features across various domains, necessitating specialized approaches for their analysis and processing. To this end, a novel dataset distillation method--Multi-domain Distribution Matching (MDM) is proposed. MDM employs the Discrete Fourier Transform (DFT) to translate timedomain signals into the frequency domain, and then uses a model to compute distribution matching losses between the synthetic and real datasets, considering both the time and frequency domains. Ultimately, these two losses are integrated to update the synthetic dataset. We conduct extensive experiments on three AMR datasets. Experimental results show that, compared with baseline methods, our method achieves better performance under the same compression ratio. Furthermore, we conduct crossarchitecture generalization experiments on several models, and the experimental results show that our synthetic datasets can generalize well on other unseen models.

MDM: Advancing Multi-Domain Distribution Matching for Automatic Modulation Recognition Dataset Synthesis

TL;DR

This work tackles the data scarcity challenge in automatic modulation recognition (AMR) by introducing Multi-domain Distribution Matching (MDM), a dataset distillation method that jointly leverages time-domain and frequency-domain information. By transforming signals with the discrete Fourier transform and applying distribution matching losses in both domains, MDM updates a small synthetic dataset using the combined objective . Experiments on three AMR datasets show that MDM outperforms baseline condensation methods and generalizes well across unseen architectures, highlighting the benefit of incorporating frequency-domain structure for AMR data synthesis. The approach holds practical significance for reducing storage and training costs while maintaining high recognition performance across diverse models and datasets.

Abstract

Recently, deep learning technology has been successfully introduced into Automatic Modulation Recognition (AMR) tasks. However, the success of deep learning is all attributed to the training on large-scale datasets. Such a large amount of data brings huge pressure on storage, transmission and model training. In order to solve the problem of large amount of data, some researchers put forward the method of data distillation, which aims to compress large training data into smaller synthetic datasets to maintain its performance. While numerous data distillation techniques have been developed within the realm of image processing, the unique characteristics of signals set them apart. Signals exhibit distinct features across various domains, necessitating specialized approaches for their analysis and processing. To this end, a novel dataset distillation method--Multi-domain Distribution Matching (MDM) is proposed. MDM employs the Discrete Fourier Transform (DFT) to translate timedomain signals into the frequency domain, and then uses a model to compute distribution matching losses between the synthetic and real datasets, considering both the time and frequency domains. Ultimately, these two losses are integrated to update the synthetic dataset. We conduct extensive experiments on three AMR datasets. Experimental results show that, compared with baseline methods, our method achieves better performance under the same compression ratio. Furthermore, we conduct crossarchitecture generalization experiments on several models, and the experimental results show that our synthetic datasets can generalize well on other unseen models.
Paper Structure (15 sections, 11 equations, 2 figures, 2 tables, 1 algorithm)

This paper contains 15 sections, 11 equations, 2 figures, 2 tables, 1 algorithm.

Figures (2)

  • Figure 1: The framework of Multi-domain Distribution Matching.
  • Figure 2: Visualization of condensed 50 signals/class using AlexNet on RML2016.10a-high.