Negative to Positive Co-learning with Aggressive Modality Dropout
Nicholas Magal, Minh Tran, Riku Arakawa, Suzanne Nie
TL;DR
The paper tackles the problem of negative co-learning (NCL) in multimodal models when modalities are unavailable at test time. It introduces aggressive modality dropout during training to force reliance on multiple modalities and to prepare for unimodal deployment, enabling reversal of NCL to positive co-learning (PCL). Evaluations with bi-EFLSTM and Memory Fusion Network on IEMOCAP and MOSI show that high dropout (around 0.8) on non-language modalities can dramatically boost unimodal performance under NCL, with more modest gains in PCL. The results suggest that modality dropout enhances robustness for deployment scenarios with missing modalities and offers guidance for future work on dropout levels and modality selection.
Abstract
This paper aims to document an effective way to improve multimodal co-learning by using aggressive modality dropout. We find that by using aggressive modality dropout we are able to reverse negative co-learning (NCL) to positive co-learning (PCL). Aggressive modality dropout can be used to "prep" a multimodal model for unimodal deployment, and dramatically increases model performance during negative co-learning, where during some experiments we saw a 20% gain in accuracy. We also benchmark our modality dropout technique against PCL to show that our modality drop out technique improves co-learning during PCL, although it does not have as much as an substantial effect as it does during NCL. Github: https://github.com/nmagal/modality_drop_for_colearning
