DUA-D2C: Dynamic Uncertainty Aware Method for Overfitting Remediation in Deep Learning

Md. Saiful Bari Siddiqui; Md Mohaiminul Islam; Md. Golam Rabiul Alam

DUA-D2C: Dynamic Uncertainty Aware Method for Overfitting Remediation in Deep Learning

Md. Saiful Bari Siddiqui, Md Mohaiminul Islam, Md. Golam Rabiul Alam

TL;DR

DUA-D2C tackles deep-learning overfitting by partitioning data into $N$ shards, training edge models, and dynamically weighting their contributions on a shared validation set using both accuracy $a_i$ and uncertainty $u_i$. The central model is updated as a weighted sum $\theta_c \leftarrow \sum_i \alpha_i \theta'_i$, with $\alpha_i$ derived from $s_i = \lambda a_i + (1-\lambda) u_i$, enabling variance reduction beyond uniform averaging. The framework is validated across image, audio, and text tasks, showing improved generalization, delayed overfitting, and smoother decision boundaries, while also allowing augmentation to further bolster performance. While acknowledging increased training time, the approach demonstrates robust gains and compatibility with existing regularization techniques, making it a practical tool for generalization in modern deep learning, especially in large-scale settings.

Abstract

Overfitting remains a significant challenge in deep learning, often arising from data outliers, noise, and limited training data. To address this, the Divide2Conquer (D2C) method was previously proposed, which partitions training data into multiple subsets and trains identical models independently on each. This strategy enables learning more consistent patterns while minimizing the influence of individual outliers and noise. However, D2C's standard aggregation typically treats all subset models equally or based on fixed heuristics (like data size), potentially underutilizing information about their varying generalization capabilities. Building upon this foundation, we introduce Dynamic Uncertainty-Aware Divide2Conquer (DUA-D2C), an advanced technique that refines the aggregation process. DUA-D2C dynamically weights the contributions of subset models based on their performance on a shared validation set, considering both accuracy and prediction uncertainty. This intelligent aggregation allows the central model to preferentially learn from subsets yielding more generalizable and confident edge models, thereby more effectively combating overfitting. Empirical evaluations on benchmark datasets spanning multiple domains demonstrate that DUA-D2C significantly improves generalization. Our analysis includes evaluations of decision boundaries, loss curves, and other performance metrics, highlighting the effectiveness of DUA-D2C. This study demonstrates that DUA-D2C improves generalization performance even when applied on top of other regularization methods, establishing it as a theoretically grounded and effective approach to combating overfitting in modern deep learning. Our codes are publicly available at: https://github.com/Saiful185/DUA-D2C.

DUA-D2C: Dynamic Uncertainty Aware Method for Overfitting Remediation in Deep Learning

TL;DR

Abstract

DUA-D2C: Dynamic Uncertainty Aware Method for Overfitting Remediation in Deep Learning

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (15)