The Master Key Filters Hypothesis: Deep Filters Are General
Zahra Babaiee, Peyman M. Kiasari, Daniela Rus, Radu Grosu
TL;DR
The paper questions the assumption that deeper CNN layers learn increasingly task-specific filters, focusing on depthwise separable CNNs (DS-CNNs). It introduces the Master Key Filters Hypothesis, asserting that depthwise filters converge to general spatial primitives that transfer across datasets, domains, and architectures. Through extensive cross-domain, cross-architecture, and semantically split ImageNet experiments, the study shows deep depthwise filters retain generality, while pointwise filters exhibit optimization-related challenges. These findings have practical implications for transfer learning and model design, suggesting larger, more diverse training data can yield universally reusable spatial features.
Abstract
This paper challenges the prevailing view that convolutional neural network (CNN) filters become increasingly specialized in deeper layers. Motivated by recent observations of clusterable repeating patterns in depthwise separable CNNs (DS-CNNs) trained on ImageNet, we extend this investigation across various domains and datasets. Our analysis of DS-CNNs reveals that deep filters maintain generality, contradicting the expected transition to class-specific filters. We demonstrate the generalizability of these filters through transfer learning experiments, showing that frozen filters from models trained on different datasets perform well and can be further improved when sourced from larger datasets. Our findings indicate that spatial features learned by depthwise separable convolutions remain generic across all layers, domains, and architectures. This research provides new insights into the nature of generalization in neural networks, particularly in DS-CNNs, and has significant implications for transfer learning and model design.
