Learned Feature Importance Scores for Automated Feature Engineering
Yihe Dong, Sercan Arik, Nathanael Yoder, Tomas Pfister
TL;DR
AutoMAN addresses automated feature engineering by learning input-feature masks over a small, expert-curated transform space to optimize downstream tasks end-to-end. It avoids explicit enumeration of all transform-feature combinations by using local and global importance masks to implicitly discover useful engineered features, and it extends to time-series with learnable temporal masks. The method achieves state-of-the-art downstream performance with significantly lower latency compared with baselines, demonstrating scalability to large feature spaces and applicability across tabular and time-series data. The approach is grounded in theoretical motivation that a small basis of transforms can approximate broader function spaces, and it offers practical benefits for model deployment and maintenance.
Abstract
Feature engineering has demonstrated substantial utility for many machine learning workflows, such as in the small data regime or when distribution shifts are severe. Thus automating this capability can relieve much manual effort and improve model performance. Towards this, we propose AutoMAN, or Automated Mask-based Feature Engineering, an automated feature engineering framework that achieves high accuracy, low latency, and can be extended to heterogeneous and time-varying data. AutoMAN is based on effectively exploring the candidate transforms space, without explicitly manifesting transformed features. This is achieved by learning feature importance masks, which can be extended to support other modalities such as time series. AutoMAN learns feature transform importance end-to-end, incorporating a dataset's task target directly into feature engineering, resulting in state-of-the-art performance with significantly lower latency compared to alternatives.
