FAMOSE: A ReAct Approach to Automated Feature Discovery
Keith Burghardt, Jienan Liu, Sadman Sakib, Yuning Hao, Bo Li
TL;DR
FAMOSE tackles the bottleneck of feature engineering for tabular data by introducing a ReAct-driven autonomous feature discovery framework. It iteratively proposes, tests, and refines candidate features with data-driven evaluation and concludes with a compact, non-redundant feature set via $mRMR$. On a diverse banner of 20 classification and 7 regression tasks, FAMOSE achieves near-state-of-the-art ROC-AUC on large-classification datasets (average gain $0.23\%$) and state-of-the-art RMSE reductions of $2.0\%$, with robustness across multiple LLMs and predictive models. The work demonstrates that AI agents can perform inventive, interpretable feature engineering in an automated loop, potentially accelerating end-to-end ML pipelines while reducing reliance on deep domain expertise.
Abstract
Feature engineering remains a critical yet challenging bottleneck in machine learning, particularly for tabular data, as identifying optimal features from an exponentially large feature space traditionally demands substantial domain expertise. To address this challenge, we introduce FAMOSE (Feature AugMentation and Optimal Selection agEnt), a novel framework that leverages the ReAct paradigm to autonomously explore, generate, and refine features while integrating feature selection and evaluation tools within an agent architecture. To our knowledge, FAMOSE represents the first application of an agentic ReAct framework to automated feature engineering, especially for both regression and classification tasks. Extensive experiments demonstrate that FAMOSE is at or near the state-of-the-art on classification tasks (especially tasks with more than 10K instances, where ROC-AUC increases 0.23% on average), and achieves the state-of-the-art for regression tasks by reducing RMSE by 2.0% on average, while remaining more robust to errors than other algorithms. We hypothesize that FAMOSE's strong performance is because ReAct allows the LLM context window to record (via iterative feature discovery and evaluation steps) what features did or did not work. This is similar to a few-shot prompt and guides the LLM to invent better, more innovative features. Our work offers evidence that AI agents are remarkably effective in solving problems that require highly inventive solutions, such as feature engineering.
