Table of Contents
Fetching ...

APAR: Modeling Irregular Target Functions in Tabular Regression via Arithmetic-Aware Pre-Training and Adaptive-Regularized Fine-Tuning

Hong-Wei Wu, Wei-Yao Wang, Kuang-Da Wang, Wen-Chih Peng

TL;DR

APAR tackles irregular target functions in tabular regression by introducing arithmetic-aware pre-training and adaptive-regularized fine-tuning for Transformer-based tabular models. The arithmetic-aware pre-training leverages continuous labels to learn inter-sample relationships, while adaptive regularization learns feature importance and data augmentation guided by feature correlations. Empirical results on 10 real-world datasets show consistent RMSE gains over GBDT and pretrain-finetune NN baselines, with additional insight into which arithmetic operations are most effective. The framework mitigates overfitting to uninformative features and provides a flexible end-to-end pretrain-finetune pipeline for tabular regression, with potential extensions to automatic operation selection and classification tasks.

Abstract

Tabular data are fundamental in common machine learning applications, ranging from finance to genomics and healthcare. This paper focuses on tabular regression tasks, a field where deep learning (DL) methods are not consistently superior to machine learning (ML) models due to the challenges posed by irregular target functions inherent in tabular data, causing sensitive label changes with minor variations from features. To address these issues, we propose a novel Arithmetic-Aware Pre-training and Adaptive-Regularized Fine-tuning framework (APAR), which enables the model to fit irregular target function in tabular data while reducing the negative impact of overfitting. In the pre-training phase, APAR introduces an arithmetic-aware pretext objective to capture intricate sample-wise relationships from the perspective of continuous labels. In the fine-tuning phase, a consistency-based adaptive regularization technique is proposed to self-learn appropriate data augmentation. Extensive experiments across 10 datasets demonstrated that APAR outperforms existing GBDT-, supervised NN-, and pretrain-finetune NN-based methods in RMSE (+9.43% $\sim$ 20.37%), and empirically validated the effects of pre-training tasks, including the study of arithmetic operations. Our code and data are publicly available at https://github.com/johnnyhwu/APAR.

APAR: Modeling Irregular Target Functions in Tabular Regression via Arithmetic-Aware Pre-Training and Adaptive-Regularized Fine-Tuning

TL;DR

APAR tackles irregular target functions in tabular regression by introducing arithmetic-aware pre-training and adaptive-regularized fine-tuning for Transformer-based tabular models. The arithmetic-aware pre-training leverages continuous labels to learn inter-sample relationships, while adaptive regularization learns feature importance and data augmentation guided by feature correlations. Empirical results on 10 real-world datasets show consistent RMSE gains over GBDT and pretrain-finetune NN baselines, with additional insight into which arithmetic operations are most effective. The framework mitigates overfitting to uninformative features and provides a flexible end-to-end pretrain-finetune pipeline for tabular regression, with potential extensions to automatic operation selection and classification tasks.

Abstract

Tabular data are fundamental in common machine learning applications, ranging from finance to genomics and healthcare. This paper focuses on tabular regression tasks, a field where deep learning (DL) methods are not consistently superior to machine learning (ML) models due to the challenges posed by irregular target functions inherent in tabular data, causing sensitive label changes with minor variations from features. To address these issues, we propose a novel Arithmetic-Aware Pre-training and Adaptive-Regularized Fine-tuning framework (APAR), which enables the model to fit irregular target function in tabular data while reducing the negative impact of overfitting. In the pre-training phase, APAR introduces an arithmetic-aware pretext objective to capture intricate sample-wise relationships from the perspective of continuous labels. In the fine-tuning phase, a consistency-based adaptive regularization technique is proposed to self-learn appropriate data augmentation. Extensive experiments across 10 datasets demonstrated that APAR outperforms existing GBDT-, supervised NN-, and pretrain-finetune NN-based methods in RMSE (+9.43% 20.37%), and empirically validated the effects of pre-training tasks, including the study of arithmetic operations. Our code and data are publicly available at https://github.com/johnnyhwu/APAR.

Paper Structure

This paper contains 25 sections, 18 equations, 3 figures, 5 tables, 3 algorithms.

Figures (3)

  • Figure 1: Illustrations of the impacts of irregular target functions commonly found in tabular regression tasks for finance (stock price prediction) and medical (health risk prediction) data. Small changes in features (marked in red) can lead to significant changes in the target variable.
  • Figure 2: Illustration of the Arithmetic-Aware Pre-Train phase of APAR. Sample pairs are processed through the Feature Tokenizer and Feature Encoder, the outputs of which are concatenated for arithmetic prediction, enabling the model to understand inter-sample relationships in tabular regression.
  • Figure 3: Illustration of the Adaptive Regularization Fine-Tuning phase of APAR. In this phase, an input sample is processed through the Feature Tokenizer to generate feature embeddings, which are augmented using a dynamically adaptive gate vector. The model is trained to predict consistent labels from varying inputs, which enhances the model's robustness to uninformative features and performance on the target task.