Bootstrapping OTS-Funcimg Pre-training Model (Botfip) -- A Comprehensive Symbolic Regression Framework
Tianhao Chen, Pengbo Xu, Haibiao Zheng
TL;DR
Botfip tackles the SR bottleneck by introducing a multimodal framework that aligns function-image representations with operation-tree sequences using a BLIP-inspired encoder-decoder. It combines a ViT-based Funcimg encoder with a BERT-like OTS encoder/decoder, shared weights, and contrastive/matching pretraining plus an L-BFGS refinement step for constants, achieving strong in-domain SR performance on simpler expressions. The approach demonstrates robustness and efficiency benefits for small-to-moderate SR problems, while highlighting extrapolation limitations and the need for more data and error-correction strategies to scale to complex expressions and broader AI-for-Math tasks. Overall, Botfip presents a promising direction for integrating multimodal signals into symbolic regression, potentially enabling larger models and broader scientific computing applications with improved generalization and speed.
Abstract
In the field of scientific computing, many problem-solving approaches tend to focus only on the process and final outcome, even in AI for science, there is a lack of deep multimodal information mining behind the data, missing a multimodal framework akin to that in the image-text domain. In this paper, we take Symbolic Regression(SR) as our focal point and, drawing inspiration from the BLIP model in the image-text domain, propose a scientific computing multimodal framework based on Function Images (Funcimg) and Operation Tree Sequence (OTS), named Bootstrapping OTS-Funcimg Pre-training Model (Botfip). In SR experiments, we validate the advantages of Botfip in low-complexity SR problems, showcasing its potential. As a MED framework, Botfip holds promise for future applications in a broader range of scientific computing problems.
