Reinforcement Feature Transformation for Polymer Property Performance Prediction

Xuanming Hu; Dongjie Wang; Wangyang Ying; Yanjie Fu

Reinforcement Feature Transformation for Polymer Property Performance Prediction

Xuanming Hu, Dongjie Wang, Wangyang Ying, Yanjie Fu

TL;DR

This work tackles polymer property performance prediction under data quality constraints by introducing GRFG, a framework that automatically reconstructs an explainable descriptor space through group-wise reinforcement learning and descriptor crossing. By partitioning descriptors into groups, employing three cascading agents, and utilizing two generation strategies, GRFG learns policies that maximize predictive performance while preserving interpretability. Empirical results on a 20-descriptor polymer dataset show GRFG outperforms multiple baselines and remains robust across different downstream models, with case studies illustrating the traceable, semantically meaningful descriptors produced. The approach offers a scalable, explainable pathway to improve polymer property predictions and can be extended to other material science tasks.

Abstract

Polymer property performance prediction aims to forecast specific features or attributes of polymers, which has become an efficient approach to measuring their performance. However, existing machine learning models face challenges in effectively learning polymer representations due to low-quality polymer datasets, which consequently impact their overall performance. This study focuses on improving polymer property performance prediction tasks by reconstructing an optimal and explainable descriptor representation space. Nevertheless, prior research such as feature engineering and representation learning can only partially solve this task since they are either labor-incentive or unexplainable. This raises two issues: 1) automatic transformation and 2) explainable enhancement. To tackle these issues, we propose our unique Traceable Group-wise Reinforcement Generation Perspective. Specifically, we redefine the reconstruction of the representation space into an interactive process, combining nested generation and selection. Generation creates meaningful descriptors, and selection eliminates redundancies to control descriptor sizes. Our approach employs cascading reinforcement learning with three Markov Decision Processes, automating descriptor and operation selection, and descriptor crossing. We utilize a group-wise generation strategy to explore and enhance reward signals for cascading agents. Ultimately, we conduct experiments to indicate the effectiveness of our proposed framework.

Reinforcement Feature Transformation for Polymer Property Performance Prediction

TL;DR

Abstract

Paper Structure (19 sections, 6 equations, 6 figures, 2 tables)

This paper contains 19 sections, 6 equations, 6 figures, 2 tables.

Introduction
Related Works
Preliminaries
Important Definitions
Problem Statement
Methodology
Framework Overview
Descriptor Clustering for Generation
Selection of Descriptors and Operation
Group-wise Descriptor Generation
Experiments
Experimental Setup
Experimental Result
Overall Comparison
Ablation Study
...and 4 more sections

Figures (6)

Figure 1: An illustration of automated transformation. We identify the optimal descriptor space that is explicit and interpretable and performs optimally in polymer property performance prediction by iteratively reconstructing the descriptor space.
Figure 2: An overview of our proposed framework.
Figure 3: The cascading agents include the descriptor group agent 1, the operation agent, and the descriptor group agent 2. They choose two candidate descriptor groups and a operation.
Figure 4: Ablation study for GRFG. In this experiments, we validate the necessity of different components in GRFG.
Figure 5: Comparison of the most important features in the original descriptor space, and the GRFG generated space.
...and 1 more figures

Reinforcement Feature Transformation for Polymer Property Performance Prediction

TL;DR

Abstract

Reinforcement Feature Transformation for Polymer Property Performance Prediction

Authors

TL;DR

Abstract

Table of Contents

Figures (6)