Attribute reduction algorithm of rough sets based on spatial optimization
Xuchang Guo, Houbiao Li
TL;DR
The paper addresses the problem that traditional rough set attribute reduction primarily minimizes the number of reduced attributes without considering spatial alignment with the decision attribute, potentially producing many rules and limited generality. It introduces a spatial optimization framework that uses spatial similarity between partitions and a spatially weighted metric, SPS, to guide attribute selection via SigSPS, culminating in a Spatial Optimal Rough Set attribute Reduction (SRS) algorithm. The approach combines cosine-based partition similarity and positive-region coverage to favor reductions that closely reflect the decision partition, and it demonstrates through experiments on 10 UCI datasets that SRS yields higher spatial similarity and often fewer rules than existing methods like MIBARK and AO. This work advances rough-set-based rule learning by producing more concise, generalizable rule sets and offers a scalable alternative to traditional discernibility-matrix–driven reductions.
Abstract
Rough set is one of the important methods for rule acquisition and attribute reduction. The current goal of rough set attribute reduction focuses more on minimizing the number of reduced attributes, but ignores the spatial similarity between reduced and decision attributes, which may lead to problems such as increased number of rules and limited generality. In this paper, a rough set attribute reduction algorithm based on spatial optimization is proposed. By introducing the concept of spatial similarity, to find the reduction with the highest spatial similarity, so that the spatial similarity between reduction and decision attributes is higher, and more concise and widespread rules are obtained. In addition, a comparative experiment with the traditional rough set attribute reduction algorithms is designed to prove the effectiveness of the rough set attribute reduction algorithm based on spatial optimization, which has made significant improvements on many datasets.
