BERT4FCA: A Method for Bipartite Link Prediction using Formal Concept Analysis and BERT
Siqi Peng, Hongyuan Yang, Akihiro Yamamoto
TL;DR
BERT4FCA addresses bipartite link prediction by marrying formal concept analysis with a pre-trained BERT framework to exploit rich information in concept lattices. The method converts bipartite networks into formal contexts, extracts maximal bi-cliques as formal concepts, and uses neighboring relations in the concept lattice through a dedicated pre-training regime with tasks for masked token prediction and concept-neighbor prediction. Fine-tuning then handles both O-O and O-A prediction tasks, leveraging two specialized models (object and attribute) and eliminating position embeddings to reflect order-free concept structures. Empirical results on three real-world datasets show that BERT4FCA consistently outperforms previous FCA-based methods and classic baselines, with ablations confirming the value of incorporating neighboring relations and concept-lattice information. Overall, BERT4FCA provides a general framework for applying BERT to FCA-derived representations, with potential applicability to broader FCA-related prediction and discovery tasks.
Abstract
We propose BERT4FCA, a novel method for link prediction in bipartite networks, using formal concept analysis (FCA) and BERT. Link prediction in bipartite networks is an important task that can solve various practical problems like friend recommendation in social networks and co-authorship prediction in author-paper networks. Recent research has found that in bipartite networks, maximal bi-cliques provide important information for link prediction, and they can be extracted by FCA. Some FCA-based bipartite link prediction methods have achieved good performance. However, we figured out that their performance could be further improved because these methods did not fully capture the rich information of the extracted maximal bi-cliques. To address this limitation, we propose an approach using BERT, which can learn more information from the maximal bi-cliques extracted by FCA and use them to make link prediction. We conduct experiments on three real-world bipartite networks and demonstrate that our method outperforms previous FCA-based methods, and some classic methods such as matrix-factorization and node2vec.
