To be Continuous, or to be Discrete, Those are Bits of Questions
Yiran Wang, Masao Utiyama
TL;DR
This work introduces binary, bit-level outputs for structured prediction by extending CKY to handle binary labels and by formulating a span-marginal similarity that combines label and structural information. It unifies parsing and hashing under a single structured contrastive learning objective and uses a max-based instance selection loss to overcome the geometric center issue. Empirical results on constituency parsing and nested NER show competitive performance with only a small number of bits (around 12 for parsing and 8 for NER), which points to memory and efficiency gains, and the learned codes exhibit implicit label clustering. The approach suggests a way to bridge the continuous representations of deep learning and the discrete nature of natural language, with potential relevance to scalable, interpretable NLP models.
Abstract
Recently, binary representation has been proposed as a novel form of representation that lies between continuous and discrete representations. It exhibits considerable information-preserving capability when used to replace continuous input vectors. In this paper, we investigate the feasibility of introducing it on the output side as well, so that models output binary labels instead. To preserve structural information on the output side along with label information, we extend the previous contrastive hashing method into structured contrastive hashing. More specifically, we upgrade CKY from label-level to bit-level, define a new similarity function based on span marginal probabilities, and introduce a novel contrastive loss function with a carefully designed instance selection strategy. Our model achieves competitive performance on various structured prediction tasks and demonstrates that binary representation can further bridge the gap between the continuous nature of deep learning and the intrinsically discrete nature of natural language.
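To make the notion of a binary label concrete, the following minimal sketch shows how a vector of real-valued scores can be turned into a short bit code and how two such codes can be compared. It is an illustration only and not the method of this paper: the sign thresholding, the function names (binarize, pack_bits, hamming_similarity), and the score values are all assumptions chosen for the example.

```python
import numpy as np


def binarize(scores):
    """Map real-valued scores to bits by sign (assumed rule): positive -> 1, else 0."""
    return (np.asarray(scores) > 0).astype(np.uint8)


def pack_bits(bits):
    """Pack a bit vector (most significant bit first) into one integer code."""
    code = 0
    for b in bits:
        code = (code << 1) | int(b)
    return code


def hamming_similarity(a, b):
    """Count the positions at which two equal-length bit vectors agree."""
    return int(np.sum(np.asarray(a) == np.asarray(b)))


# Made-up scores for two hypothetical spans, 8 bits each.
span_a = binarize([0.7, -1.2, 0.3, 2.1, -0.4, -0.9, 1.5, 0.2])
span_b = binarize([0.5, -0.8, 0.1, 1.9, 0.6, -1.1, 1.2, 0.4])

code_a = pack_bits(span_a)
code_b = pack_bits(span_b)
agreement = hamming_similarity(span_a, span_b)

# K bits can distinguish at most 2**K codes.
capacity_8 = 2 ** 8
capacity_12 = 2 ** 12
```

With K bits a model can distinguish at most 2^K codes, so 8 bits give 256 possible codes and 12 bits give 4096. Codes that agree in most positions can be treated as similar.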
