Semiotics Networks Representing Perceptual Inference
David Kupeev, Eyal Nitzany
TL;DR
This work introduces Conscious Neural Networks (CONNs), a framework that models object perception and cross-person communication through observed-to-seen cycles in internal representations, enabling interpretable visualization of perceptual processes. It formalizes attractor-based dynamics and bipartite orbit concepts to capture how individuals converge on stable percepts and how dialogue between agents can settle into periodic patterns. The authors build vanilla and stochastic CONN classifiers by embedding perceptual layers based on autoencoders before a baseline classifier, achieving improved performance with limited data while preserving interpretability. The approach generalizes beyond human agents to any system with latent-to-raw representation processing, offering a pathway toward interpretable perceptual inference and robust multi-modal communication in AI systems.
Abstract
Every day, humans perceive objects and communicate these perceptions through various channels. In this paper, we present a computational model designed to track and simulate the perception of objects, as well as their representations as conveyed in communication. We delineate two fundamental components of our internal representation, termed "observed" and "seen", which we correlate with established concepts in computer vision, namely encoding and decoding. These components are integrated into semiotic networks, which simulate perceptual inference of object perception and human communication. Our model of object perception by a person allows us to define object perception by {\em a network}. We demonstrate this with an example of an image baseline classifier by constructing a new network that includes the baseline classifier and an additional layer. This layer produces the images "perceived" by the entire network, transforming it into a perceptualized image classifier. This facilitates visualization of the acquired network. Within our network, the image representations become more efficient for classification tasks when they are assembled and randomized. In our experiments, the perceptualized network outperformed the baseline classifier on MNIST training databases consisting of a restricted number of images. Our model is not limited to persons and can be applied to any system featuring a loop involving the processing from "internal" to "external" representations.
