Non-Random Data Encodes its Geometric and Topological Dimensions
Hector Zenil, Felipe S. Abrahão, Luan C. S. M. Ozelim
TL;DR
The paper tackles the problem of decoding non-random signals embedded in unknown multidimensional spaces without any prior knowledge of the encoding scheme, by introducing Algorithmic Information Dynamics (AID) and a perturbation-based reconstruction approach. It combines perturbation analysis with the Block Decomposition Method (BDM) to estimate algorithmic complexity and identify the original multidimensional space $\mathcal{S}$ that best explains a received message, achieving a zero-knowledge one-way communication framework. Empirically, it demonstrates that downward spikes in complexity landscapes reliably indicate correct dimensions across text, image, and audio-like data, and it provides a theoretical foundation connecting algorithmic information theory to a generalized noisy-channel setting. The work offers a universal methodology for decoding without priors, with potential impact on signal processing, cryptography, and detection of technosignatures, and it ties together geometry, topology, and semantics through compression-based information measures.
Abstract
Based on the principles of information theory, measure theory, and theoretical computer science, we introduce a signal deconvolution method with a wide range of applications to coding theory, particularly in zero-knowledge one-way communication channels, such as in deciphering messages (i.e., objects embedded into multidimensional spaces) from unknown generating sources about which no prior knowledge is available and to which no return message can be sent. Our multidimensional space reconstruction method from an arbitrary received signal is proven to be agnostic vis-à-vis the encoding-decoding scheme, computation model, programming language, formal theory, the computable (or semi-computable) method of approximation to algorithmic complexity, and any arbitrarily chosen (computable) probability measure. The method derives from the principles of an approach to Artificial General Intelligence (AGI) capable of building a general-purpose model of models independent of any arbitrarily assumed prior probability distribution. We argue that this optimal and universal method of decoding non-random data has applications to signal processing, causal deconvolution, topological and geometric properties encoding, cryptography, and bio- and technosignature detection.
