Iterating Pointers: Enabling Static Analysis for Loop-based Pointers
Andrea Lepori, Alexandru Calotoiu, Torsten Hoefler
TL;DR
Pointer-based data accesses in C complicate static analysis and automatic parallelization. The authors introduce pointer disaggregation, a transformation that splits a pointer into a data container and an offset (adjunct), making memory accesses explicit and enabling stronger data-dependency analysis. Their source-to-source implementation uses libclang to generate adjuncts and integrate with data-centric compilers like DaCe, unlocking parallelization opportunities in real codes such as PBKDF2 and HPCCG, with substantial speedups in some cases. Across OpenSSL PBKDF2, Mantevo HPCCG, and LZO benchmarks, the approach yields automatic parallelization when possible and notable performance improvements, demonstrating practical impact for pointer-heavy C programs. The work positions pointer-aware transformations as a principled path to leverage modern parallelizing frameworks for codes traditionally resistant to static optimization.
Abstract
Pointers are an integral part of C and other programming languages. They enable substantial flexibility from the programmer's standpoint, allowing the user fine, unmediated control over data access patterns. However, accesses done through pointers are often hard to track, and challenging to understand for optimizers, compilers, and sometimes, even for the developers themselves because of the direct memory access they provide. We alleviate this problem by exposing additional information to analyzers and compilers. By separating the concept of a pointer into a data container and an offset, we can optimize C programs beyond what other state-of-the-art approaches are capable of, in some cases even enabling auto-parallelization. Using this process, we are able to successfully analyze and optimize code from OpenSSL, the Mantevo benchmark suite, and the Lempel-Ziv-Oberhumer compression algorithm. We provide the only automatic approach able to find all parallelization opportunities in the HPCCG benchmark from the Mantevo suite the developers identified and even outperform the reference implementation by up to 18%, as well as speed up the PBKDF2 algorithm implementation from OpenSSL by up to 11x.
