KamerRaad: Enhancing Information Retrieval in Belgian National Politics through Hierarchical Summarization and Conversational Interfaces
Alexander Rogiers, Maarten Buyl, Bo Kang, Tijl De Bie
TL;DR
KamerRaad addresses the challenge of making extensive and heterogeneous parliamentary records accessible to citizens by using hierarchical summarization within a Retrieval-Augmented Generation framework to condense long documents into context-friendly chunks while preserving source provenance. It introduces metadata tagging and a two-level summarization approach (comprehensive and one-line) to optimize LLM prompts and enable source-grounded, conversational dialogue via a Streamlit UI and open-source back-end models. The system ranks relevant chunks with cosine similarity and maintains direct links to source documents, facilitating traceability for policymakers and the public alike. By combining hierarchical summarization with source-driven dialogue, KamerRaad enhances accessibility, transparency, and trust in political information, potentially improving democratic participation and informed decision-making.
Abstract
KamerRaad is an AI tool that leverages large language models to help citizens interactively engage with Belgian political information. The tool extracts and concisely summarizes key excerpts from parliamentary proceedings, followed by the potential for interaction based on generative AI that allows users to steadily build up their understanding. KamerRaad's front-end, built with Streamlit, facilitates easy interaction, while the back-end employs open-source models for text embedding and generation to ensure accurate and relevant responses. By collecting feedback, we intend to enhance the relevancy of our source retrieval and the quality of our summarization, thereby enriching the user experience with a focus on source-driven dialogue.
