SCI 3.0: A Web-based Schema Curation Interface for Graphical Event Representations
Reece Suchocki, Mary Martin, Martha Palmer, Susan Brown
TL;DR
The paper addresses the challenge of representing complex real-world events as structured, hierarchical schemas for NLP and information extraction. It introduces Schema Curation Interface 3.0 (SCI 3.0), a web-based tool built with React, Flask, and Cytoscape that enables real-time graph-based editing of events, entities, relations, and outlinks, bridging automatic schema induction with human curation. The workflow combines an automatic induction stage using hierarchicalschema2023 and GPT-3 with manual refinement via SCI 3.0, followed by a schema-guided prediction loop using the RESIN pipeline to improve coverage. The work demonstrates significant improvements in schema coverage through iterative curation, discusses potential expansions to AMR/UMR and sub-schema libraries, and highlights practical impacts for information extraction, data mining, and policy-relevant analytics.
Abstract
To understand the complexity of global events, one must navigate a web of interwoven sub-events, identifying those most impactful elements within the larger, abstract macro-event framework at play. This concept can be extended to the field of natural language processing (NLP) through the creation of structured event schemas which can serve as representations of these abstract events. Central to our approach is the Schema Curation Interface 3.0 (SCI 3.0), a web application that facilitates real-time editing of event schema properties within a generated graph e.g., adding, removing, or editing sub-events, entities, and relations directly through an interface.
