Analyzing Regional Impacts of Climate Change using Natural Language Processing Techniques
Tanwi Mallick, John Murphy, Joshua David Bergerson, Duane R. Verner, John K Hutchison, Leslie-Anne Levy
TL;DR
The paper addresses the challenge of extracting region-specific climate impacts from a rapidly growing climate literature corpus. It presents a geo-centric NLP pipeline that uses BERT-based NER for location extraction, LocationTagger for fine-grained geography, BERTopic for regional topic modeling, and LLMs for concise topic summarization, applied to a climate-focused subset of the S2ORC containing over 600,000 documents. Key contributions include an end-to-end NLP workflow, a large geo-tagged corpus, identification of country and state-level research activity, and an interactive CIACC tool that makes geo-specific insights accessible. The work enables data-driven, location-tailored adaptation and mitigation planning for policymakers, engineers, and environmentalists.
Abstract
Understanding the multifaceted effects of climate change across diverse geographic locations is crucial for timely adaptation and the development of effective mitigation strategies. As the volume of scientific literature on this topic continues to grow exponentially, manually reviewing these documents has become an immensely challenging task. Utilizing Natural Language Processing (NLP) techniques to analyze this wealth of information presents an efficient and scalable solution. By gathering extensive amounts of peer-reviewed articles and studies, we can extract and process critical information about the effects of climate change in specific regions. We employ BERT (Bidirectional Encoder Representations from Transformers) for Named Entity Recognition (NER), which enables us to efficiently identify specific geographies within the climate literature. This, in turn, facilitates location-specific analyses. We conduct region-specific climate trend analyses to pinpoint the predominant themes or concerns related to climate change within a particular area, trace the temporal progression of these identified issues, and evaluate their frequency, severity, and potential development over time. These in-depth examinations of location-specific climate data enable the creation of more customized policy-making, adaptation, and mitigation strategies, addressing each region's unique challenges and providing more effective solutions rooted in data-driven insights. This approach, founded on a thorough exploration of scientific texts, offers actionable insights to a wide range of stakeholders, from policymakers to engineers to environmentalists. By proactively understanding these impacts, societies are better positioned to prepare, allocate resources wisely, and design tailored strategies to cope with future climate conditions, ensuring a more resilient future for all.
