LAET: A Layer-wise Adaptive Ensemble Tuning Framework for Pretrained Language Models

Jawad Ibn Ahad; Muhammad Rafsan Kabir; Robin Krambroeckers; Sifat Momen; Nabeel Mohammed; Shafin Rahman

LAET: A Layer-wise Adaptive Ensemble Tuning Framework for Pretrained Language Models

Jawad Ibn Ahad, Muhammad Rafsan Kabir, Robin Krambroeckers, Sifat Momen, Nabeel Mohammed, Shafin Rahman

TL;DR

LAET tackles the heavy compute barrier of domain-specific LLMs in finance by identifying and fine-tuning only the most impactful layers through per-layer probing, then aggregating predictions via voting. The approach reduces training cost while delivering competitive or superior results across 23 finance-focused datasets spanning textual analysis, forecasting, and risk management, against strong baselines including GPT-4. The findings show that small, carefully tuned LLMs can rival larger models when guided by layer-wise relevance and ensemble decision-making, with substantial layer-reduction (up to 60%) without sacrificing accuracy. The work provides practical insights into layer usefulness, representation choice (last token), and a scalable pipeline for efficient financial NLP deployment across domains.

Abstract

Natural Language Processing (NLP) has transformed the financial industry, enabling advancements in areas such as textual analysis, risk management, and forecasting. Large language models (LLMs) like BloombergGPT and FinMA have set new benchmarks across various financial NLP tasks, including sentiment analysis, stock movement prediction, and credit risk assessment. Furthermore, FinMA-ES, a bilingual financial LLM, has also demonstrated strong performance using the FLARE and FLARE-ES benchmarks. However, the high computational demands of these models limit the accessibility of many organizations. To address this, we propose Layer-wise Adaptive Ensemble Tuning (LAET), a novel strategy that selectively fine-tunes the most effective layers of pre-trained LLMs by analyzing hidden state representations while freezing less critical layers. LAET significantly reduces computational overhead while enhancing task-specific performance. Our approach shows strong results in financial NLP tasks, outperforming existing benchmarks and state-of-the-art LLMs such as GPT-4, even with smaller LLMs ($\sim$3B parameters). This work bridges cutting-edge financial NLP research and real-world deployment with efficient and scalable models for financial applications.

LAET: A Layer-wise Adaptive Ensemble Tuning Framework for Pretrained Language Models

TL;DR

Abstract

LAET: A Layer-wise Adaptive Ensemble Tuning Framework for Pretrained Language Models

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)