From Text to Returns: Using Large Language Models for Mutual Fund Portfolio Optimization and Risk-Adjusted Allocation

Abrar Hossain; Mufakir Qamar Ansari; Haziq Jeelani; Monia Digra; Fayeq Jeelani Syed

From Text to Returns: Using Large Language Models for Mutual Fund Portfolio Optimization and Risk-Adjusted Allocation

Abrar Hossain, Mufakir Qamar Ansari, Haziq Jeelani, Monia Digra, Fayeq Jeelani Syed

TL;DR

The paper investigates using Retrieval-Augmented Generation with open-source LLMs to optimize mutual fund sector allocations, aiming to improve returns and risk management. It compares Microsoft Phi-2, Mistral 7B, and Zypher 7B within a synthetic, RAG-enabled framework that converts data to embeddings, retrieves contextual signals, and applies a blended optimization step. Results show Zypher 7B delivering the strongest returns and risk-adjusted performance across Funds A and C, with Mistral 7B also performing well, while Phi-2 underperforms notably. The study demonstrates the viability of GenAI-driven, data-driven portfolio optimization while acknowledging limitations (synthetic data, interpretability, and computational demands) and outlining future work on real-time data integration and enhanced transparency for AI-assisted finance.

Abstract

Generative AI (GenAI) has enormous potential for improving two critical areas in investing, namely portfolio optimization (choosing the best combination of assets) and risk management (protecting those investments). Our study works at this intersection, using Large Language Models (LLMs) to upgrade how financial decisions are traditionally made. This research specifically tested how well advanced LLMs like Microsoft Phi 2, Mistral 7B, and Zypher 7B can create practical, risk-aware strategies for investing mutual funds in different sectors of the economy. Our method is sophisticated: it combines a Retrieval-Augmented Generation (RAG) pipeline, which enables the LLM to check external, real-time data with standard financial optimization methods. The model's advice is context-aware because we feed it large economic signals, like changes in the global economy. The Zypher 7B model was the clear winner. It consistently produced strategies that maximized investment returns while delivering better risk-adjusted results than the other models. Its ability to process complex relationships and contextual information makes it a highly powerful tool for financial allocation. In conclusion, our findings show that GenAI substantially improves performance over basic allocation methods. By connecting GenAI to real-world financial applications, this work lays the groundwork for creating smarter, more efficient, and more adaptable solutions for asset management professionals.

From Text to Returns: Using Large Language Models for Mutual Fund Portfolio Optimization and Risk-Adjusted Allocation

TL;DR

Abstract

From Text to Returns: Using Large Language Models for Mutual Fund Portfolio Optimization and Risk-Adjusted Allocation

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)