MemPal: Leveraging Multimodal AI and LLMs for Voice-Activated Object Retrieval in Homes of Older Adults

Natasha Maniar; Samantha W. T. Chan; Wazeer Zulfikar; Scott Ren; Christine Xu; Pattie Maes

MemPal: Leveraging Multimodal AI and LLMs for Voice-Activated Object Retrieval in Homes of Older Adults

Natasha Maniar, Samantha W. T. Chan, Wazeer Zulfikar, Scott Ren, Christine Xu, Pattie Maes

TL;DR

MemPal addresses the memory challenges of older adults by combining a wearable egocentric camera, a voice-based LLM interface, and a vision-language system to support retrospective object retrieval via natural conversation. The approach yields improved object-finding performance over baseline and comparable results to visual aids, while also enabling an activity diary for context-based queries and potential safety reminders. User feedback indicates overall usefulness and acceptable usability, though comfort, accuracy, and onboarding require refinement for broader adoption. The work demonstrates the feasibility of a multimodal memory assistant that preserves privacy by storing textual rather than image data and points to future directions for personalized, proactive, and privacy-preserving memory support in home environments.

Abstract

Older adults have increasing difficulty with retrospective memory, hindering their abilities to perform daily activities and posing stress on caregivers to ensure their wellbeing. Recent developments in Artificial Intelligence (AI) and large context-aware multimodal models offer an opportunity to create memory support systems that assist older adults with common issues like object finding. This paper discusses the development of an AI-based, wearable memory assistant, MemPal, that helps older adults with a common problem, finding lost objects at home, and presents results from tests of the system in older adults' own homes. Using visual context from a wearable camera, the multimodal LLM system creates a real-time automated text diary of the person's activities for memory support purposes, offering object retrieval assistance using a voice-based interface. The system is designed to support additional use cases like context-based proactive safety reminders and recall of past actions. We report on a quantitative and qualitative study with N=15 older adults within their own homes that showed improved performance of object finding with audio-based assistance compared to no aid and positive overall user perceptions on the designed system. We discuss further applications of MemPal's design as a multi-purpose memory aid and future design guidelines to adapt memory assistants to older adults' unique needs.

MemPal: Leveraging Multimodal AI and LLMs for Voice-Activated Object Retrieval in Homes of Older Adults

TL;DR

Abstract

MemPal: Leveraging Multimodal AI and LLMs for Voice-Activated Object Retrieval in Homes of Older Adults

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (16)