SkillGen: Learning Domain Skills for In-Context Sequential Decision Making
Ruomeng Ding, Wei Cheng, Minglai Shao, Chen Zhao
TL;DR
SkillGen advances in-context learning for sequential decision making by learning domain-oriented, action-centric skills and using them to generate focused, step-aware prompts. It builds a domain knowledge graph from trajectories, assigns action utility with TD($\lambda$) credit, and extracts reusable skills centered on actions. A golden segment plus retrieved step-wise skills guide a frozen LLM to act without parameter updates, with theoretical results showing improved task identifiability from focused prompts. Empirically, SkillGen yields consistent gains across ALFWorld, BabyAI, and ScienceWorld on multiple models, and ablations confirm the contribution of golden segments, step-wise skills, and credit assignment. The approach also demonstrates strong token efficiency and adaptability to varying sampling and retrieval configurations, highlighting practical benefits for scalable, knowledge-augmented prompting in real-world robotic and reasoning tasks.
Abstract
Large language models (LLMs) are increasingly applied to sequential decision-making through in-context learning (ICL), yet their effectiveness is highly sensitive to prompt quality. Effective prompts should meet three principles: focus on decision-critical information, provide step-level granularity, and minimize reliance on expert annotations through label efficiency. However, existing ICL methods often fail to satisfy all three criteria simultaneously. Motivated by these challenges, we introduce SkillGen, a skill-based ICL framework for structured sequential reasoning. It constructs an action-centric, domain-level graph from sampled trajectories, identifies high-utility actions via temporal-difference credit assignment, and retrieves step-wise skills to generate fine-grained, context-aware prompts. We further present a theoretical analysis showing that focusing on high-utility segments supports task identifiability and informs more effective ICL prompt design. Experiments on ALFWorld, BabyAI, and ScienceWorld, using both open-source and proprietary LLMs, show that SkillGen achieves consistent gains, improving progress rate by 5.9%-16.5% on average across models.
