Instruction Tuning for Secure Code Generation
Jingxuan He, Mark Vero, Gabriela Krasnopolska, Martin Vechev
TL;DR
SafeCoder addresses the gap in instruction tuning for secure code by introducing a security-centric fine-tuning phase that jointly optimizes safety and utility. It combines a masked language modeling loss on secure code with a masked unlikelihood loss on insecure code, trained alongside standard instruction-tuning data, and relies on an automated GitHub-CodeQL data pipeline to assemble diverse vulnerability samples. Empirical results across multiple coding and general-purpose LMs show substantial security gains (~30% improvement) with minimal impact on utility across benchmarks like HumanEval and MBPP, while revealing limited generalization to unseen CWEs and outlining the need for future work in broader generalization and reinforcement learning approaches. The work also provides an automated, scalable data collection pipeline and open-sources its resources, enabling broader adoption and further advances in secure code generation.
Abstract
Modern language models (LMs) have gained widespread acceptance in everyday and professional contexts, particularly in programming. An essential procedure enabling this adoption is instruction tuning, which substantially enhances LMs' practical utility by training them to follow user instructions and human preferences. However, existing instruction tuning schemes overlook a crucial aspect: the security of generated code. As a result, even the state-of-the-art instruction-tuned LMs frequently produce unsafe code, posing significant security risks. In this work, we introduce SafeCoder to address this gap. SafeCoder performs security-centric fine-tuning using a diverse and high-quality dataset that we collected using an automated pipeline. We integrate the security fine-tuning with standard instruction tuning, to facilitate a joint optimization of both security and utility. Despite its simplicity, we show that SafeCoder is effective across a variety of popular LMs and datasets. It is able to drastically improve security (by about 30%), while preserving utility.
