The Impact of Environment Configurations on the Stability of AI-Enabled Systems

Musfiqur Rahman; SayedHassan Khatoonabadi; Ahmad Abdellatif; Haya Samaana; Emad Shihab

The Impact of Environment Configurations on the Stability of AI-Enabled Systems

Musfiqur Rahman, SayedHassan Khatoonabadi, Ahmad Abdellatif, Haya Samaana, Emad Shihab

TL;DR

This paper addresses stability challenges in AI-enabled software resulting from environment configurations, focusing on operating system, Python version, and CPU architecture. It adopts an empirical, Travis CI–based methodology across eight configurations and 30 open-source projects, evaluating model performance, processing time, and expense. The study finds pervasive instability across metrics, especially in processing time and cost, with Linux generally offering faster and cheaper runs while MacOS may trade speed for marginal model-performance gains, and ARM64 often underperforming relative to AMD64. The work underscores the importance of dev/prod parity and testing across configurations to identify the most stable deployment setup, offering practical guidance for reducing instability and informing future research into its causes.

Abstract

Nowadays, software systems tend to include Artificial Intelligence (AI) components. Changes in the operational environment have been known to negatively impact the stability of AI-enabled software systems by causing unintended changes in behavior. However, how an environment configuration impacts the behavior of such systems has yet to be explored. Understanding and quantifying the degree of instability caused by different environment settings can help practitioners decide the best environment configuration for the most stable AI systems. To achieve this goal, we performed experiments with eight different combinations of three key environment variables (operating system, Python version, and CPU architecture) on $30$ open-source AI-enabled systems using the Travis CI platform. We determine the existence and the degree of instability introduced by each configuration using three metrics: the output of an AI component of the system (model performance), the time required to build and run the system (processing time), and the cost associated with building and running the system (expense). Our results indicate that changes in environment configurations lead to instability across all three metrics; however, it is observed more frequently with respect to processing time and expense rather than model performance. For example, between Linux and MacOS, instability is observed in 23\%, 96.67\%, and 100\% of the studied projects in model performance, processing time, and expense, respectively. Our findings underscore the importance of identifying the optimal combination of configuration settings to mitigate drops in model performance and reduce the processing time and expense before deploying an AI-enabled system.

The Impact of Environment Configurations on the Stability of AI-Enabled Systems

TL;DR

Abstract

open-source AI-enabled systems using the Travis CI platform. We determine the existence and the degree of instability introduced by each configuration using three metrics: the output of an AI component of the system (model performance), the time required to build and run the system (processing time), and the cost associated with building and running the system (expense). Our results indicate that changes in environment configurations lead to instability across all three metrics; however, it is observed more frequently with respect to processing time and expense rather than model performance. For example, between Linux and MacOS, instability is observed in 23\%, 96.67\%, and 100\% of the studied projects in model performance, processing time, and expense, respectively. Our findings underscore the importance of identifying the optimal combination of configuration settings to mitigate drops in model performance and reduce the processing time and expense before deploying an AI-enabled system.

Paper Structure (18 sections, 2 equations, 4 figures, 10 tables)

This paper contains 18 sections, 2 equations, 4 figures, 10 tables.

Introduction
Methodology and Background
Environment Configurations in Travis CI
Dataset
Analysis of Instability
Evaluation Metrics:
Result Analysis:
RQ1: (Operating System) How much instability is introduced by changing the operating system in AI-enabled systems?
Instability with respect to Operating System
Instability with respect to Linux Distribution
RQ2: (Python Version) How much does changing the Python version introduce instability in AI-enabled systems?
RQ3: (CPU Architecture) How much does changing the CPU architecture introduce instability in AI-enabled systems?
Discussion
Interpretation
Implications
...and 3 more sections

Figures (4)

Figure 1: Distributions of instability with respect to Operating Systems.
Figure 2: Distributions of instability with respect to Linux Distributions.
Figure 3: Distributions of instability with respect to Python versions.
Figure 4: Distributions of instability with respect to CPU architectures.

The Impact of Environment Configurations on the Stability of AI-Enabled Systems

TL;DR

Abstract

The Impact of Environment Configurations on the Stability of AI-Enabled Systems

Authors

TL;DR

Abstract

Table of Contents

Figures (4)