Decoding AI: The inside story of data analysis in ChatGPT
Ozan Evkaya, Miguel de Carvalho
TL;DR
This paper evaluates ChatGPT's Data Analysis extension as a tool for data exploration and modeling. It documents a workflow where prompts are translated into Python code executed in a sandbox, enabling descriptive analytics, visualization, and supervised/unsupervised learning demonstrations. Key findings show strong utility for exploratory tasks and stepwise modeling guidance, but highlight limitations in model diagnostics, metric choices for nonlinear models, and occasional misinterpretations by the DA system. The authors advocate human oversight, careful prompting, and domain expertise to avoid misleading conclusions, while highlighting the reproducibility of background Python code and the potential for LLM-driven augmentation of traditional statistical tools.
Abstract
As a result of recent advancements in generative AI, the field of Data Science is prone to various changes. This review critically examines the Data Analysis (DA) capabilities of ChatGPT assessing its performance across a wide range of tasks. While DA provides researchers and practitioners with unprecedented analytical capabilities, it is far from being perfect, and it is important to recognize and address its limitations.
