Chained computerized adaptive testing for the Force Concept Inventory

Jun-ichiro Yasuda; Michael M. Hull; Naohiro Mae; Kentaro Kojima

Chained computerized adaptive testing for the Force Concept Inventory

Jun-ichiro Yasuda, Michael M. Hull, Naohiro Mae, Kentaro Kojima

TL;DR

This paper presents Chain-CAT, a chained computerized adaptive testing approach that uses collateral information from prior test administrations to repeatedly assess student understanding of Newtonian mechanics via the Force Concept Inventory. Through numerical simulations, it demonstrates that collateral information can dramatically improve test efficiency, potentially reducing the total item burden below the conventional 60-item pre‑post scheme (e.g., 45 items with L=5 across nine administrations) without sacrificing accuracy or precision. The study also highlights practical constraints: a 30-item FCI item bank, without balancing or exposure controls, can achieve competitive performance, but incorporating content balancing and exposure limits diminishes efficiency unless the item bank is expanded with highly discriminative items from the same or other inventories. Overall, Chain-CAT offers a promising formative assessment tool for tracking conceptual change over a course, contingent on expanding item banks and validating the approach in real classroom settings.

Abstract

Although conceptual assessment tests are commonly administered at the beginning and end of a semester, this pre-post approach has inherent limitations. Specifically, education researchers and instructors have limited ability to observe the progression of student conceptual understanding throughout the course. Furthermore, instructors are limited in the usefulness of the feedback they can give to the students involved. To address these challenges, we propose an alternative approach that leverages computerized adaptive testing (CAT) and increasing the frequency of CAT-based assessments during the course, while reducing the test length per administration, thus keeping or decreasing the total number of test items administered throughout the course. The feasibility of this idea depends on how far the test length per administration can be reduced without compromising the test accuracy and precision. Specifically, the overall test length is desired to be shorter than when the full assessment is administered as a pretest and subsequent post-test. To achieve this goal, we developed a CAT algorithm that we call Chain-CAT. This algorithm sequentially links the results of each CAT administration using collateral information. We developed the Chain-CAT algorithm using the items of the Force Concept Inventory (FCI) and analyzed the efficiency by numerical simulations. We found that collateral information significantly improved the test efficiency, and the overall test length could be shorter than the pre-post method. Without constraints for item balancing and exposure control, simulation results indicated that the efficiency of Chain-CAT is comparable to that of the pre-post method even if the length of each CAT administration is only 5 items and the CAT is administered 9 times throughout the semester. (To continue, see text.)

Chained computerized adaptive testing for the Force Concept Inventory

TL;DR

Abstract

Chained computerized adaptive testing for the Force Concept Inventory

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)