Consistency Theory of General Nonparametric Classification Methods in Cognitive Diagnosis
Chengyu Cui, Yanlong Liu, Gongjun Xu
TL;DR
This work addresses clustering consistency for nonparametric cognitive diagnosis methods, notably NPC and GNPC, by deriving consistency results under weaker, more practical conditions than prior theory. It provides two main contributions: (i) consistency and finite-sample error bounds for the modified GNPC method without requiring consistent initial memberships or calibration data, and (ii) a parallel result for the original GNPC method with an additional 0/1 constraint, quantifying an extra error term when the constraint is violated. The results show that latent-attribute classification can be consistently recovered with rates that improve as the item set grows, and item-parameter estimates converge uniformly when sample size and test length are large, offering practical guidance for nonparametric CDMs in education and psychology. The simulations corroborate theoretical claims, demonstrating robust performance across DINA and GDINA data and highlighting the impact of test length, noise, and model complexity on accuracy, with the modified GNPC often outperforming in complex scenarios. Overall, the paper advances nonparametric CDM theory by delivering less restrictive guarantees and actionable insights for large-scale assessments and future extensions to unknown Q-matrices and inference.
Abstract
Cognitive diagnosis models have been popularly used in fields such as education, psychology, and social sciences. While parametric likelihood estimation is a prevailing method for fitting cognitive diagnosis models, nonparametric methodologies are attracting increasing attention due to their ease of implementation and robustness, particularly when sample sizes are relatively small. However, existing clustering consistency results of the nonparametric estimation methods often rely on certain restrictive conditions, which may not be easily satisfied in practice. In this article, the clustering consistency of the general nonparametric classification method is reestablished under weaker and more practical conditions.
