本研究主要在Leighton,Gierl和Hunka(2004)的四种属性层级结构测验情景下,考察不同被试知识状态(knowledgestates)分布形态、不同样本容量和不同认知属性数3种实验条件下,分别比较、分析五种常用认知诊断模型的属性诊断正确率(含边际判准率和模式判准率)及其影响因素,从而深入探讨每种模型的计量性能及模型属性诊断正确率的影响因素等,试图为实际应用者在模型比较与选用上提供借鉴和指导。
Cognitive diagnosis is an important topic in modern psychometric area. Now more than 70 cognitive diagnosis models (CDMs) are developed. There are some questions among these models: (1) When the attribute hierarchy structure is known, how to choose the most suitable model? (2) When the attribute hierarchy structure is unknown, and cognitive diagnosis is required, how to do it? These problems seems especially more puzzled for the practice workers. This paper only paid main attention on three international popular models. Therefore, five cognitive diagnosis models (RSM, AHM_A, GDD, DINA and DINA_HC) were compared corresponding to the above questions from psychometric opinion. In this paper, Monte Carlo simulation study was used. Although the number of slips and the hierarchy structure are two important factors that affect the performance on corrected match ratio of cognitive diagnosis, this study would pay attention on other three factors: the distribution of cognitive pattern, the sample size, the number of attributes. The findings identified: (1) When the characteristic of data was known, focusing on specific factor, the five methods had different advantages. a) For the distributions of cognitive pattern, although they have different effects on different methods, the same conclusion could find that the performance on negative bias distribution was the best, and that of DINA HC and DINA were better than the rest methods on any discussed distributions. b) Considering the sample size, the performance of GDD with small scale assessment (100/20, persons/items)was the best one; with medium and large scale assessment (1000/60, 5000/100, persons/items), the performance of DINA_HC and DINA were better than the rest c) For the number of attributes, the more the attributes are the worse the performance will be. But for the methods, the performance the performance of DINA_HC and DINA were also better than the rest. All these reflected that the most suitable method could be ada