东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

五大认知诊断模型的诊断正确率比较及其影响因素:基于分布形态、属性数及样本容量的比较

ISSN号：0439-755X
期刊名称：心理学报
时间：2013.8
页码：1295-1304
分类：B841[哲学宗教—基础心理学;哲学宗教—心理学]
作者机构：[1]江西师范大学心理学院,南昌330022, [2]江西师范大学计算机信息工程学院,南昌330022
相关基金：教育部人文社科基金（11YJC190002）,国家自然科学基金（31100756,31160203）,高等院校博士点基金项目（20103604120001,20123604120001）,江西省教育科学规划项目（13YB029,12YB088）,江西师范大学青年英才培育资助计划项目资助.
相关项目：基于现代测量理论的认知诊断CAT研究

关键词：认知属性, 认知诊断, 认知诊断模型, 属性诊断正确率, cognitive attributes, cognitive diagnosis, cognitive diagnosis models, attributes Match Ratio

中文摘要：

本研究主要在Leighton，Gierl和Hunka（2004）的四种属性层级结构测验情景下，考察不同被试知识状态（knowledgestates）分布形态、不同样本容量和不同认知属性数3种实验条件下，分别比较、分析五种常用认知诊断模型的属性诊断正确率（含边际判准率和模式判准率）及其影响因素，从而深入探讨每种模型的计量性能及模型属性诊断正确率的影响因素等，试图为实际应用者在模型比较与选用上提供借鉴和指导。

英文摘要：

Cognitive diagnosis is an important topic in modern psychometric area. Now more than 70 cognitive diagnosis models （CDMs） are developed. There are some questions among these models：（1） When the attribute hierarchy structure is known, how to choose the most suitable model？（2） When the attribute hierarchy structure is unknown, and cognitive diagnosis is required, how to do it？ These problems seems especially more puzzled for the practice workers. This paper only paid main attention on three international popular models. Therefore, five cognitive diagnosis models （RSM, AHM_A, GDD, DINA and DINA_HC） were compared corresponding to the above questions from psychometric opinion. In this paper, Monte Carlo simulation study was used. Although the number of slips and the hierarchy structure are two important factors that affect the performance on corrected match ratio of cognitive diagnosis, this study would pay attention on other three factors： the distribution of cognitive pattern, the sample size, the number of attributes. The findings identified：（1） When the characteristic of data was known, focusing on specific factor, the five methods had different advantages. a） For the distributions of cognitive pattern, although they have different effects on different methods, the same conclusion could find that the performance on negative bias distribution was the best, and that of DINA HC and DINA were better than the rest methods on any discussed distributions. b） Considering the sample size, the performance of GDD with small scale assessment （100/20, persons/items）was the best one; with medium and large scale assessment （1000/60, 5000/100, persons/items）, the performance of DINA_HC and DINA were better than the rest c） For the number of attributes, the more the attributes are the worse the performance will be. But for the methods, the performance the performance of DINA_HC and DINA were also better than the rest. All these reflected that the most suitable method could be ada

同期刊论文项目