各种心理调查、心理实验中,数据的缺失随处可见。由于数据缺失,给概化理论分析非平衡数据的方差分量带来一系列问题。基于概化理论框架下,运用Matlab7.0软件,自编程序模拟产生随机双面交叉设计pxixr缺失数据,比较和探讨公式法、REML法、拆分法和MCMC法在估计各个方差分量上的性能优劣。结果表明:(1)MCMC方法估计随机双面交叉设计pxixr缺失数据方差分量,较其它3种方法表现出更强的优势;(2)题目和评分者是缺失数据方差分量估计重要的影响因素。
Missing observations are common in operational performance assessment settings or psychological surveys and experiments. Since these assessments are time-consuming to administer and score, examinees seldom respond to all test items and raters seldom evaluate all examinee responses. As a result, a frequent problem encountered by those using generalizability theory with large-scale performance assessments is working with missing data. Data from such examinations compose a missing data matrix. Researchers usually concern about how to make good use of the full data and often ignore missing data. As for these missing data, a common practice is to delete them or make an imputaion for missing records; however, it may cause problems in following aspects. Firstly, deleting or interpolating missing data may result in ineffective statistical analysis. Secondly, it is difficult for researchers to choose an unbiased method among diverse rules of interpolation. As a result of missing data, a series of problems may be caused when estimating variance components of unbalanced data in generalizability theory. A key issue with generalizability theory lies in how to effectively utilize the existing missing data to their maximum statistical analysis capacity. This article provides four methods to estimate variance components of missing data for unbalanced random pxixr design of generalizability theory: formulas method, restricted maximum likelihood estimation (REML) method, subdividing method, and Markov Chain Monte Carlo (MCMC) method. Based on the estimating formulas ofp xi design by Brennan (2001), formulas method is the deduction of estimating variance components formulas for pxixr design with missing data. The aim of this article is to investigate which method is superior in estimating variance components of missing data rapidly and effectively. MATLAB 7.0 was used to simulate data, and generalizability theory was used to estimate variance com- ponents. Three conditions were simulated respectively: (1) pers