Vertical equating is the process of placing scores from tests that measure the same psychological trait at different levels onto a common score scale. Item Response Theory (IRT) and Multidimensional Item Response Theory (MIRT) are the main methods used for vertical equating. IRT is an effective way to build a vertical scale because it makes no assumption about the distribution of test-takers' ability and its parameter estimates do not depend on the sample. However, its application is restricted when a test does not satisfy the unidimensionality assumption. MIRT extends IRT by combining features of IRT and factor analysis, so it can estimate the item parameters and ability parameters of multidimensional tests more effectively, which makes it an important tool in vertical equating. Existing research has mainly examined the applicability of IRT and MIRT in vertical equating, their calibration methods, and their parameter estimation methods, and has compared the properties of the two approaches. Future studies should compare IRT and MIRT under a wider range of variable conditions in order to extend the application of both methods.