Psychological Science ›› 2011, Vol. 34 ›› Issue (5): 1189-1194.

Previous Articles     Next Articles

New progress on IRT: the Mixture Model Based on 3PLM and GRM

  

  • Received:2010-06-17 Revised:2011-05-13 Online:2011-09-20 Published:2011-09-28
  • Contact: Tu Dong-Bo

项目反应理论新进展:基于3PLM和GRM的混合模型

涂冬波1,蔡艳2,戴海琦2,丁树良2   

  1. 1. 江西师范大学
    2.
  • 通讯作者: 涂冬波
  • 基金资助:

    教育部人文社科一般项目青年基金;江西省教育厅科技项目青年基金;江西省教育厅高校人文社科项目青年基金;国家自然科学基金

Abstract:

There are many IRT models for available for realistic work which are adaptive to different datas. In china, there exits many kinds of examinations, and the item types are rich. In realistic work, only one IRT model can not reflect all data’s features. Thus one more IRT models are considered called mixture model to realize the optimized data fit. This paper explored the idea, principle, parameter estimation and the properties of mixture model based on 3PLM and GRM. To explore the parameters estimation precision, and to probe the properties of mixture model, Monte Carlo method was used here. The result showed: (1) The parameters estimation precision of Mix_Tu program was preferably great, which equivalent to the precision of Parscale program. (2) When “item bugs”, the estimating of parameter b and c with Mix_Tu program were more affected by the extent of item bugs than that with Parscale program, the estimating of parameter a was vice versa, while the estimating of theta was similar. (3) When “examination bugs”, the estimating of all parameters with Mix_Tu program were less affected by the degree of examination bugs than that with Parscale program. The estimating was more robust than that with Parscale program.

Key words: Item Response Theory, three parameter Logistic model, Graded Response Theory, Mixture model

摘要:

IRT中的计量模型较多,不同计量模型适合不同特点的数据资料,实际工作者应根据实际情况选择适当的IRT模型来分析数据。我国是个考试、测评大国,测评的题型丰富多样,在实际应用IRT时,一个模型往往很难反应所有数据资料本身的特点,这时可考虑应用多个IRT模型(即“混合模型”)来分析,以达到对数据的最佳拟合。本文对混合模型的思想方法及原理、参数估计的实现、以及模型性能进行了研究,发现:(1)本文自主开发的混合模型参数估计程序Mix_Tu具有较高的返真性,且与国际知名测量软件Parscale相当。(2)在“项目异常”情况下,Mix_Tu程序对参数b和c的估计受数据异常程度的影响要大于Parscale程序,而对参数a的估计受数据异常程度的影响要小于Parscale程序,而在参数theta上两个程序相当。(3)在“被试异常”情况下,Mix_Tu程序对所有参数的估计受数据异常程度的影响均要小于Parscale程序,Mix_Tu程序表现的更为稳健。

关键词: 项目反应理论, 3PLM, 等级反应模型, 混合模型