高考数学中考试评价的研究——基于CTT与IRT的实证比较

闫成海; 杜文久; 宋乃庆; 张健

高考数学中考试评价的研究——基于CTT与IRT的实证比较

1.
西安文理学院数学与计算机工程学院，西安 710065
2.
西南大学数学与统计学院，重庆 400715
3.
重庆市教育考试院，重庆 401147

基金项目:

教育部哲学社会科学研究后期资助项目"中国基础教育改革与发展研究" 11JHQ001

重庆市教育科学规划项目"项目反应理论在普通高考中的应用" 2011 KS035

丁树良、罗芬等：《项目反应理论新进展专题研究》，北京：北京师范大学出版社, 2012年第109页。
杜文久：《高等项目反应理论》，重庆：西南师大出版社，2007年第71-88页。
杜文久：《高等项目反应理论》，重庆：西南师大出版社，2007年第153-156页。
沈南山：《基于IRT模型的数学学业成就水平测试分析》，《安徽师范大学学报》(社科版)2012年第1期。
Lord，F.M.Applications of item response theory to practical testing problems. Hillsdale, NJ: Lawrence Erlbaum Associates, 1980.
王晓华、文剑冰：《项目反应理论在教育考试命题质量评价中的应用》，《教育科学》2010年第3期。
朱德全、宋乃庆：《教育统计与测评技术》，重庆：西南师大出版社，2008年第67页。
赵守盈、石艳梅等：《项目反应理论在大规模选拔性考试试题质量评价中的应用》，《教育学报》2013年第1期。
Hambleton, R.K. Swaminathan，H.Item Response Theory:Principles and Applications.Kluwer-Nijhoff Publishing, 1985.
陈谨、何静等：《英语标准化考试评价中IRT与CTT的比较研究》，《数学的实践与认识》2011年第20期。
何穗、吴慧萍：《基于教育测量理论的中学数学试卷质量评价研究》，《教育测量与评价》(理论版)2012年第8期。
计量
- 文章访问数: 214
- HTML全文浏览量: 171
- PDF下载量: 1
- 被引次数: 0
出版历程
- 刊出日期: 2014-09-20

Evaluation of Examination in Math in College Entrance Examination: A Empirical Comparative Study on CTT and IRT

1.
School of Mathematics and Computer engineering, Xi'an University, Xi'an 710065, China
2.
School of Mathematics and Statistics, Southwest University, Chongqing 400715, China
3.
School of Chongqing Educational Examination, 401147, China

摘要

摘要: 相关研究表明，IRT在教育考试评价中比CTT具有诸多优点。本文以某地区高考数学考试数据为基础，比较CTT与IRT在项目参数、评价方式、精度估计三个方面之间的差异。研究结果证明，在IRT下参数更容易反映观测各个项目的特征属性，IRT参数比CTT参数更具精确性，项目信息函数能更好的反映试题信息；CTT与IRT的评价方式不同，IRT下的能力分数优于CTT下的测验分数，更能反映学生能力水平；CTT与IRT精度估计不同，IRT测验信息函数和能力置信区间比CTT有更好的精度。实证展示出IRT在高考数学考试评价中的优越性，具有重要的价值和应用前景。
- CTT /
- IRT /
- 考试评价
Abstract: The previous research shows that ITR has more merits than CTT in the evaluation of examination. Based on the data collected in math subject in National College Entrance Examination of one district, the paper compares the differences between CTT and IRT on project parameters, ways of evaluation and accuracy assessment. The research shows that the characteristics of each project properties are more easily reflected under the IRT parameters, and the IRT parameters are more accurate than the CTT. The function of the project information can reflect the information of the tests better. IRT and CTT have different evaluation ways and the scores under the evaluation of IRT, which represents students' abilities, are higher than those of CTT. CTT is different from IRT in precision estimation, but the test information function and confidence interval of IRT are more accurate than that of CTT. The Empirical study shows the advantages of IRT in evaluating the math subject in College Entrance Examination, which is valuable and has potential applications.
- CTT /
- IRT /
- Evaluation of Examination
注释:

1) 丁树良、罗芬等：《项目反应理论新进展专题研究》，北京：北京师范大学出版社, 2012年第109页。

2) 杜文久：《高等项目反应理论》，重庆：西南师大出版社，2007年第71-88页。

3) 杜文久：《高等项目反应理论》，重庆：西南师大出版社，2007年第153-156页。

4) 沈南山：《基于IRT模型的数学学业成就水平测试分析》，《安徽师范大学学报》(社科版)2012年第1期。

5) Lord，F.M.Applications of item response theory to practical testing problems. Hillsdale, NJ: Lawrence Erlbaum Associates, 1980.

6) 王晓华、文剑冰：《项目反应理论在教育考试命题质量评价中的应用》，《教育科学》2010年第3期。

7) 朱德全、宋乃庆：《教育统计与测评技术》，重庆：西南师大出版社，2008年第67页。

8) 赵守盈、石艳梅等：《项目反应理论在大规模选拔性考试试题质量评价中的应用》，《教育学报》2013年第1期。

9) Hambleton, R.K. Swaminathan，H.Item Response Theory:Principles and Applications.Kluwer-Nijhoff Publishing, 1985.

10) 陈谨、何静等：《英语标准化考试评价中IRT与CTT的比较研究》，《数学的实践与认识》2011年第20期。

11) 何穗、吴慧萍：《基于教育测量理论的中学数学试卷质量评价研究》，《教育测量与评价》(理论版)2012年第8期。

HTML全文

图 1 CTT与IRT难度参数对比图

下载: 全尺寸图片幻灯片

图 2 CTT与IRT区分度参数对比图

下载: 全尺寸图片幻灯片

图 3 第11题项目信息函数

下载: 全尺寸图片幻灯片

图 4 第11题项目特征曲线

下载: 全尺寸图片幻灯片

图 5 第12题项目信息函数

下载: 全尺寸图片幻灯片

图 6 第12题项目特征曲线

下载: 全尺寸图片幻灯片

图 7 测验分数分布

下载: 全尺寸图片幻灯片

图 8 能力分数分布

下载: 全尺寸图片幻灯片

图 9 测验信息函数

下载: 全尺寸图片幻灯片

图 10 能力分数置信区间

下载: 全尺寸图片幻灯片

表 1 IRT与CTT项目指标分布图

项目	a	b	c	难度	区分度
1(1)	0.18	-9.90	0.25	0.94	0.42
2(1)	1.30	-1.22	0.25	0.89	0.52
3(1)	1.48	-1.38	0.25	0.92	0.52
4(1)	1.43	-1.11	0.25	0.89	0.54
5(1)	1.27	-1.26	0.25	0.90	0.52
6(1)	1.70	-0.82	0.25	0.86	0.58
7(1)	0.95	-0.05	0.25	0.65	0.44
8(1)	1.70	0.03	0.25	0.66	0.50
9(1)	2.66	0.77	0.25	0.38	0.40
10(1)	2.01	0.97	0.25	0.39	0.24
11(1)	1.28	-1.64	0.00	0.92	0.52
12(1)	1.75	0.03	0.00	0.54	0.62
13(1)	1.29	0.07	0.00	0.51	0.55
14(1)	1.54	0.72	0.00	0.25	0.46
15(1)	1.18	1.61	0.00	0.08	0.27
16(1)	3.51	-0.74	0.00	0.68	0.80
16(2)	2.37	-0.38	0.00	0.68	0.80
16(3)	1.88	-0.12	0.00	0.68	0.80
16(4)	1.52	0.45	0.00	0.68	0.80
17(1)	1.77	-0.68	0.00	0.74	0.74
17(2)	1.56	-0.21	0.00	0.74	0.74
17(3)	1.43	-0.07	0.00	0.74	0.74
18(1)	2.40	-0.72	0.00	0.63	0.80
18(2)	2.17	-0.53	0.00	0.63	0.80
18(3)	2.16	-0.41	0.00	0.63	0.80
18(4)	1.79	0.58	0.00	0.63	0.80
18(5)	1.46	1.05	0.00	0.63	0.80
19(1)	1.56	-1.00	0.00	0.56	0.75
19(2)	2.05	0.20	0.00	0.56	0.75
19(3)	2.44	0.52	0.00	0.56	0.75
19(4)	2.16	0.67	0.00	0.56	0.75
20(1)	1.92	-0.17	0.00	0.42	0.75
20(2)	1.82	-0.02	0.00	0.42	0.75
20(3)	2.71	0.50	0.00	0.42	0.75
20(4)	3.55	0.95	0.00	0.42	0.75
21(1)	1.92	-0.07	0.00	0.22	0.68
21(2)	1.97	0.43	0.00	0.22	0.68
21(3)	111.30	288.83	0.00	0.22	0.68
21(4)	111.14	318.49	0.00	0.22	0.68
21(5)	111.01	320.20	0.00	0.22	0.68
说明：平均分:90，标准差:31.69，信度rx:0.84，测验标准误:12.52

下载: 导出CSV

表 2 难度与区分度参数分布表

难度区、分度分布范围	0~0.30	0.30~0.70	0.7~1.00
难度题数	3	10	8
区分度题数	2	15	4
平均分	标准差	信度	测验标准误
90	31.69	0.84	12.52

下载: 导出CSV

表 3 项目难度与区分度参数分布

难度b	-2以下	-2~2	2以上
题数	1	36	3
区分度a	0.5以下	0.5~2	2以上
题数	1	24	15

下载: 导出CSV

参考文献(0)

施引文献

资源附件(0)

访问统计

点击查看大图

图(10) / 表(3)

计量

文章访问数: 214
HTML全文浏览量: 171
PDF下载量: 1
被引次数: 0

姓名
邮箱
手机号码
标题
留言内容
验证码

留言板

高考数学中考试评价的研究——基于CTT与IRT的实证比较

计量

Evaluation of Examination in Math in College Entrance Examination: A Empirical Comparative Study on CTT and IRT

计量

目录

留言板

高考数学中考试评价的研究——基于CTT与IRT的实证比较

计量

出版历程

Evaluation of Examination in Math in College Entrance Examination: A Empirical Comparative Study on CTT and IRT

计量

出版历程

目录