
Author: 張宴晟
Yen-Sheng Chang
Thesis title: 擴展反應型論述題反應之自動化評估方法 - 以教師教學能力為例
Automated Scoring Methods for Extended Response Essay Item - Case Study of Instructional Abilities
Advisors: 張國恩
Chang, Kuo-En
Sung, Yao-Ting
Degree: Master's
Department: 資訊教育研究所
Graduate Institute of Information and Computer Education
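Publication year: 2008
Academic year of graduation: 96 (ROC calendar)
Language: Chinese
Number of pages: 100
Chinese keywords: 擴展反應型論述題 (extended-response essay item), 自動批改 (automatic scoring), 概念比對 (concept matching)
English keywords: automatic scoring, concept matching
Thesis type: Academic thesis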
    An extended-response essay item is one in which examinees organize, integrate, and evaluate relevant ideas and concepts in their answers according to their own best judgment. Extended-response essay items can measure more complex, higher-level learning outcomes than selected-response items, but grading them takes a great deal of teachers' time. Automatic scoring systems could reduce this grading burden, yet no practical tool currently exists that can automatically score extended-response essay items. This thesis therefore reviews domestic and international research on automated essay scoring, proposes an automatic scoring model based on concept matching, and examines the model's effectiveness and directions for future development.
    The participants were 89 first-year students enrolled in a course at National Taipei University of Education. A questionnaire of 10 questions on teachers' instructional abilities was prepared for the automatic scoring experiment. The results show that (1) the proposed method performs quite well, although its accuracy still falls short of what practical application requires; and (2) the quality of the corpus greatly affects the automatic scoring performance of the proposed method.

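    List of Tables
    List of Figures
    Chapter 1 Introduction
        Section 1 Research Background and Motivation
        Section 2 Research Objectives
    Chapter 2 Literature Review
        Section 1 Instructional Assessment
        Section 2 Review of Automated Essay Grading Systems
        Section 3 Natural Language Processing Techniques
        Section 4 Comprehensive Analysis
    Chapter 3 Automated Scoring Method
        Section 1 Concept Extraction
        Section 2 Building the Concept Set
        Section 3 Concept Matching
        Section 4 Scoring Module
    Chapter 4 Experimental Design
        Section 1 Experimental Tools
        Section 2 Experimental Data
        Section 3 Experimental Procedure
        Section 4 Experimental Results
        Section 5 Discussion of Experimental Results
    Chapter 5 Conclusions and Future Work
    References
    Appendix 1 Academia Sinica Balanced Corpus Part-of-Speech Tag Set
    Appendix 2 Mapping Between Simplified and Reduced Part-of-Speech Tags
    Appendix 3 System Implementation Screenshots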






