簡易檢索 / 詳目顯示

研究生: 陳奕璇
Chen, Yi-Hsuan
論文名稱: 利用混合模型探討英文詞彙測驗受測者之作答順序
指導教授: 蔡蓉青
學位類別: 碩士
Master
系所名稱: 數學系
Department of Mathematics
論文出版年: 2017
畢業學年度: 105
語文別: 中文
論文頁數: 46
中文關鍵詞: 混合模型英文詞彙測驗作答順序
英文關鍵詞: Mixture model, Vocabulary levels test, Answering order
DOI URL: https://doi.org/10.6345/NTNU202202812
論文種類: 學術論文
相關次數: 點閱:81下載:1
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 研究第一語言及第二語言的研究者認為詞彙量對於語言能力有著重要的意義。而英文詞彙測驗(Vocabulary Levels Test, VLT)是一個廣泛應用在測試詞彙量的測驗,可有效確認受測者是否擁有足夠的詞彙量。由於英文詞彙測驗有10個題組的試題結構,文獻中已提出利用二參數英文詞彙測驗的依序試題模型(VLT-Sequential Model, VSM),解決題組內題目間所存在的相依性的問題。當應用該模型來分析實徵資料時,出現配適不佳的狀況。除了考量該測驗中配適不佳的試題是否具有其特殊性之外,希望進一步地對於二參數VSM模型中所假設的全部受測者都是按照題目難易度順序作答的合理性進行檢定,故本論文將建構混合二參數VSM模型,進一步考慮也存在有按照題目順序作答的族群的可能性。在模擬研究中,使用邊際最大概似估計來估計參數,確認混合二參數VSM模型估計的有效性,並觀察當資料中同時存在有按照題目順序及題目難易度順序作答的受測者時的作答,卻僅使用二參數VSM模型,在參數估計上會有何影響。分析實徵資料3,000字等級的英文詞彙測驗中,確認二參數VSM模型配適度不佳的情況與按照題目作答順序無關,而在5,000字等級的英文詞彙測驗結果中發現,混合二參數VSM模型表現優於二參數VSM模型,更進一步,如果忽略其實存在有按照題目順序作答的這個族群,僅用二參數VSM模型來進行分析,將會低估這類受測者的能力。

    Vocabulary knowledge is considered by both first-language and second-language researchers to be of great significance in language competence. Vocabulary Levels Test (VLT) is commonly used and shown effective in measuring learner’s vocabulary size. The test consists of ten item clusters, and two-parameter logistic VLT-Sequential Model(2PL-VSM) considers taking into account the dependency structure among items within a cluster in modeling VLT data. However, some clusters exhibit misfit while fitting the actual 3000-level data with a 2PL-VSM.The purpose of this study is to construct a two-parameter logistic Mixture VLT-Sequential Model(2PL-MVSM) that relaxes the assumption that the items within a cluster are answered in the order of their item difficulty, from easiest to the most difficult ones, and allows some examinees to answer these items in their order presented in the original test, i.e. item number. In the simulation study, we investigate the effect of ignoring the examinees answering items within a cluster by item number by comparing the estimation results from the 2PL-VSM to those from the 2PL-MVSM. In analyzing the 3000-level and 5000-level VLT data, the results show no necessity to include the additional class of answering-by-item-number examinees for 3000-level data, whereas 2PL-MVSM is shown to be superior over 2PL-VSM in the actual 5000-level data. Furthermore, the abilities of the examinees who answer the items by item number would be under-estimated if only 2PL-VSM is fit to the 5000-level data.

    目錄 誌謝 iii 摘要 v Abstract vii 1 緒論 1 2 模型 3 2.1 二參數VSM 3 2.1.1 模型發展 3 2.1.2 模型假設 3 2.1.3 題組結構 4 2.2 混合二參數VSM 6 2.2.1 模型假設 6 2.2.2 題組結構 6 2.2.3 試題反應機率函數 7 2.3 估計 8 3 模擬研究 15 3.1 生成資料 15 3.1.1 題目參數 15 3.1.2 樣本數 16 3.1.3 比例參數 16 3.1.4 能力參數 16 3.2 結果分析 16 3.2.1 混合二參數VSM 的估計有效性 16 3.2.2 二參數VSM 估計的誤差情況 18 3.2.3 分群正確率與能力估計比較 24 4 實徵資料 33 4.1 VLT 資料分析 33 4.1.1 模型比較 33 4.1.2 參數估計 34 4.1.3 模型配適度 38 5 討論與結論 43 參考文獻 45 中文文獻 45 英文文獻 45

    中文文獻
    賴國棟(2016)。英文詞彙測驗試題反應模型之建構與檢定。國立臺灣師範大學數學研究所碩士論文,未出版。台北市。

    英文文獻
    Abramowitz, M., & Stegun, I. A. (1972). Handbook of mathematical functions with formulas, graphs, and mathematical tables (9th ed.). Dover, New York.
    Akaike, H. (1974). A new look at the statistical identification model. IEEE Transactions on Automatic Control, 19, 716–723.
    Beglar, D., & Hunt, A. (1999). Revising and validating the 2000 word level and university word level vocabulary tests. Language Testing, 16(2), 131–162.
    DeMars, C. E. (2006). Application of the bi-factor multidimensional item response theory model to testlet-based tests. Educational Measurement, 43(2), 145–168.
    Drasgow, F. (1989). An evaluation of marginal maximum likelihood estimation for the two-parameter logistic model. Applied Psychological Measurement, 13(1), 77–90.
    Frederiksen, J. R. (1982). A componential theory of reading skills and their interactions. Advances in the psychology of human intelligence, 1, 125–180.
    Grabe, W. (1991). Current developments in second language reading research. TESOL Quarterly, 25(3), 375–406.
    Hulin, C. L., Lissak, R. I., & Drasgow, F. (1982). Recovery of two-and three-parameter logistic item characteristic curves: A Monte Carlo study. Applied psychological measurement, 6(3), 249–260.
    Linacre, J.,& Wright, B. (1993). A user’s guide to bigsteps: Rasch–model computer program. Mesa Press.
    Nation, I. S. P. (1990). Teaching and learning vocabulary. Boston,MA:Heinle and Heinle.
    Nation, I. S. P. (1999). A vocabulary-size test of controlled productive ability. Language Testing, 16(1), 33–51.
    Neyman, J., & Scott, E. L. (1948). Consistent estimates based on partially consistent observations. Econometrica: Journal of the Econometric Society, 1–32.
    Schmitt, N. (2000). Vocabulary in language teaching. Ernst Klett Sprachen.
    Schmitt, N., Schmitt, D., & Clapham, C. (2001). Developing and exploring the behaviour of two new versions of the vocabulary levels test. Language Testing, 18(1), 55–88.
    Schwarz, G. (1978). Estimating the dimension of a model. The annals of statistics, 6(2), 461–464.

    下載圖示
    QR CODE