研究生: |
黃聖育 Huang, Sheng-Yu |
---|---|
論文名稱: |
多層次嵌套資料加權估計策略之比較研究:以TALIS資料庫為例 Comparison of the weighting methods on the multi-level data analysis: Taking TALIS databases as an example |
指導教授: |
邱皓政
Chiou, Haw-Jeng |
口試委員: |
施人英
Shih, Jen-Ying 李澄賢 LI, Cheng-Hsien |
口試日期: | 2021/07/28 |
學位類別: |
碩士 Master |
系所名稱: |
管理研究所 Graduate Institute of Management |
論文出版年: | 2021 |
畢業學年度: | 109 |
語文別: | 中文 |
論文頁數: | 65 |
中文關鍵詞: | 加權 、多層次模型 、樣本規模 、強韌最大概似估計法 、斜對角線加權最小平方法 |
英文關鍵詞: | weighting, multi-level modeling, sample size, robust maximum likelihood estimator, diagonally weighted least squares |
研究方法: | 次級資料分析 、 比較研究 |
DOI URL: | http://doi.org/10.6345/NTNU202101060 |
論文種類: | 學術論文 |
相關次數: | 點閱:142 下載:12 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
調查研究需仰賴嚴謹的抽樣設計使樣本反應母體的狀況,若研究者使用分層抽樣使得資料帶有嵌套特性,採用多層次模型(multi-level modeling, MLM)能夠就個體與總體不同層次的數據進行整合分析。當多層次模型考慮到加權議題時,樣本除了其個體層加權,會因其歸屬的組別(顯性或隱性分層變數)而再次調整權重,進一步地影響估計結果。為探討加權在階層模型係數估計議題,本研究使用經濟合作暨發展組織(OECD)主導的教學與學習國際調查(Teaching and Learning International Survey, TALIS)探討估計法、加權以及樣本規模對係數估計之影響。
TALIS資料庫的抽樣設計為二階段抽樣法,台灣於2018年所參與的調查工作共抽取553所學校(含高中、國中及國小,總體層次),共10,129名教師(個體層)。本研究採用國小部分資料(200所學校與3,494名教師)進行多層次分析。為了瞭解樣本規模的影響,本研究依據TALIS分層抽樣的顯性分層(公私立別、城鄉別),設計五種樣本規模(200、150、100、50與20)。基於工作壓力模式,在個體層次至入教職定向、壓力指數兩變數;總體層次至入學校公私立別、城鄉別(都市、市鎮與鄉村),解釋國小教師產生教職退縮(後悔從事教職)的多層次模型。另外,由於依變數為Likert五點量表,估計法以適合數值為量尺型態的MLR與WLSMV,並設置使用加權估計與不使用加權估計兩種設定。在五種樣本規模下,透過MLR與WLSMV反覆10次估計個體層與總體層係數,藉由反覆估計平均數與變異數,對照使用加權與未使用加權的係數估計結果。
研究結果顯示在樣本量充足的情況下(150所學校與100所),MLR與WLSMV兩種估計法在加權與未加權情況下的參數估計表現皆穩定,但WLSMV的參數估計數值普遍大於MLR估計數值約1.5至2倍,以及WLSMV估計法的標準誤明顯比MLR小,所以有著較好的檢定力,因此WLSMV會比MLR更適合被使用在依變數為分類變數的研究。不過當樣本不足時,兩種估計法在總體層次係數估計皆出現過度校正的情況,但在使用加權估計的情況下,過度校正的次數較少發生。在比較加權估計與未加權估計方面,本研究結果發現加權會降低估計的穩定度,但同時也會減少總體層級變數的估計偏誤度,因此在小樣本下層級係數雖有負數的估計結果,但仍能透過加權減少估計法過度修正的情況。綜合本研究分析結果,認為加權估計雖會降低總體係數估計精準度,但會提升係數估計的標準誤,使得估計較不穩定。因此建議研究者建立加權估計對照組以及使用不同估計法多方比較加權對係數估計之影響。
Survey research relies on rigorous sampling design to make the sample reflecting the status of the population. If the researcher implements stratified sampling, and use multi-level modeling (MLM), which could help researcher to estimate parameters from the individual level(level-1) and macro level(level-2), simultaneously. When the MLM integrated weighting issues, the sample shouldn’t only adjust by its own weights on level-1, but also have to be adjusted again by the level-2 weights (explicit stratification or implicit stratification) because of the group which the sample belongs to. In order to examine the weighting issue on coefficient estimation at MLM, this study uses the Teaching and Learning International Survey (TALIS) led by the Organization for Economic Cooperation and Development (OECD).
The sampling design of the TALIS database is two-stage sampling method. The TALIS Taiwan survey select the number of 553 schools (including elementary schools, junior high schools, high schools, level-2), and the number of 10,129 teachers (individual level-1) in 2018. Using the elementary schools data (200 schools and 3,494 teachers) to conduct multi-level analysis. In order to figure out the impact of sample size, there design five sample sizes which is based on the explicit stratification of TALIS stratified sampling (public and private & urban, town and rural). Since the dependent variable is the Likert five-point scale, using the appropriate estimation method MLR and WLSMV.
Results indicate that, MLR and WLSMV are suitable for estimation under the sufficient sample size ( both of the change rate of repeating sampling variation and the change rate of coefficient on 150 schools and 100 schools are stable). Moreover, WLSMV produce samller standard error than MLR. For the case of the dependent variable of research taking categorical dependent variable, WLMSV is better choice than MLR. In terms of comparing weighted and unweighted results, although the weighting could reduce the bias of level-2 variables, it also increasing the instability (standard error) at level-1 and level-2 variables. It is recommended that the control group of weighting treatment should be used with different estimatiors to examine the results of weighted coefficient estimates.
任宗浩、譚克平、張立民(2011)。二階段分層叢集抽樣的設計效應估計: 以TIMSS 2007 調查研究為例。教育科學研究期刊,56(1),33-65。
柯華葳、陳明蕾、李俊仁、陳冠銘(2019)。2018 教學與學習國際調查臺灣報告:綜整報告。臺北市:國家教育研究院。
邱皓政(2017)。多層次模式與縱貫資料分析:Mplus8解析應用。台北,五南圖書出版股份有限公司。
邱皓政(2020)。貝氏統計:原理與應用。台北,雙葉書廊有限公司。
溫福星(2006)。階層線性模式:原理,方法與應用。台北,雙葉書廊有限公司。
劉雅惠(2011)。中小學教師工作壓力之探究.社團法人中華民國學校行政研究學,72,77-98。
Asparouhov, T. (2006). General multi-level modeling with sampling weights. Communications in Statistics—Theory and Methods, 35(3), 439-460.
Asparouhov, T., & Muthén, B. (2006). Multilevel modeling of complex survey data. In Proceedings of the joint statistical meeting in Seattle (pp. 2718-2726).
Brick, J. M., & Kalton, G. (1996). Handling missing data in survey research. Statistical Methods in Medical Research, 5(3), 215-238
Cohen, J. (1988). Statistical Power Analysis for the Behavioral Sciences. Second Edition. Hillsdale, NJ: Lawrence Erlbaum Associates, Publishers.
Fowler Jr, F. J. (2013). Survey Research Methods. Sage publications.
Gelman, A. (2007). Struggles with survey weighting and regression modeling. Statistical Science, 22(2), 153-164.
Jerrim, J., Lopez-Agudo, L. A., Marcenaro-Gutierrez, O. D., & Shure, N. (2017). What happens when econometrics and psychometrics collide? An example using the PISA data. Economics of Education Review, 61, 51-58.
Johnson, D. R. (2008). Using weights in the analysis of survey data. Presentation prepared for the Population Research Institute, Pennsylvania State University, November.
Johnson, D. R., & Creech, J. C. (1983). Ordinal measures in multiple indicator models: A simulation study of categorization error. American Sociological Review, 3,398-407.
Kalton, G., & Flores-Cervantes, I. (2003). Weighting methods. Journal of Official Statistics, 19(2), 81-97.
Ko, H. W., Chen, M. L., Lee, J. R., & Chen, K.-M. (2019). Teaching and Learning International Survey (TALIS) 2018 Taiwan report: Elementary, junior high, & senior high educations. New Taipei City, Taiwan: National Academy for Educational Research.
Lavy, V. (2015). Do differences in schools’ instruction time explain international achievement gaps? Evidence from developed and developing countries. The Economic Journal, 125(588), F397-F424.
Levy, P. S., & Lemeshow, S. (2013). Sampling of Populations: Methods and Applications. John Wiley & Sons.
Li, C. H. (2016a). Confirmatory factor analysis with ordinal data: Comparing robust maximum likelihood and diagonally weighted least squares. Behavior Research Methods, 48(3), 936-949.
Li, C. H. (2016b). The performance of ML, DWLS, and ULS estimation with robust corrections in structural equation models with ordinal variables. Psychological Methods, 21(3), 369.
Li, C. H. (2021). Statistical estimation of structural equation models with a mixture of continuous and categorical observed variables. Behavior Research Methods, 1 23. https://doi.org/10.3758/s13428-021-01547-z
Litt, M. D., & Turk, D. C. (1985). Sources of stress and dissatisfaction in experienced high school teachers. The Journal of Educational Research, 78(3), 178-185. doi:10.1080/00220671.19 85.10885596
Luke, D. A. (2011). Basic multilevel modeling. Multilevel modeling. 10-53. https://dx.doi.org/10.4135/9781412985147.n2
Lynn, P. (2019). The advantage and disadvantage of implicitly stratified sampling. Methods, Data, Analyses, 13(2), 14.
Miller, J. G. (1975). The nature of living. Behavior Science, 20, 334-365
Muthén, B. O. (1993). Goodness of fit with categorical and other nonnormal variables. SAGE Focus Editions, 154, 205-205.
Muthén, L.K., &Muthén, B.O. (1998-2017). Mplus User’s Guide. Eighth Edition. Los Angeles, CA: Muthén & Muthén
Muthén, L. K., & Muthén, B. O. (2008) Scaling of Sampling Weights For Two Level Models in Mplus 4.2. Mplus https://www.statmodel.com/download/Scaling3.pdf
Muthén, L. K., & Muthén, B. O. (2009). Mplus. Statistical analysis with laten variables. User's guide,8.
Rhemtulla, M., Brosseau-Liard, P. É., & Savalei, V. (2012). When can categorical variables be treated as continuous? A comparison of robust continuous and categorical SEM estimation methods under suboptimal conditions. Psychological Methods, 17(3), 354.
Tellenback, S., Brenner, S.O., & Löfgren, H. (1983). Teacher stress: Exploratory model building. Journal of Occupational Psychology, 56(1), 19-33.
White, D. B. (1992). Sampling of populations: Methods and applications (Paul Levy and Stanley Lemeshow). SIAM Review, 34(2), 347-349.
Winship, C., & Radbill, L. (1994). Sampling weights and regression analysis. Sociological Methods & Research, 23(2), 230-257.