簡易檢索 / 詳目顯示

研究生: 莊惟翔
Chuang, Wei-Hsuang
論文名稱: 語音辨識系統上的模仿、比較和回饋方法對華語學習者「發音」的影響
Effects of imitation, discrimination and feedback in a speech recognition system on CFL pronunciation
指導教授: 張國恩
Chang, Kuo-En
宋曜廷
Sung, Yao-Ting
劉子鍵
Liu, Tzu-Chien
學位類別: 碩士
Master
系所名稱: 資訊教育研究所
Graduate Institute of Information and Computer Education
論文出版年: 2018
畢業學年度: 106
語文別: 中文
論文頁數: 90
中文關鍵詞: 語音辨識聽後模仿電腦輔助發音學習比較辨別模仿回饋
英文關鍵詞: ASR, CAPT, Speech imitation, Speech discrimination, Tutor feedback
DOI URL: http://doi.org/10.6345/THE.NTNU.GICE.006.2018.F02
論文種類: 學術論文
相關次數: 點閱:218下載:4
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 對於華語學習者而言,溝通是其學習中文的重要目的,而要能清楚傳遞訊息,進行溝通,勢必需要正確的發音。伴隨著網路科技及語音辨識系統的發展,電腦輔助發音學習正在流行,由傳統教室透過老師大班教學的方式轉變成使用新穎的資訊科技與學習理論的結合教學,讓學生可以自主學習,老師在課堂上可以將教學重心放在更高層次的教學。
    本研究針對學生的學習,提出使用聽後模仿、比較辨別與模仿回饋方法,結合語音辨識技術,進行電腦輔助發音教學,幫助學習者口說學習。研究方法間的結合,是否能協助學習者,在進行口說單字練習時,可以快速矯正其錯誤發音,增強學習者自主學習的成效。本研究的實際研究對象為60位外國學習者。透過隨機分派的方式,分成四組,分別針對不同方法的結合,進行比較,並了解學習者在學習後是否有所進步。
    研究結果顯示:
    一、 參與者的背景可能影響口說學習成效,在本研究的學習者,來自漢字學習圈的平均成績高於非漢字學習圈的學習者。
    二、 集合不同方法融入於電腦輔助口說學習的方式,讓各個組別在後測成績上均有進步。
    三、 將三種方法結合使用的實驗組一,在口說成效測驗上顯著高於只有使用聽後模仿方法的控制組。
    四、 在滿意度問卷的調查上,學習者大多給予肯定的評價。控制組的分數,略低於其他組別。
    最後,研究者針對研究結果與訪談內容進行討論,並提出幾點建議與未來研究方向以供日後研究與參考。

    For all the Chinese learners, the attainment of successful communication is essential. And the correct articulation/pronunciation is then necessary if we want to get our meaning across in a lucid style. Along with the advancement of technology and the development of speech recognition device, Computer Assisted Pronunciation Teaching (CAPT) is now under way and incorporated into teaching which conjoins both information technology and pedagogical theory. This sort of technical invention as a medium of pronunciation diagnosis not only helps students benefit from the pleasure of independent learning but also allows teachers to focus more on the quality of the teaching materials per se.

    This study proposes three strategies as what follows: Speech Imitation, Speech Discrimination, and Automatic Feedback after speakers imitate the correct sound given. Besides, Automatic Speech Recognition (ASR) would be applied in conjunction with these strategies to facilitate CAPT and also encourage learners to practise Chinese pronunciation. This research aims to argue that the different combinations of strategies may shed light on the correct sound acquisition of learners. Whilst practising monosyllables’ pronunciation, the deployment of the strategies are found to be helpful in rectifying their mispronunciation to several levels.

    The participants in this research include 60 foreign learners of Chinese language, and they had been randomised into 4 groups. Each group is designed to undergo a specific combination of strategies. Lastly, we’d evaluate and compare the learning effect of speakers amongst the four groups.

    The result shows that:
    1. The background of participants would influence the Chinese pronunciation acquisition. Those who are of the Sino-sphere (Chinese cultural sphere) usually have a more desirable performance than those who aren’t.
    2. Each group makes progress in the posttest, yet the degree of progress varies across groups.
    3. The upshot of the pronunciation test of the group which taps into all three strategies is more satisfactory than that of the group which uses the sole strategy ‘Automatic Feddback.’
    4. From the questionnaire in Appendix, we know that most participants have positive feedback towards this system.

    In the last chapter, we will further our discussion based on the conclusion and the interviews with participants, and summarise this paper with some advices and potential research directions..

    目錄 附表目錄 VII 附圖目錄 IX 第一章 緒論 1 第一節 研究背景與動機 1 第二節 研究目的 5 第三節 研究問題 6 第四節 名詞釋義 7 第二章 文獻探討 11 第一節 華語口說學習的困難 11 第二節 華語口說學習方法 14 第三節 語音辨識系統輔助口說學習現況 20 第三章 系統設計 27 第一節 設計理念 27 第二節 系統架構 28 第三節 教材內容 32 第四章 研究方法 34 第一節 參與者 34 第二節 實驗設計 35 第三節 實驗工具 37 第四節 實驗程序 44 第五章 結果與討論 46 第一節 參與者背景分析 46 第二節 口說學習成效測驗成效分析 49 第三節 滿意度結果 55 第四節 討論 62 第六章 結論與建議 73 第一節 結論 73 第二節 建議 75 第三節 未來研究方向 76 參考文獻 78 中文部分 78 英文部分 81 附錄一:滿意度問卷 86 附錄二:訪談表 88 附錄三:學習者者國籍列表 89 附錄四:學習者母語列表 90

    參考文獻
    中文部分
    張羽良(2007,9月)。正視全球中文熱。新紀元週刊。2008年10月9日,取自:http://mag.epochtimes.com/042/3757.htm
    韋樞(2007,2月)。美國首創中文AP考試 影響深遠。大紀元新聞。2008年12月26日,取自http://www.epochtimes.com/b5/7/2/23/n1628467.htm
    廖修寬 (2013)。善用東風-從全球中文熱推展國際教育. 臺 灣 教 育 評 論 月 刊, 70.
    張良民 (2006)。全球華語學習熱潮與僑教發展.
    葉德明 (2005)。華語文教學的推廣與展望: 載於淡江大學兩岸三地成人教育學術研討會論文集.
    陳慶華 (2007)。日籍學習者華語發音之偏誤分析矯正與教學研究. 臺灣師範大學華語文教學研究所學位論文, 1-143.
    閆麗萍 (1998)。漢語聽力教學中將聽和說有機地結合起來── 對 “聽後模仿” 聽力訓練法的一些思考. 語言與翻譯, 1, 020.
    曹文 (1998)。英語文化教學的兩個層次. 外語教學與研究: 外國語文雙月刊(3), 12-16.
    陳竑媝 (2008)。國小學童英語音素能力的電腦化訓練環境。國立台灣師範大學資訊教育所碩士論文,未出版,台北市。
    國立臺灣師範大學國音教材編輯委員會 (2012)。《國音學》。臺北:正中書局
    黃伯榮, & 廖旭東 (2007)。現代漢語. 北京: 高等教育出版杜.
    蔡秉倫 (2007)。外國學生華語詞彙聲調的發音與感知(碩士論文)。取自http://140.113.39.130/cgi-bin/gs32/hugsweb.cgi?o=dnthucdr&s=%22GH000934709%22.id.&
    蔡茜伃 (2012)。以中文為外語學習者之聲調表現研究(碩士論文)。取自http://etds.lib.ncku.edu.tw/etdservice/view_metadata?etdun=U0026-2008201200000200
    劉珣 (2002)。漢語作為第二語言教學簡論。北京:北京語言大學出版社。
    孫月 (2007)。對外漢語語音教學難點。今日湖北(理論版)。湖北省新聞工作者協會,3,076。
    鄭旭倫 (2013)。探討英語為母語及英語非母語寫作諮詢教師之信念與實務及學生感受. 交通大學英語教學研究所學位論文, 1-134.
    孟慶明 (2007)。在美國非華語環境下中文教學方法之行動研究. 臺灣師範大學華語文教學研究所學位論文, 1-165.
    吳青蓉 (2001)。英語學習歷程模式之驗證暨 [主題建構式語言學習方法] 對國中生英語學習表現影響之研究. 臺灣師範大學教育心理與輔導學系學位論文, 1-312.
    邱麗娜 (2009)。從語塊學習與重複模仿談大學英語口語教學.
    丁言仁 (2008)。课文背诵与模仿——成功英语学习者的学习经验. 中国外语教育, 2, 19-26.
    凌國樺, & 許皓捷. (2011). 鳥音回播在低海拔鳥類相調查之應用. 環境與生態學報, 4(1), 1-16.
    范氏梅 (2005)。越南留学生声调偏误分析及教学对策. 玉林师范学院学报, 26(2), 121-123.
    周福強、曾金金 (2005)。語音辨識技術於網路華語發音教學與評鑑系統之應用。第九屆全球華人計算機教育應用大會論文集,夏威夷:夏威夷楊百翰大學。
    廖埻棓、曾金金 (2005) 。符合SCORM標準的華語語音數位教學設計,第四屆全球華文網路教學研討會論文集,臺北:中華民國僑務委員會。
    邵寶誼 (2005) 。線上漢語拼音網站簡介。第四屆全球華文網路教學研討會論文集,379-386。臺北:中華民國僑務委員會。
    陳麗宇、譚詠瑜 (2013) 。數位資源輔助華語語音教學分析-以香港粵語母語者為教學對象。第十二屆臺灣華語文教學年會暨國際學術研討會論文集,351-361。臺北:臺灣華語文教學學會。
    曾金金 (2008) 。華語語音資料庫及數位學習應用,臺北:新學林出版社。
    許妙如, & 黃競億. (2004). 整合語音處理技術之電腦輔助語言學習. 電腦與通訊(108), 80-85.
    吳佳諭. (2009). 漢字識字學習系統之設計與應用. 臺灣師範大學資訊教育學系學位論文, 1-119.
    黃苕冠(2007)。外國學生以數位學習方式進行漢字書寫方法學習之成效研究設計。第五屆全球華文網路教育研討會論文集。


    英文部分
    Dilloni, S. (2010, January 20). Foreign Languages Fade in Class — Except Chinese. The New York Times, from the World Wide Web: http://www.nytimes.com/2010/01/21/education/21chinese.html
    Yang, C. (2014, February 9). Foreign Languages Fade in Class — Except Chinese. The New York Times, from the World Wide Web: http://www.nytimes.com/2014/02/10/world/asia/young-laotians-learn-chinese-to-improve-job-prospects.html
    Oxford, R. L. (1990). Language learning strategies: na.
    Williams, L. (1979). The modification of speech perception and production in second-language learning. Perception & Psychophysics, 26(2), 95-104.
    McCrocklin, S. M. (2016). Pronunciation learner autonomy: The potential of Automatic Speech Recognition. System, 57, 25-42.
    Begaye, T. (2007). Native teacher understanding of culture as a concept for curricular inclusion. Wicazo Sa Review, 22(1), 35-52.
    Árva, V., & Medgyes, P. (2000). Native and non-native teachers in the classroom. System, 28(3), 355-372.
    Reves, T., & Medgyes, P. (1994). The non-native English speaking EFL/ESL teacher's self-image: An international survey. System, 22(3), 353-367.
    Johnson, K. E. (1995). Understanding communication in second language classrooms: Cambridge University Press.
    Foote, J. A., Trofimovich, P., Collins, L., & Urzúa, F. S. (2016). Pronunciation teaching practices in communicative second language classes. The Language Learning Journal, 44(2), 181-196.
    Hao, Y.-C., & de Jong, K. (2016). Imitation of second language sounds in relation to L2 perception and production. Journal of Phonetics, 54, 151-168.
    Flege, J. E., & Eefting, W. (1988). Imitation of a VOT continuum by native speakers of English and Spanish: Evidence for phonetic category formation. The Journal of the Acoustical Society of America, 83(2), 729-740.
    Mitterer, H., & Ernestus, M. (2008). The link between speech perception and production is phonological and abstract: Evidence from the shadowing task. Cognition, 109(1), 168-173.
    Mitterer, H., & Müsseler, J. (2013). Regional accent variation in the shadowing task: Evidence for a loose perception–action coupling in speech. Attention, Perception, & Psychophysics, 75(3), 557-575.
    Baker, W., & Trofimovich, P. (2006). Perceptual paths to accurate production of L2 vowels: The role of individual differences. IRAL–International Review of Applied Linguistics in Language Teaching, 44(3), 231-250.
    Goldinger, S. D. (1998). Echoes of echoes? An episodic theory of lexical access. Psychological review, 105(2), 251.
    Segal, O., Hejli-Assi, S., & Kishon-Rabin, L. (2016). The effect of listening experience on the discrimination of/ba/and/pa/in Hebrew-learning and Arabic-learning infants. Infant Behavior and Development, 42, 86-99.
    Bakar, N., Chee, C. Y., Abdullah, L. C., Ratnam, C. T., & Azowa, N. (2014). Effect of methyl methacrylate grafted kenaf on mechanical properties of polyvinyl chloride/ethylene vinyl acetate composites. Composites Part A: Applied Science and Manufacturing, 63, 45-50.
    Linossier, J., Courvoisier, H., & Aubin, T. (2015). The two parts of the blackcap song: Acoustic analysis and male responses to playbacks. Behavioural processes, 121, 87-92.
    Margoliash, D. (2003). Offline learning and the role of autogenous speech: new suggestions from birdsong research. Speech communication, 41(1), 165-178.
    Porayska-Pomsta, K., & Mellish, C. (2013). Modelling human tutors' feedback to inform natural language interfaces for learning. International Journal of Human-Computer Studies, 71(6), 703-724.
    Di Eugenio, B., Lu, X., Kershaw, T. C., Corrigan-Halpern, A., & Ohlsson, S. (2005). Positive and negative verbal feedback for Intelligent Tutoring Systems. Paper presented at the AIED.
    D’mello, S. K., Craig, S. D., Witherspoon, A., Mcdaniel, B., & Graesser, A. (2008). Automatic detection of learner’s affect from conversational cues. User modeling and user-adapted interaction, 18(1-2), 45-80.
    VanLehn, K., Graesser, A. C., Jackson, G. T., Jordan, P., Olney, A., & Rosé, C. P. (2007). When are tutorial dialogues more effective than reading? Cognitive science, 31(1), 3-62.
    Rodgers, C. R. (2006). Attending to student voice: The impact of descriptive feedback on learning and teaching. Curriculum Inquiry, 36(2), 209-237.
    Dewey, J. (1997). How we think: Courier Corporation.
    Thomson, R. I., & Derwing, T. M. (2014). The effectiveness of L2 pronunciation instruction: A narrative review. Applied Linguistics, amu076.
    Levis, J., & Suvorov, R. (2014). Automated speech recognition. The encyclopidia of applied linguistics. Retrieved from http://onlinelibrary. wiley. com/store/10.1002/9781405198431. wbeal0066/asset/wbeal0066. pdf, 1.
    Stone, R. (2013). Cutting-edge technology: The software that started it all. Retrieved from http://www.rosettastone.com/features#link1.
    Auralog. (2013). Tell me more: Click speak discover. Retrieved from: http://store.tellmemore.com/Articles.asp?ID¼294.
    Burlington English. (2014). Burlington English: Meeting the challenges of adult language acquisition. Retrieved from: http://www.burlingtonenglish.com/about.aspx.
    Coniam, D. (1999). Voice recognition software accuracy with second language speakers of English. System, 27(1), 49-64.
    Kim, I. S., (2006). Automatic Speech Recognition: Reliability and Pedagogical Implications for Teaching Pronunciation. Journal of Educational Technology & Society, 9(1), 322-334.
    Celce-Murcia, M., Brinton, D. & Goodwin, J. (2010). Teaching pronunciation: A course book and reference guide. New York: Cambridge University Press.
    Moustroufas, N., & Digalakis, V. (2007). Automatic pronunciation evaluation of foreign speakers using unknown text. Computer Speech & Language, 21(1), 219-230.
    Hincks, R. (2003). Speech technologies for pronunciation feedback and evaluation. ReCALL, 15(01), 3-20.
    Yao-Chi Hsu, Ming-Han Yang, Hsiao-Tsung Hung, Yuwen Hsiung, Yao-Ting Sung, Berlin Chen, "Exploring combinations of various deep neural network based acoustic models and classification techniques for Mandarin mispronunciation detection," the 27th International Conference on Computational Linguistics and Speech Processing (ROCLING 2015), Hsinchu, Taiwan, October 1 - 2, 2015.
    Neri, A., Cucchiarini, C., & Strik, H. (2006). ASR-based corrective feedback on pronunciation: does it really work? Paper presented at the INTERSPEECH.
    Neri, A., Mich, O., Gerosa, M., & Giuliani, D. (2008). The effectiveness of computer assisted pronunciation training for foreign language learning by children. Computer Assisted Language Learning, 21(5), 393-408.
    Fylan, F. (2005). Semi-structured interviewing. A handbook of research methods for clinical and health psychology, 65-78.
    Neri, A., Cucchiarini, C., & Strik, W. (2003). Automatic speech recognition for second language learning: how and why it actually works. Paper presented at the Proc. ICPhS.

    下載圖示
    QR CODE