Graduate student: 張容榕
Chang, Jung-Jung
Thesis title: 中文諷刺語氣表達及理解之間之聲音特色與年齡及句型對諷刺語音之影響
The Production and Perception of the Voice Quality in Taiwanese Mandarin Sarcasm: The Effects of Age and Phrase Type
Advisor: 甯俐馨
Ning, Li-Hsin
Oral defense committee: 張妙霞
Chang, Miao-Hsia
Chang, Yung-Hsiang
Oral defense date: 2021/07/06
Degree: Master's
Department: 英語學系
Department of English
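Year of publication: 2021
Academic year of graduation: 109 (ROC calendar)
Language: English
Number of pages: 106
Chinese keywords: 諷刺語氣 (sarcasm), 聲音特質 (voice quality), 句子型態 (phrase type), 年齡 (age)
English keywords: sarcasm, acoustic features, prosodic features, age, phrase type
Research method: experimental design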
DOI URL: http://doi.org/10.6345/NTNU202100696
Thesis type: academic thesis
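  • Because previous literature shows that vocal information is an important cue for identifying a sarcastic tone, this study investigates the vocal characteristics of sarcasm among native speakers of Taiwanese Mandarin and the cues listeners rely on to judge a speaker's tone correctly. In addition to the speaker's attitude, the effects of phrase type and speaker age on sarcastic speech are also discussed.
    In the study, a recording experiment first captured three attitudes (neutral, sincere, sarcastic) expressed by native Mandarin participants across different phrase types. A second group of participants then judged the attitude of the recorded sentences. The results show that, compared with the neutral attitude, sarcasm has a higher pitch (mean F0), a wider pitch range, lower jitter and shimmer, and a slower speech rate. Compared with the sincere attitude, sarcasm has a lower pitch (mean F0), a narrower pitch range, lower jitter and shimmer, and a faster speech rate. Age affects sarcastic speech in pitch and pitch range, and its effect on pitch appears only in keyphrases. Within sarcasm, keyphrases show a slower speech rate than the other phrase types.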

    Previous research has acknowledged prosodic information as one major component that contributes to sarcasm detection. However, the voice quality of sarcastic speech shows no consistency cross-linguistically. This study focuses on the voice quality of sarcasm in Taiwanese Mandarin. Specifically, we investigate whether phrase type and age affect Taiwanese Mandarin speakers’ production and perception of sarcastic utterances. Six voice quality parameters are examined: mean F0, F0 range, jitter, shimmer, H1-H2, and speech rate.
    A sarcasm elicitation task, which uses a fully crossed 3 (attitudes) x 3 (phrase types) design, was adopted to record participants’ utterances of neutrality, sincerity and sarcasm. Then, a perceptual validation process helped identify the successfully recognized and misinterpreted attitudes produced by the speakers.
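    As an illustration of the fully crossed 3 (attitudes) x 3 (phrase types) design, the following minimal sketch enumerates the nine resulting conditions. Only "keyphrases" is named in this abstract; the other two phrase-type labels, and all function and variable names, are hypothetical placeholders rather than the thesis's own terms.

```python
from itertools import product

ATTITUDES = ["neutrality", "sincerity", "sarcasm"]
# Only "keyphrases" appears in the abstract; the other two labels are
# hypothetical placeholders for the unnamed phrase types.
PHRASE_TYPES = ["keyphrases", "phrase_type_B", "phrase_type_C"]


def crossed_design(attitudes, phrase_types):
    """Return every attitude x phrase-type combination exactly once."""
    return [
        {"attitude": attitude, "phrase_type": phrase_type}
        for attitude, phrase_type in product(attitudes, phrase_types)
    ]


conditions = crossed_design(ATTITUDES, PHRASE_TYPES)
for condition in conditions:
    print(condition["attitude"], "/", condition["phrase_type"])
```

    In a fully crossed design every level of one factor is paired with every level of the other, so each attitude is elicited in each phrase type.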
    Our results showed that Taiwanese Mandarin sarcasm featured higher mean F0, wider F0 range, lower jitter, lower shimmer, and slower speech rate than neutrality, but lower mean F0, narrower F0 range, lower jitter, lower shimmer, and slower speech rate than sincerity. Age differences appeared in speakers’ sarcasm production strategies for F0 range and mean F0, although the difference in mean F0 was observed only in keyphrases. A phrase type effect also appeared in sarcastic speech: keyphrases were produced more slowly than the other two phrase types.
    Jitter, shimmer, and speech rate were found to be the major sources of misinterpretation. Sarcastic expressions with higher jitter, higher shimmer, and faster speech rate tended to be perceived as sincere. Sincere utterances with slower speech rate tended to be perceived as neutral. Neutral expressions with lower shimmer tended to be misjudged as sarcastic, and those with faster speech rate tended to be misinterpreted as sincere. Moreover, mean F0 and F0 range had significant effects on misinterpretation that differed by age group. Young speakers’ sarcastic utterances that were misinterpreted as sincere showed higher mean F0 and wider F0 range, whereas elderly speakers’ sarcastic expressions were misjudged as sincere when they had lower mean F0 and narrower F0 range.
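    For readers unfamiliar with the perturbation measures named above, the sketch below shows one common way to define them: local jitter as the mean absolute difference between consecutive glottal periods divided by the mean period, and local shimmer as the same ratio computed on peak amplitudes. This abstract does not say which jitter or shimmer variant, or which software, the thesis used, so the definitions, function names, and sample values here are assumptions for illustration only. Mean F0 is simplified to the average of the per-cycle frequencies.

```python
from statistics import mean


def local_perturbation(values):
    """Mean absolute difference of consecutive values, relative to their mean.

    Applied to glottal periods this is a local jitter ratio; applied to
    per-cycle peak amplitudes it is a local shimmer ratio.
    """
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    return mean(diffs) / mean(values)


def f0_summary(periods_s):
    """Return (mean F0, F0 range) in Hz from glottal periods in seconds."""
    f0 = [1.0 / period for period in periods_s]
    return mean(f0), max(f0) - min(f0)


# Invented sample values, not data from the thesis.
periods = [0.0050, 0.0052, 0.0049, 0.0051]   # seconds per glottal cycle
amplitudes = [0.80, 0.78, 0.82, 0.79]        # arbitrary linear units

jitter = local_perturbation(periods)
shimmer = local_perturbation(amplitudes)
mean_f0, f0_range = f0_summary(periods)
print(f"jitter={jitter:.4f} shimmer={shimmer:.4f}")
print(f"mean F0={mean_f0:.1f} Hz, F0 range={f0_range:.1f} Hz")
```

    Under these definitions a perfectly regular voice yields zero jitter and zero shimmer, so "lower jitter" and "lower shimmer" correspond to more stable cycle-to-cycle timing and amplitude.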

    ACKNOWLEDGEMENT
    CHINESE ABSTRACT
    ENGLISH ABSTRACT
    TABLE OF CONTENTS
    1 INTRODUCTION
        1.1 Background and Motivation
        1.2 Organization of the Study
    2 LITERATURE REVIEW
        2.1 Sarcasm Definition
        2.2 Sarcasm Processing in Communication
            2.2.1 Approaches to Sarcasm Processing
                The Literal-First Account
                The Interactive Account
                Relevance Theory
            2.2.2 The Role of Prosodic/Acoustic Cues in Sarcasm Processing
        2.3 The Prosodic/Acoustic Features of Sarcastic Voices
            2.3.1 Cross-Linguistic Studies
                Fundamental Frequency
                Loudness
                Speech Rate
                Other Acoustic Parameters: HNR, H1-H2
            2.3.2 The Voice Qualities of Sarcasm in Mandarin Chinese
            2.3.3 Ageing Voice and Sarcasm
                Fundamental Frequency
                Speech Rate
                Jitter and Shimmer
                The Possible Influence of Ageing Voice on Sarcasm
        2.4 Research Questions
    3 METHODOLOGY
        3.1 Recording
            3.1.1 Participants
            3.1.2 Materials
            3.1.3 Recording Procedure
        3.2 Perceptual Validation Test
            3.2.1 Participants
            3.2.2 Materials
            3.2.3 Procedure
        3.3 Data Analysis
            3.3.1 Speech Data Selection
            3.3.2 Voice Quality Parameters
            3.3.3 Statistical Procedure
    4 RESULTS
        4.1 Three-way ANOVA on Different Voice Qualities
            4.1.1 The Effect of Attitude
            4.1.2 The Effect of Phrase Type
            4.1.3 The Effect of Age
            4.1.4 The Interactions
        4.2 The Effects of Congruity, Phrase Type, and Age in Different Attitudes
            4.2.1 Sarcasm Dataset
            4.2.2 Sincerity Dataset
            4.2.3 Neutrality Dataset
        4.3 Summary
    5 DISCUSSION
        5.1 The Sound of Sarcasm
        5.2 Misinterpreted Intentions
        5.3 Limitation and Future Study
    6 CONCLUSION
    REFERENCES
    APPENDIX 1
    APPENDIX 2

