Graduate student: | 林鈺庭 Lin, Yu-Ting |
---|---|
Thesis title: | 聲音特質與文字的情緒向性對於聲音偏好度之影響 Voice Preference in Mandarin: The Effect of Voice Quality and Text Valence |
Advisor: | 甯俐馨 Ning, Li-Hsin |
Committee members: | 陳正賢 Chen, Cheng-Hsien; 郭貞秀 Kuo, Chen-Hsiu |
Oral defense date: | 2021/06/28 |
Degree: | 碩士 Master |
Department: | 英語學系 Department of English |
Publication year: | 2021 |
Academic year of graduation: | 109 |
Language: | English |
Number of pages: | 101 |
Keywords (Chinese): | 聲音好感度, 聲音特質, 文字向性 |
Keywords (English): | voice preference, acoustic features, text valence |
Research method: | Experimental design |
DOI URL: | http://doi.org/10.6345/NTNU202100639 |
Thesis type: | Academic thesis |
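This study examines how native Mandarin speakers judge the likability of voices, focusing on two factors: voice quality and speech content. Previous research has shown that voice quality is a key factor in judging likability in Western spoken communication. This study analyzes the effect of voice quality for Mandarin speakers and examines in depth whether the positive or negative emotion of the speech content affects listeners' impressions of the speaker. In addition, we analyze which voice qualities matter most to listeners' likability judgments for neutral, positive, and negative content.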
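All participants were young native Mandarin speakers, and each completed a voice likability rating task. The study played 180 different studio-recorded voice samples (3 text contents x 5 sentences x 6 speakers x 2 genders). Participants heard one speaker's voice at a time and then reported intuitively how much they liked it, indicating the degree on a five-point Likert scale. After the experiment, participants answered questions about voice quality and text emotion based on their impressions from the task.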
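This study found that both voice quality and text content affect how much Mandarin listeners like Mandarin speakers' voices. First, positive, neutral, and negative content differed clearly: positive content was liked most, followed by neutral content, and then negative sentences. According to participants' post-experiment responses, positive text increases the likability of a voice. Next, among the acoustic features, frequency perturbation (jitter) was the most critical factor in voice preference. In both positive and negative content, the more stable the frequency, the more likely the voice was to be judged pleasant. The study also found that the influence of voice quality is closely tied to content. For example, stability matters most for neutral content, whereas pitch performance matters more for positive content. Finally, we discuss how the various voice qualities affect voice likability in Mandarin. Frequency perturbation (jitter) and amplitude perturbation (shimmer) both had negative effects on likability, while harmonics-to-noise ratio (HNR), pitch mean, and pitch range all had positive effects. The experiment also confirmed that voice likability relates not only to voice quality: whether the vocal delivery suits the speaker's content is also one of the factors in the judgment.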
This study investigates two factors in Mandarin voice preference: voice quality and text emotion. Previous research has pointed out the importance of voice quality in Western languages. This study therefore explores the impact of voice quality (e.g., pitch mean, pitch range, jitter, shimmer, HNR, and duration) in Mandarin and extends the investigation to text emotion (neutral, positive, and negative). Finally, we examine whether listeners rely on different acoustic information when making preference judgments for different text emotions.
To examine the influence of voice quality and text emotion on voice preference, this thesis uses a single voice preference task. A group of native Mandarin speakers rated a variety of voices according to their personal preference. The materials were 180 stimuli (5 sentences x 3 text emotions x 6 Mandarin speakers x 2 genders). Participants marked their preference for each voice on a five-point Likert scale. After the experiment, listeners answered a few questions about voice quality and text emotion.
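The stimulus count follows from fully crossing the four design factors. As a minimal sketch, the design can be enumerated as below. All labels are hypothetical placeholders, and the code assumes that "6 Mandarin speakers x 2 genders" means six speakers of each gender and that each emotion has its own five sentences; the abstract gives only the factor sizes.

```python
from itertools import product

# Factor levels. Only the sizes (3, 5, 6, 2) come from the abstract;
# every label below is a hypothetical placeholder.
text_emotions = ["neutral", "positive", "negative"]
sentence_ids = range(1, 6)   # 5 sentences per emotion (assumed)
speaker_ids = range(1, 7)    # 6 speakers per gender (assumed)
genders = ["female", "male"]

# Fully cross the factors to list every stimulus once.
stimuli = [
    {
        "emotion": emotion,
        "sentence": f"{emotion}_sentence_{sentence}",
        "speaker": f"{gender}_speaker_{speaker}",
        "gender": gender,
    }
    for emotion, sentence, gender, speaker in product(
        text_emotions, sentence_ids, genders, speaker_ids
    )
]

print(len(stimuli))  # 3 * 5 * 2 * 6 = 180
```

Under these assumptions each of the twelve speakers contributes fifteen recordings, five per text emotion.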
Our results showed that both voice quality and text emotion influenced voice preference. Concerning the content of the utterance, positive utterances were favored over those with neutral or negative content. The statistical analysis and the qualitative responses both suggested that lexical positivity is likely to enhance listeners' preference for a voice. As for the acoustic influence, jitter was found to be the most important acoustic cue: regardless of content, the amount of jitter was the most influential factor in voice preference. We also found that acoustic influence is associated with content, in that listeners may rely on different acoustic cues for different content. For instance, vocal stability is the most crucial factor for neutral content, while pitch performance is more important for positive utterances. Finally, we further discuss the acoustic influences in Mandarin. Increases in jitter and shimmer were found to be detrimental to Mandarin voice preference, while increases in HNR, pitch mean, and pitch range can greatly enhance a listener's preference for a voice. We therefore conclude that the influences of both voice and text should be taken into account in discussions of voice preference. A preferred voice should be defined by acoustic performance that suits the corresponding text valence.
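For readers unfamiliar with the two stability measures, the sketch below computes one common variant of each: local jitter, the mean absolute difference between consecutive glottal periods divided by the mean period, and local shimmer, the same ratio over consecutive peak amplitudes. The abstract does not state which variant or which software the thesis used, so the function name and the sample values are illustrative assumptions only.

```python
def local_perturbation(values):
    """Mean absolute difference between consecutive values,
    divided by the mean value.

    Applied to glottal periods this gives local jitter; applied to
    peak amplitudes it gives local shimmer. (Hypothetical helper,
    not taken from the thesis.)
    """
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    mean_diff = sum(diffs) / len(diffs)
    mean_value = sum(values) / len(values)
    return mean_diff / mean_value


# Made-up example cycles, not data from the study.
periods_s = [0.0100, 0.0102, 0.0098, 0.0101]   # period of each cycle, seconds
amplitudes = [0.80, 0.78, 0.83, 0.79]          # peak amplitude of each cycle

jitter = local_perturbation(periods_s)
shimmer = local_perturbation(amplitudes)
print(f"local jitter:  {jitter:.2%}")
print(f"local shimmer: {shimmer:.2%}")
```

A perfectly periodic voice would score zero on both measures, so lower values correspond to the greater vocal stability that the results associate with higher preference.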