Applied human informatics (AHI)
Open Journal of the Academy of Human Informatics (ISSN 2433-2372)

Vol.1 (2019) No.1 p.1-17 prev. | next
10.14865/ahi.1.1.1
Article
Voice quality perception from emotional voices: Comparison of emotional utterances in English and Japanese
Machiko Ikemoto1), Mariko Kikutani2), James A. Russel 3)
1) Faculty of Psychology, Doshisha University, 1-3 Miyakodani, Tatara, Kyotanabe, Kyoto 610-0394, Japan
2) Department of Social Psychology, Toyo University, 5-28-20 Hakusan, Bunkyo-ku, Tokyo 112-8606, Japan
3) Department of Psychology, Boston College, Chestnut Hill, MA USA
Received: Nov. 19 2018; Revised: Jan. 21 2019; Accepted: Feb. 12 2019
Keywords: voice quality, emotion, rating scale, culture
Abstract
Voice quality refers to the auditory impression (e.g., hoarse and nasal) the listener experiences from a piece of speech, and it provides information about the speaker such as his/her physical characteristics, age, and emotional state. The present study aimed to develop a scale in English language to measure how we perceive qualities of emotional voices expressing happy, angry, sad and fear, based on an existing Japanese scale [1]. Further, it examined whether the voice qualities related to those emotions differ between Japanese listeners and American listeners. In Study 1, English adjectives used to describe voices emoting happiness, anger, sadness, and fear were collected from native English speakers, and then ten words were selected for the scale. In Study 2, participants from the US and Japan listened to voices expressing the four emotions and evaluated qualities using the scales in their native language. A factor analysis and a canonical discriminant analysis on the data showed that the results from the two cultures were very similar despite the difference in adjectives contained in the scales. The ten words in each scale were grouped into three factors and all factors matched closely for the two cultures. The canonical discriminant analysis revealed three discriminant functions available to distinguish different emotions based on the quality ratings. These dimensions appear to indicate variations in valence and activation of the expressed emotion, and tensions of the speakers’ vocal folds. By using these dimensions, the voice quality ratings can distinguish emotions of the voice fairly well, demonstrating that the perception of voice quality is an important aspect of recognizing emotion from voices.
Copyright © 2019 The Academy of Human Informatics