Toward the simulation of emotion in synthetic speech

A review of the literature on human vocal emotion

Iain R. Murray, John L. Arnott

    Research output: Contribution to journalArticle

    606 Citations (Scopus)

    Abstract

    There has been considerable research into perceptible correlates of emotional state, but a very limited amount of the literature examines the acoustic correlates and other relevant aspects of emotion effects in human speech; in addition, the vocal emotion literature is almost totally separate from the main body of speech analysis literature. A discussion of the literature describing human vocal emotion, and its principal findings, are presented. The voice parameters affected by emotion are found to be of three main types: voice quality, utterance timing, and utterance pitch contour. These parameters are described both in general and in detail for a range of specific emotions. Current speech synthesizer technology is such that many of the parameters of human speech affected by emotion could be manipulated systematically in synthetic speech to produce a simulation of vocal emotion; application of the literature to construction of a system capable of producing synthetic speech with emotion is discussed.
    Original languageEnglish
    Pages (from-to)1097-1108
    Number of pages12
    JournalJournal of the Acoustical Society of America
    Volume93
    Issue number2
    DOIs
    Publication statusPublished - 1993

    Fingerprint

    emotions
    simulation
    synthesizers
    Emotion
    Simulation
    time measurement
    acoustics

    Cite this

    @article{09fd46b0ce3e4e5cb522fec777e576ef,
    title = "Toward the simulation of emotion in synthetic speech: A review of the literature on human vocal emotion",
    abstract = "There has been considerable research into perceptible correlates of emotional state, but a very limited amount of the literature examines the acoustic correlates and other relevant aspects of emotion effects in human speech; in addition, the vocal emotion literature is almost totally separate from the main body of speech analysis literature. A discussion of the literature describing human vocal emotion, and its principal findings, are presented. The voice parameters affected by emotion are found to be of three main types: voice quality, utterance timing, and utterance pitch contour. These parameters are described both in general and in detail for a range of specific emotions. Current speech synthesizer technology is such that many of the parameters of human speech affected by emotion could be manipulated systematically in synthetic speech to produce a simulation of vocal emotion; application of the literature to construction of a system capable of producing synthetic speech with emotion is discussed.",
    author = "Murray, {Iain R.} and Arnott, {John L.}",
    note = "Medline is the source for the MeSH terms of this document.",
    year = "1993",
    doi = "10.1121/1.405558",
    language = "English",
    volume = "93",
    pages = "1097--1108",
    journal = "Journal of the Acoustical Society of America",
    issn = "0001-4966",
    publisher = "Acoustical Society of America",
    number = "2",

    }

    Toward the simulation of emotion in synthetic speech : A review of the literature on human vocal emotion. / Murray, Iain R.; Arnott, John L.

    In: Journal of the Acoustical Society of America, Vol. 93, No. 2, 1993, p. 1097-1108.

    Research output: Contribution to journalArticle

    TY - JOUR

    T1 - Toward the simulation of emotion in synthetic speech

    T2 - A review of the literature on human vocal emotion

    AU - Murray, Iain R.

    AU - Arnott, John L.

    N1 - Medline is the source for the MeSH terms of this document.

    PY - 1993

    Y1 - 1993

    N2 - There has been considerable research into perceptible correlates of emotional state, but a very limited amount of the literature examines the acoustic correlates and other relevant aspects of emotion effects in human speech; in addition, the vocal emotion literature is almost totally separate from the main body of speech analysis literature. A discussion of the literature describing human vocal emotion, and its principal findings, are presented. The voice parameters affected by emotion are found to be of three main types: voice quality, utterance timing, and utterance pitch contour. These parameters are described both in general and in detail for a range of specific emotions. Current speech synthesizer technology is such that many of the parameters of human speech affected by emotion could be manipulated systematically in synthetic speech to produce a simulation of vocal emotion; application of the literature to construction of a system capable of producing synthetic speech with emotion is discussed.

    AB - There has been considerable research into perceptible correlates of emotional state, but a very limited amount of the literature examines the acoustic correlates and other relevant aspects of emotion effects in human speech; in addition, the vocal emotion literature is almost totally separate from the main body of speech analysis literature. A discussion of the literature describing human vocal emotion, and its principal findings, are presented. The voice parameters affected by emotion are found to be of three main types: voice quality, utterance timing, and utterance pitch contour. These parameters are described both in general and in detail for a range of specific emotions. Current speech synthesizer technology is such that many of the parameters of human speech affected by emotion could be manipulated systematically in synthetic speech to produce a simulation of vocal emotion; application of the literature to construction of a system capable of producing synthetic speech with emotion is discussed.

    UR - http://www.scopus.com/inward/record.url?scp=0027447292&partnerID=8YFLogxK

    U2 - 10.1121/1.405558

    DO - 10.1121/1.405558

    M3 - Article

    VL - 93

    SP - 1097

    EP - 1108

    JO - Journal of the Acoustical Society of America

    JF - Journal of the Acoustical Society of America

    SN - 0001-4966

    IS - 2

    ER -