Зарегистрироваться
Восстановить пароль
FAQ по входу

Rao K.S., Koolagudi S.G. Robust Emotion Recognition using Spectral and Prosodic Features

  • Файл формата pdf
  • размером 1,94 МБ
  • Добавлен пользователем
  • Описание отредактировано
Rao K.S., Koolagudi S.G. Robust Emotion Recognition using Spectral and Prosodic Features
Springer, 2013. — 127 p. — ISBN 978-1-4614-6359-7, ISBN 978-1-4614-6360-3.
Human beings use speech as a primary mode of communication for conveying messages. A speech signal carries multiple cues related to intended message, speaker and language identities, behavioural and emotional mood of the speaker and characteristics of background environment. Human beings exploit all these cues for performing various speech tasks. Now a days in all areas of applications, machines are extensively used for performing the desired tasks automatically. For interacting with machines, humans feel speech interface to be the most convenient one. Speech interface to machine should take care of basic tasks such as speech understanding and speech generation.
Most of the present speech systems support the basic speech tasks for the neutral speech recorded in a clean environment. But, in general human speech is mostly embedded with emotions to convey the intended message. Processing emotional speech by a machine is a complex task. In real life, emotions are highly dynamic and depend on several factors such as speaker, language, culture, society and geographical regions. The basic objective of this book is to explore various features and models for characterising and discriminating the emotions. In addition to conventional speech features, emotion-specific spectral and prosodic features are introduced in this book for performing robust emotion recognition.
This book primarily focuses on spectral and prosodic features to discriminate emotions. In addition to conventional spectral features extracted from the entire speech through block processing, sub-syllabic regions such as consonants, vowels and CV (Consonant-to-Vowel) transition regions are explored for computing spectral features. Spectral features extracted through pitch synchronous analysis are explored for reducing the ambiguity in confusable emotions. Prosodic features extracted from words and syllables present at different positions (initial, middle and final) are investigated to analyse the emotion-specific knowledge. Various nonlinear models, such as auto-associative neural networks, support vector machines and Gaussian mixture models, are explored to capture the emotionspecific information from the features mentioned above. In this book, emotional database Indian Institute of Technology Kharagpur Simulated Emotion Speech Corpus (IITKGP-SESC) in an Indian language Telugu, Berlin emotional speech database (Emo-DB) and Hindi movie database are used for analysing the emotion recognition performance.
This book is mainly intended for researchers working on emotion recognition from speech. The book is also useful for young researchers, who want to pursue research in speech processing using basic excitation source, vocal tract and prosodic features. Hence, this may be recommended as the text or reference book at the postgraduate level advanced speech processing course.
Robust Emotion Recognition using Pitch Synchronous and Sub-syllabic Spectral Features.
Robust Emotion Recognition using Sentence, Word and Syllable Level Prosodic Features.
Robust Emotion Recognition using Combination of Excitation Source, Spectral and Prosodic Features.
Robust Emotion Recognition using Speaking Rate Features.
Emotion Recognition on Real Life Emotions.
Summary and Conclusions.
A: MFCC Features.
B: Gaussian Mixture Model (GMM).
  • Чтобы скачать этот файл зарегистрируйтесь и/или войдите на сайт используя форму сверху.
  • Регистрация