Springer, 2014. — 149 p.
Speaker verification (SV) is the process of validating the claimed identity of an individual using his/her speech. State-of-the-art SV systems perform reasonably well when the spoken utterances are ‘clean’, i.e., free from any sort of distortions caused by external factors. However, the accuracy of such systems degrade severely when speech signals are distorted due to the presence of environmental or background noise. Robustness towards environmental noise is crucial for several SV applications, especially in hand-held devices where the background environments are uncontrolled, time-varying and unpredictable. The strategies used for handling background noise can be broadly categorized as (i) ‘compensation’ or ‘adaptation’ methods where features extracted in the test environment are denoised (compensated) to match with the training environment, or speaker model parameters estimated in the training environment are altered (adapted) to reflect the test environment, and (ii) developing speaker models or extracting features which are relatively immune towards the effect of background noise, by design. This book explores novel methods from each of the aforementioned categories for robust SV in noisy background environments. A group of Gaussian mixture model (GMM) based stochastic feature compensation methods is proposed for SV in noisy environments. Alternatively, the robustness of GMM supervectorbased speaker modeling approaches is explored for SV in noisy environments. A discriminative framework is used in which fixed-size vectors obtained by stacking GMM means (i.e., supervectors) or by total variability modeling (i.e., i-vectors) are in turn used for training speaker-specific support vector machines (SVMs). These SVMs are evaluated in noisy test environments. Training an ensemble of robust SVM classifiers using adaptive boosting is proposed for improving SV performance in noisy environments. This book is mainly intended for researchers working on robust speaker recognition technologies. This book is also useful for the young researchers, who want to pursue research in speech processing with an emphasis on acoustic modeling and feature compensation. Hence, this may be recommended as the text or reference book for the postgraduate level advanced speech processing course.