Spectral Subband Centroids for Robust Speaker Identification Using Marginalization-based Missing Feature Theory

IJSPS News

Submissions

Please send your full manuscript to: ijsps@ejournal.net

Useful Documents

FAQs

1. How to submit my research paper? What’s the process of publication of my paper?
The journal receives submitted manuscripts via email only. Please submit your research paper in .doc or .pdf format to the submission email: ijsps@ejournal.net.
2. Can I submit an abstract?
The journal publishes full research papers. So only full paper submission should be considered for possible publication...[Read More]

Home > Published Issues > 2018 > Volume 6, No. 1, March 2018 >

Aaron Nicolson, Jack Hanson, James Lyons, and Kuldip Paliwal

Signal Processing Laboratory,Griffith University, Brisbane, Australia

Abstract—Until now, marginalization-based Missing Feature Theory (MFT) for speech classification has been limited to the use of Log Spectral Subband Energies (LSSEs) as features. These features are highly correlated, thus suboptimal for classification with diagonal-covariance Gaussian Mixture Models (GMMs), a common classifier in marginalization-based MFT. In this paper, we propose that Spectral Subband Centroids (SSCs) are more apt for marginalization-based MFT, as they are both decorrelated and spectrally local. Our results show that SSCs as features produce a more robust marginalization-based MFT, diagonal-covariance GMM-based, Automatic Speaker Identification (ASI) system than LSSEs as features, for at all tested SNR values (with Additive White Gaussian Noise (AWGN)). It is also shown that a fully-connected Deep Neural Network (DNN) can accurately estimate the Ideal Binary Mask (IBM) used for MFT.

Index Terms—spectral subband centroids, missing feature theory, speaker identification, deep neural network, ideal binary mask

Cite: Aaron Nicolson, Jack Hanson, James Lyons, and Kuldip Paliwal, "Spectral Subband Centroids for Robust Speaker Identification Using Marginalization-based Missing Feature Theory," International Journal of Signal Processing Systems, Vol. 6, No. 1, pp. 12-16, March 2018. doi: 10.18178/ijsps.6.1.12-16

3-SP009

Previous paper：Comparision of LPC Based Parametric Techniques for Respiratory Sounds Recognition
Next paper：Last page