Speaker Diarization in Personal Video Speaker Diarization in Personal Video Speaker Diarization in Personal Video Speaker Diarization in Personal Vide
Zeenat Afroze
Department of Electrical and Electronic Engineering, American International University Bangladesh, Dhaka, Bangladesh
Abstract—In this paper, we present the speaker diarization system which is used in personal video recordings. Speaker diarization begins by the extraction of relevant features from the input signal. Features are measurable characteristics which are important to the distinction between different classes. They should have low inter-class similarity and also low intra-class variability. So, LDA is used to cope intra-speaker variability. We demonstrate improvement of the performance over the baseline system based on LDA. This paper also reports the performance of the speaker diarization system. A new approach, called the user feedback has been developed here to improve the performance of the speaker diarization system.
Index Terms—agglomerative clustering algorithm, bayesian information criterion, mel frequency cepstral coefficients, speaker diarization, user feedback
Cite: Zeenat Afroze, "Speaker Diarization in Personal Video Speaker Diarization in Personal Video Speaker Diarization in Personal Video Speaker Diarization in Personal Vide," International Journal of Signal Processing Systems, Vol. 2, No. 2, pp. 108-114, December 2014. doi: 10.12720/ijsps.2.2.108-114
Cite: Zeenat Afroze, "Speaker Diarization in Personal Video Speaker Diarization in Personal Video Speaker Diarization in Personal Video Speaker Diarization in Personal Vide," International Journal of Signal Processing Systems, Vol. 2, No. 2, pp. 108-114, December 2014. doi: 10.12720/ijsps.2.2.108-114