Acta Metallurgica Sinica (English letters) ›› 2008, Vol. 15 ›› Issue (2): 130-134. doi: 1005-8885 (2008) 02-0130-05

• Wireless

Cross similarity measurement for speaker adaptive test normalization in text-independent speaker verification

ZHAO Jian, DONG Yuan, ZHAO Xian-yu, YANG Hao, WANG Hai-la   

  1. Laboratory of Pattern Recognition and Intelligent System, Beijing University of Posts and Telecommunications, Beijing 100876, China
  • Received: 2007-07-06 Online: 2008-06-30
  • Contact: ZHAO Jian

Abstract:

Speaker adaptive test normalization (ATnorm) is the most effective of the widely used score normalization approaches in text-independent speaker verification. It selects speaker-adaptive impostor cohorts using an extra development corpus in order to improve recognition performance. This paper presents an improved implementation of ATnorm that offers significant overall advantages over the original. The method adopts a novel cross similarity measurement for speaker-adaptive cohort model selection and requires no extra development corpus. It achieves performance comparable to the original ATnorm while moderately reducing computational complexity. With the saved development corpus put to full use, overall system performance can be improved significantly. Results on the NIST 2006 Speaker Recognition Evaluation corpora show that the method provides significant improvements in system performance, with overall relative gains of 14.4% in equal error rate (EER) and 14.6% in decision cost function (DCF).
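
For readers unfamiliar with cohort-based score normalization, the general idea can be sketched as follows. This is only an illustrative sketch under stated assumptions, not the authors' implementation: the abstract does not define the cross similarity measurement, so the similarity values are taken here as a precomputed input. The function names (select_cohort, tnorm, atnorm) are hypothetical, and using the population standard deviation is an arbitrary choice.

```python
from statistics import mean, pstdev


def select_cohort(similarity_to_target, k):
    """Return the ids of the k impostor models most similar to the target.

    similarity_to_target maps an impostor model id to a similarity value
    (assumed precomputed; the paper's cross similarity measurement is not
    specified in the abstract, so any measure could be plugged in here).
    """
    ranked = sorted(similarity_to_target,
                    key=similarity_to_target.get,
                    reverse=True)
    return ranked[:k]


def tnorm(raw_score, cohort_scores):
    """Test normalization: centre and scale the raw score using the
    mean and standard deviation of the cohort scores on the same test
    utterance."""
    mu = mean(cohort_scores)
    sigma = pstdev(cohort_scores)  # population std; an assumption
    if sigma == 0:
        raise ValueError("cohort scores have zero variance")
    return (raw_score - mu) / sigma


def atnorm(raw_score, similarity_to_target, impostor_scores, k):
    """Speaker-adaptive T-norm: normalize against only the k impostor
    models selected for this target speaker.

    impostor_scores maps an impostor model id to that model's score on
    the test utterance.
    """
    cohort = select_cohort(similarity_to_target, k)
    return tnorm(raw_score, [impostor_scores[m] for m in cohort])


# Toy usage with made-up numbers (not data from the paper).
similarity = {"a": 0.9, "b": 0.1, "c": 0.8, "d": 0.7}
scores_on_test = {"a": 1.0, "b": -5.0, "c": 3.0, "d": 2.0}
normalized = atnorm(4.0, similarity, scores_on_test, k=3)
```

In this toy run the dissimilar model "b" is excluded from the cohort, so its outlying score does not distort the normalization statistics. That is the motivation for making the cohort speaker adaptive.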

Key words:

speaker ATnorm; score normalization; cross similarity measurement; speaker verification; NIST speaker recognition evaluation
