Acta Metallurgica Sinica(English letters) ›› 2008, Vol. 15 ›› Issue (2): 130-134.doi: 1005-8885 (2008) 02-0130-05

• Speech Recognition • 上一篇    

Cross similarity measurement for speaker adaptive test normalization in text-independent speaker verification

赵剑 董远 赵贤宇 杨浩 王海拉   

  1. Laboratory of Pattern Recognition and Intelligent System, Beijing University of Posts and Telecommunications, Beijing 100876, China
  • 收稿日期:2007-07-06 修回日期:1900-01-01 出版日期:2008-06-30
  • 通讯作者: 赵剑

Cross similarity measurement for speaker adaptive test normalization in text-independent speaker verification

ZHAO Jian, DONG Yuan, ZHAO Xian-yu, YANG Hao, WANG Hai-la   

  1. Laboratory of Pattern Recognition and Intelligent System, Beijing University of Posts and Telecommunications, Beijing 100876, China
  • Received:2007-07-06 Revised:1900-01-01 Online:2008-06-30
  • Contact: ZHAO Jian

摘要:

Speaker adaptive test normalization (ATnorm) is the most effective approach of the widely used score normalization in text-independent speaker verification, which selects speaker adaptive impostor cohorts with an extra development corpus in order to enhance the recognition performance. In this paper, an improved implementation of ATnorm that can offer overall significant advantages over the original ATnorm is presented. This method adopts a novel cross similarity measurement in speaker adaptive cohort model selection without an extra development corpus. It can achieve a comparable performance with the original ATnorm and reduce the computation complexity moderately. With the full use of the saved extra development corpus, the overall system performance can be improved significantly. The results are presented on NIST 2006 Speaker Recognition Evaluation data corpora where it is shown that this method provides significant improvements in system performance, with relatively 14.4% gain on equal error rate (EER) and 14.6% gain on decision cost function (DCF) obtained as a whole.

关键词:

speaker;ATnorm,;score;normalization,;cross;similarity;measurement,;speaker;verification,;NIST;speaker;recognition;evaluation

Abstract:

Speaker adaptive test normalization (ATnorm) is the most effective approach of the widely used score normalization in text-independent speaker verification, which selects speaker adaptive impostor cohorts with an extra development corpus in order to enhance the recognition performance. In this paper, an improved implementation of ATnorm that can offer overall significant advantages over the original ATnorm is presented. This method adopts a novel cross similarity measurement in speaker adaptive cohort model selection without an extra development corpus. It can achieve a comparable performance with the original ATnorm and reduce the computation complexity moderately. With the full use of the saved extra development corpus, the overall system performance can be improved significantly. The results are presented on NIST 2006 Speaker Recognition Evaluation data corpora where it is shown that this method provides significant improvements in system performance, with relatively 14.4% gain on equal error rate (EER) and 14.6% gain on decision cost function (DCF) obtained as a whole.

Key words:

speaker ATnorm;score normalization;cross similarity measurement;speaker verification;NIST speaker recognition evaluation

中图分类号: