Unvoiced/voiced classification and voiced harmonic parameters estimation using the third-order statistics

doi:1005-8885 (2007) 01-0085-05

The Journal of China Universities of Posts and Telecommunications, 2007, Vol. 14, Issue (1): 85-89. doi: 1005-8885 (2007) 01-0085-05

• Artificial Intelligence •

YING Na, ZHAO Xiao-hui, DONG Jing

Communication Engineering College of Hangzhou Dianzi University, Hangzhou 310018, China

Received: 2006-03-31 Revised: 1900-01-01 Online: 2007-03-30
Corresponding author: YING Na

Abstract

Unvoiced/voiced classification of speech is a challenging problem, especially at low signal-to-noise ratios or in non-white-stationary noise environments. To address this problem, a speech classification algorithm and a technique for estimating the pairwise magnitude and frequency parameters of voiced speech are proposed. The algorithm uses the third-order spectrum of the speech signal to suppress noise and applies a least-spectrum-difference criterion to obtain the refined pitch and the maximum harmonic number. It also uses the spectral envelope to estimate the signal-to-noise ratio of the speech harmonics. The algorithm thus yields the speech classification, the voicing probability, and the harmonic parameters of each voiced frame. Simulation results indicate that, under complicated background noise and especially Gaussian noise, the proposed algorithm classifies speech effectively and estimates the voicing probability and the voiced parameters with high accuracy.
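
The noise-suppression idea in the abstract can be illustrated with a minimal sketch; this is not the authors' algorithm. The third-order cumulant of a Gaussian process is zero, whereas a signal with phase-locked harmonics generally has a non-zero one. The example estimates the zero-lag cumulant c3(0, 0) for white Gaussian noise, for a toy "voiced" signal made of a fundamental plus its phase-locked second harmonic, and for their sum. The function name `third_order_cumulant`, the sampling rate, the fundamental frequency, and the two-harmonic signal model are all assumptions chosen for illustration.

```python
import math
import random


def third_order_cumulant(x, tau1=0, tau2=0):
    """Sample estimate of c3(tau1, tau2) = E[x(n) x(n+tau1) x(n+tau2)].

    Hypothetical helper for illustration; lags must be non-negative.
    """
    mean = sum(x) / len(x)
    xc = [v - mean for v in x]  # remove the mean before forming the triple product
    n = len(xc) - max(tau1, tau2)  # number of usable triple products
    return sum(xc[i] * xc[i + tau1] * xc[i + tau2] for i in range(n)) / n


# Assumed toy parameters (not taken from the paper).
N = 200_000   # number of samples (an integer number of pitch periods)
FS = 8000     # sampling rate in Hz
F0 = 200      # fundamental frequency in Hz

rng = random.Random(0)  # fixed seed so the run is repeatable

# Zero-mean, unit-variance white Gaussian noise.
noise = [rng.gauss(0.0, 1.0) for _ in range(N)]

# Toy voiced signal: fundamental plus a phase-locked second harmonic.
voiced = [
    math.cos(2 * math.pi * F0 * i / FS) + math.cos(4 * math.pi * F0 * i / FS)
    for i in range(N)
]

# Voiced signal corrupted by the Gaussian noise.
noisy = [s + w for s, w in zip(voiced, noise)]

c3_noise = third_order_cumulant(noise)
c3_voiced = third_order_cumulant(voiced)
c3_noisy = third_order_cumulant(noisy)

print(f"noise only : {c3_noise:+.3f}")
print(f"clean voice: {c3_voiced:+.3f}")
print(f"noisy voice: {c3_noisy:+.3f}")
```

For this toy signal the zero-lag value is 3/4 analytically. The noise-only estimate should be close to zero and the noisy estimate should stay close to the clean one, which is the property that makes third-order statistics attractive under Gaussian noise.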

Key words:

unvoiced/voiced classification; harmonic extraction; third-order cumulant; sinusoidal speech model

CLC number:

TN912.3

YING Na, ZHAO Xiao-hui, DONG Jing. Unvoiced/voiced classification and voiced harmonic parameters estimation using the third-order statistics[J]. The Journal of China Universities of Posts and Telecommunications, 2007, 14(1): 85-89.
