中国邮电高校学报(英文) ›› 2018, Vol. 25 ›› Issue (4): 28-37.doi: 10.19682/j.cnki.1005-8885.2018.1014

• Artificial Intelligence • 上一篇    下一篇

Speech recognition algorithm based on neural network and hidden Markov model

Zhao Jianhui, Gao Hongbo, Liu Yuchao, Cheng Bo   

  1. 1. Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China
    2. State Key Laboratory of Automotive Safety and Energy, Tsinghua University, Beijing 100084, China
    3. Center for Intelligent Connected Vehicles and Transportation, Tsinghua University, Beijing 100084, China
  • 收稿日期:2017-12-22 修回日期:2018-08-27 出版日期:2018-08-30 发布日期:2018-11-02
  • 通讯作者: Gao Hongbo,E-mail:ghb48@tsinghua.edu.cn E-mail:ghb48@tsinghua.edu.cn
  • 作者简介:Gao Hongbo,E-mail:ghb48@tsinghua.edu.cn
  • 基金资助:
    This work was supported by China Postdoctoral Science Foundation Special Funded Projects (2018T110095), Project funded by China Postdoctoral Science Foundation (2017M620765 ), National Key Research and Development Program of China (2017YFB0102603), and Junior Fellowships for Advanced Innovation Think-tank Program of China Association for Science and Technology ( DXB-ZKQN-2017-035).

Speech recognition algorithm based on neural network and hidden Markov model

Zhao Jianhui, Gao Hongbo, Liu Yuchao, Cheng Bo   

  1. 1. Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China
    2. State Key Laboratory of Automotive Safety and Energy, Tsinghua University, Beijing 100084, China
    3. Center for Intelligent Connected Vehicles and Transportation, Tsinghua University, Beijing 100084, China
  • Received:2017-12-22 Revised:2018-08-27 Online:2018-08-30 Published:2018-11-02
  • Contact: Gao Hongbo,E-mail:ghb48@tsinghua.edu.cn E-mail:ghb48@tsinghua.edu.cn
  • About author:Gao Hongbo,E-mail:ghb48@tsinghua.edu.cn
  • Supported by:
    This work was supported by China Postdoctoral Science Foundation Special Funded Projects (2018T110095), Project funded by China Postdoctoral Science Foundation (2017M620765 ), National Key Research and Development Program of China (2017YFB0102603), and Junior Fellowships for Advanced Innovation Think-tank Program of China Association for Science and Technology ( DXB-ZKQN-2017-035).

摘要: This study proposes a hybrid model of speech recognition parallel algorithm based on hidden Markov model (HMM) and artificial neural network (ANN). First, the algorithm uses HMM for time-series modeling of speech signals and calculates the voice to the HMM of the output probability score. Second, with the probability score as input to the neural network, the algorithm gets information for classification and recognition and makes a decision based on the hybrid model. Finally, Matlab software is used to train and test sample data. Simulation results show that using the strong time-series modeling ability of HMM and the classification features of neural network, the proposed algorithm possesses stronger noise immunity than the traditional HMM. Moreover, the hybrid model enhances the individual flaws of the HMM and the neural network and greatly improves the speed and performance of speech recognition.

关键词: speech recognition, HMM, neural network

Abstract: This study proposes a hybrid model of speech recognition parallel algorithm based on hidden Markov model (HMM) and artificial neural network (ANN). First, the algorithm uses HMM for time-series modeling of speech signals and calculates the voice to the HMM of the output probability score. Second, with the probability score as input to the neural network, the algorithm gets information for classification and recognition and makes a decision based on the hybrid model. Finally, Matlab software is used to train and test sample data. Simulation results show that using the strong time-series modeling ability of HMM and the classification features of neural network, the proposed algorithm possesses stronger noise immunity than the traditional HMM. Moreover, the hybrid model enhances the individual flaws of the HMM and the neural network and greatly improves the speed and performance of speech recognition.

Key words: speech recognition, HMM, neural network