中国邮电高校学报(英文版) ›› 2019, Vol. 26 ›› Issue (5): 1-10.doi: 10.19682/j.cnki.1005-8885.2019.0028

• Artificial Intelligence •    下一篇

Adaptive transfer learning framework for dense prediction of human activity recognition

Zhang Zhao 1,Zhang Yong 1 (*), Teng Ying lei1, Guo Da1, Deng Hai qin2   

  1. 1. School of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China 
    2. AIdong Super AI (Beijing) Company Limited, Beijing 100007, China
  • 收稿日期:2019-01-21 修回日期:2019-04-09 出版日期:2019-10-31 发布日期:2019-11-06
  • 通讯作者: ZhangYong, E-mail:yongzhang@bupt.edu.cn E-mail:yongzhang@bupt.edu.cn
  • 作者简介:Corresponding author: ZhangYong, E-mail:yongzhang@bupt.edu.cn
  • 基金资助:
    国家重大科技专项

Adaptive transfer learning framework for dense prediction of human activity recognition

Zhang Zhao 1,Zhang Yong 1 (*), Teng Ying lei1, Guo Da1, Deng Hai qin2   

  1. 1. School of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China 
    2. AIdong Super AI (Beijing) Company Limited, Beijing 100007, China
  • Received:2019-01-21 Revised:2019-04-09 Online:2019-10-31 Published:2019-11-06
  • Contact: ZhangYong, E-mail:yongzhang@bupt.edu.cn E-mail:yongzhang@bupt.edu.cn
  • About author:Corresponding author: ZhangYong, E-mail:yongzhang@bupt.edu.cn
  • Supported by:
    State Major Science and Technology Special Projects

摘要: Human activity recognition (HAR) for dense prediction is proven to be of good performance, but it relies on labeling every point in time series with the high cost. In addition, the performance of HAR model will show significant degradation when tested on the sensor data with different distribution from the training data, where the training data and the test data are usually collected from different sensor locations or sensor users. Therefore, the adaptive transfer learning framework for dense prediction of HAR is introduced to implement cross-domain transfer, where the proposed multi-level unsupervised domain adaptation (MLUDA) approach combines the global domain adaptation and the specific task adaptation to adapt the source and target domain in multiple levels. The multi-connected global domain adaptation architecture is proposed for the first time, which can adapt the output layer of the encoder and the decoder in dense prediction model. After this, the specific task adaptation is proposed to ensure alignment of each class centroid in source domain and target domain by introducing the cosine distance loss and the moving average method. Experiments on three public human activity recognition datasets demonstrate that the proposed MLUDA improves the prediction accuracy of target data by 20% compared to the source domain pre-trained model and it is more effective than the other three deep transfer learning methods with an improvement of 10% to 18% in accuracy.

关键词: transfer learning, human activity recognition, dense prediction, global domain adaptation, specific task adaptation

Abstract: Human activity recognition (HAR) for dense prediction is proven to be of good performance, but it relies on labeling every point in time series with the high cost. In addition, the performance of HAR model will show significant degradation when tested on the sensor data with different distribution from the training data, where the training data and the test data are usually collected from different sensor locations or sensor users. Therefore, the adaptive transfer learning framework for dense prediction of HAR is introduced to implement cross-domain transfer, where the proposed multi-level unsupervised domain adaptation (MLUDA) approach combines the global domain adaptation and the specific task adaptation to adapt the source and target domain in multiple levels. The multi-connected global domain adaptation architecture is proposed for the first time, which can adapt the output layer of the encoder and the decoder in dense prediction model. After this, the specific task adaptation is proposed to ensure alignment of each class centroid in source domain and target domain by introducing the cosine distance loss and the moving average method. Experiments on three public human activity recognition datasets demonstrate that the proposed MLUDA improves the prediction accuracy of target data by 20% compared to the source domain pre-trained model and it is more effective than the other three deep transfer learning methods with an improvement of 10% to 18% in accuracy.

Key words: transfer learning, human activity recognition, dense prediction, global domain adaptation, specific task adaptation