中国邮电高校学报(英文) ›› 2020, Vol. 27 ›› Issue (5): 69-81.doi: 10.19682/j.cnki.1005-8885.2020.0030

所属专题: 人工智能专题

• Wireless • 上一篇    下一篇

RFID Indoor positioning based on Semi-supervised Actor-Critic Co-training

李丽; 郑嘉利; 全艺璇; 林子涵; 李映超; 黄天星   

  1. 1. 广西大学
    2. 广西多媒体通信与网络技术重点实验室
    3. 广西民族大学
  • 收稿日期:2020-01-22 修回日期:2020-08-14 出版日期:2020-10-22 发布日期:2020-10-23
  • 通讯作者: 郑嘉利 E-mail:zji@gxu.edu.cn
  • 基金资助:
    国家自然科学基金

RFID Indoor positioning based on Semi-supervised Actor-Critic Co-training

Li Li, Zheng Jiali, Quan Yixuan, Lin Zihan, Li Yingchao, Huang Tianxing   

  1. Guangxi University Guangxi University for Nationalities
  • Received:2020-01-22 Revised:2020-08-14 Online:2020-10-22 Published:2020-10-23
  • Supported by:

    the National Natural Science Foundation of China and the Natural Science Foundation of Guangxi Province, China

摘要: For large-scale radio frequency identification ( RFID) indoor positioning system, the positioning scale is relatively large, with less labeled data and more unlabeled data, and it is easily affected by multipath and white noise. An RFID positioning algorithm based on semi-supervised actor-critic co-training (SACC) was proposed to solve this problem. In this research, the positioning is regarded as Markov decision-making process. Firstly, the actor-critic was combined with random actions and selects the unlabeled best received signal arrival intensity (RSSI) data by co-training of the semi-supervised. Secondly, the actor and the critic were updated by employing Kronecker-factored approximation calculate (K-FAC) natural gradient. Finally, the target position was obtained by co-locating with labeled RSSI data and the selected unlabeled RSSI data. The proposed method reduced the cost of indoor positioning significantly by decreasing the number of labeled data. Meanwhile, with the increase of the positioning targets, the actor could quickly select unlabeled RSSI data and updates the location model. Experiment shows that, compared with other RFID indoor positioning algorithms, such as twin delayed deep deterministic policy gradient (TD3), deep deterministic policy gradient (DDPG), and actor-critic using Kronecker-factored trust region ( ACKTR), the proposed method decreased the average positioning error respectively by 50.226%, 41.916%, and 25.004%. Meanwhile, the positioning stability was improved by 23.430%, 28.518%, and 38.631%.

关键词: RFID, RSSI, semi-supervised actor-critic, Kronecker-Factored, co-training

Abstract: For large-scale radio frequency identification ( RFID) indoor positioning system, the positioning scale is relatively large, with less labeled data and more unlabeled data, and it is easily affected by multipath and white noise. An RFID positioning algorithm based on semi-supervised actor-critic co-training (SACC) was proposed to solve this problem. In this research, the positioning is regarded as Markov decision-making process. Firstly, the actor-critic was combined with random actions and selects the unlabeled best received signal arrival intensity (RSSI) data by co-training of the semi-supervised. Secondly, the actor and the critic were updated by employing Kronecker-factored approximation calculate (K-FAC) natural gradient. Finally, the target position was obtained by co-locating with labeled RSSI data and the selected unlabeled RSSI data. The proposed method reduced the cost of indoor positioning significantly by decreasing the number of labeled data. Meanwhile, with the increase of the positioning targets, the actor could quickly select unlabeled RSSI data and updates the location model. Experiment shows that, compared with other RFID indoor positioning algorithms, such as twin delayed deep deterministic policy gradient (TD3), deep deterministic policy gradient (DDPG), and actor-critic using Kronecker-factored trust region ( ACKTR), the proposed method decreased the average positioning error respectively by 50.226%, 41.916%, and 25.004%. Meanwhile, the positioning stability was improved by 23.430%, 28.518%, and 38.631%.

Key words: RFID, RSSI, semi-supervised actor-critic, Kronecker-Factored, co-training

中图分类号: