The Journal of China Universities of Posts and Telecommunications ›› 2020, Vol. 27 ›› Issue (5): 69-81.doi: 10.19682/j.cnki.1005-8885.2020.0030

Special Issue: 人工智能专题

Previous Articles     Next Articles

RFID Indoor positioning based on Semi-supervised Actor-Critic Co-training

Li Li, Zheng Jiali, Quan Yixuan, Lin Zihan, Li Yingchao, Huang Tianxing   

  1. Guangxi University Guangxi University for Nationalities
  • Received:2020-01-22 Revised:2020-08-14 Online:2020-10-22 Published:2020-10-23
  • Supported by:

    the National Natural Science Foundation of China and the Natural Science Foundation of Guangxi Province, China

Abstract: For large-scale radio frequency identification ( RFID) indoor positioning system, the positioning scale is relatively large, with less labeled data and more unlabeled data, and it is easily affected by multipath and white noise. An RFID positioning algorithm based on semi-supervised actor-critic co-training (SACC) was proposed to solve this problem. In this research, the positioning is regarded as Markov decision-making process. Firstly, the actor-critic was combined with random actions and selects the unlabeled best received signal arrival intensity (RSSI) data by co-training of the semi-supervised. Secondly, the actor and the critic were updated by employing Kronecker-factored approximation calculate (K-FAC) natural gradient. Finally, the target position was obtained by co-locating with labeled RSSI data and the selected unlabeled RSSI data. The proposed method reduced the cost of indoor positioning significantly by decreasing the number of labeled data. Meanwhile, with the increase of the positioning targets, the actor could quickly select unlabeled RSSI data and updates the location model. Experiment shows that, compared with other RFID indoor positioning algorithms, such as twin delayed deep deterministic policy gradient (TD3), deep deterministic policy gradient (DDPG), and actor-critic using Kronecker-factored trust region ( ACKTR), the proposed method decreased the average positioning error respectively by 50.226%, 41.916%, and 25.004%. Meanwhile, the positioning stability was improved by 23.430%, 28.518%, and 38.631%.

Key words: RFID, RSSI, semi-supervised actor-critic, Kronecker-Factored, co-training

CLC Number: