中国邮电高校学报(英文版) ›› 2020, Vol. 27 ›› Issue (2): 72-81.doi: 10.19682/j.cnki.1005-8885.2020.1009

• Others • 上一篇    下一篇

Cleaning RFID data streams based on K-means clustering method

Lin Qiaomin, Fa Anqi, Pan Min, Xie Qiang, Du Kun, Sheng Michael   

  1. 1. College of Computer, Nanjing University of Posts and Telecommunications, Nanjing 210023, China
    2. College of Education Science and Technology, Nanjing University of Posts and Telecommunications, Nanjing 210023, China
    3. Department of Computing, Macquarie University, Sydney 2109, Australia
    4. College of Overseas Education, Nanjing University of Posts and Telecommunications, Nanjing 210023, China
  • 收稿日期:2019-12-12 修回日期:2020-05-11 出版日期:2020-04-30 发布日期:2020-07-07
  • 通讯作者: Lin Qiaomin, E-mail: lqm@njupt.edu.cn E-mail:lqm@njupt.edu.cn
  • 作者简介:Lin Qiaomin, E-mail: lqm@njupt.edu.cn
  • 基金资助:

    This work was supported by National Natural Science Foundation of China (61907025, 61807020), Scientific Research Foundation of Jiangsu High Technology Research Key Laboratory for Wireless Sensor Networks (WSNLBZY201512), Jiangsu Government Scholarship for Studying Abroad.

Cleaning RFID data streams based on K-means clustering method

Lin Qiaomin, Fa Anqi, Pan Min, Xie Qiang, Du Kun, Sheng Michael   

  1. 1. College of Computer, Nanjing University of Posts and Telecommunications, Nanjing 210023, China
    2. College of Education Science and Technology, Nanjing University of Posts and Telecommunications, Nanjing 210023, China
    3. Department of Computing, Macquarie University, Sydney 2109, Australia
    4. College of Overseas Education, Nanjing University of Posts and Telecommunications, Nanjing 210023, China
  • Received:2019-12-12 Revised:2020-05-11 Online:2020-04-30 Published:2020-07-07
  • Contact: Lin Qiaomin, E-mail: lqm@njupt.edu.cn E-mail:lqm@njupt.edu.cn
  • About author:Lin Qiaomin, E-mail: lqm@njupt.edu.cn
  • Supported by:
    This work was supported by National Natural Science Foundation of China (61907025, 61807020), Scientific Research Foundation of Jiangsu High Technology Research Key Laboratory for Wireless Sensor Networks (WSNLBZY201512), Jiangsu Government Scholarship for Studying Abroad.

摘要: Currentlyradio frequency identification (RFID) technology has been widely used in many kinds of applications. Store retailers use RFID readers with multiple antennas to monitor all tagged items. However, because of the interference from environment and limitations of the radio frequency technology, RFID tags are identified by more than one RFID antenna, leading to the false positive readings. To address this issue, we propose a RFID data stream cleaning method based on K-means to remove those false positive readings within sampling time. First, we formulate a new data stream model which adapts to our cleaning algorithm. Then we present the preprocessing method of the data stream model, including sliding window setting, feature extraction of data stream and normalization. Next, we introduce a novel way using K-means clustering algorithm to clean false positive readings. Last, the effectiveness and efficiency of the proposed method are verified by experiments. It achieves a good balance between performance and price.

关键词: false positive reading, data stream, K-means, RFID

Abstract: Currentlyradio frequency identification (RFID) technology has been widely used in many kinds of applications. Store retailers use RFID readers with multiple antennas to monitor all tagged items. However, because of the interference from environment and limitations of the radio frequency technology, RFID tags are identified by more than one RFID antenna, leading to the false positive readings. To address this issue, we propose a RFID data stream cleaning method based on K-means to remove those false positive readings within sampling time. First, we formulate a new data stream model which adapts to our cleaning algorithm. Then we present the preprocessing method of the data stream model, including sliding window setting, feature extraction of data stream and normalization. Next, we introduce a novel way using K-means clustering algorithm to clean false positive readings. Last, the effectiveness and efficiency of the proposed method are verified by experiments. It achieves a good balance between performance and price.

Key words: false positive reading, data stream, K-means, RFID