The Journal of China Universities of Posts and Telecommunications (English edition), 2020, Vol. 27, Issue (5): 23-33. doi: 10.19682/j.cnki.1005-8885.2020.0023

• Artificial Intelligence •

Semantic segmentation of track image based on deep neural network

Wang Zhaoying, Zhou Junhua, Liao Zhonghua, Zhai Xiang, Zhang Lianping

  1. Beijing University of Posts and Telecommunications; Beijing Simulation Center; Beijing Institute of Electronic System Engineering; Alibaba Cloud Computing
  • Received: 2019-11-29 Revised: 2020-04-10 Online: 2020-10-22 Published: 2020-10-23
  • Contact: Wang Zhaoying E-mail: wangzhaoying@bupt.edu.cn
  • Supported by:
    the National Natural Science Foundation of China; the Key Special Project in Intergovernmental International Scientific and Technological Innovation Cooperation of the National Key Research and Development Program of China

Abstract: In this paper, deep learning is used to solve the railway track recognition problem in intrusion detection. Railway track recognition can be viewed as a semantic segmentation task, which extends image processing to pixel-level prediction. The encoder-decoder model DeepLabv3+ is applied in this work because of its good performance on semantic segmentation tasks. Because the experimental dataset consists of railway track images collected from video surveillance in the train cab, the following improvements are made to the model. First, to address the over-fitting caused by the limited amount of training data, data augmentation and transfer learning are applied to enrich the diversity of the data and enhance model robustness during training. Second, different gradient descent methods are compared to find the best optimizer for training the model parameters. Third, to address the serious imbalance between positive and negative samples, cross entropy (CE) loss is replaced by focal loss (FL). The effectiveness of the improved DeepLabv3+ model with these changes is demonstrated by experimental results under different system parameters.

Key words: railway track recognition, convolutional neural networks, semantic segmentation, DeepLabv3+
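
To illustrate the loss substitution mentioned in the abstract, the following is a minimal NumPy sketch of binary focal loss next to binary cross entropy. It is not the authors' implementation: the function names, the toy pixel values, and the defaults `gamma=2.0` and `alpha=0.25` (the commonly cited defaults for focal loss) are assumptions, since the abstract does not state the settings used.

```python
import numpy as np

def binary_cross_entropy(p, y, eps=1e-7):
    """Per-pixel CE loss. p: predicted probability of the positive class, y: 0/1 label."""
    p = np.clip(p, eps, 1 - eps)          # avoid log(0)
    p_t = np.where(y == 1, p, 1 - p)      # probability assigned to the true class
    return -np.log(p_t)

def binary_focal_loss(p, y, gamma=2.0, alpha=0.25, eps=1e-7):
    """Per-pixel focal loss: -alpha_t * (1 - p_t)^gamma * log(p_t).

    gamma and alpha are assumed defaults, not values taken from the paper.
    """
    p = np.clip(p, eps, 1 - eps)
    p_t = np.where(y == 1, p, 1 - p)
    alpha_t = np.where(y == 1, alpha, 1 - alpha)
    return -alpha_t * (1 - p_t) ** gamma * np.log(p_t)

# Hypothetical pixels: a hard positive (track, predicted 0.3)
# and an easy negative (background, predicted 0.05).
probs = np.array([0.3, 0.05])
labels = np.array([1, 0])

ce = binary_cross_entropy(probs, labels)
fl = binary_focal_loss(probs, labels)
print("CE:", ce)
print("FL:", fl)
print("FL / CE:", fl / ce)  # the easy negative is scaled down far more than the hard positive
```

The `(1 - p_t) ** gamma` factor shrinks the contribution of pixels the model already classifies confidently, so the many easy background pixels no longer dominate the total loss.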
