The Journal of China Universities of Posts and Telecommunications ›› 2020, Vol. 27 ›› Issue (2): 37-45. DOI: 10.19682/j.cnki.1005-8885.2020.1005

• Artificial Intelligence •

Impact of data set noise on distributed deep learning

Guo Qinghao, Shuai Liguo, Hu Sunying   

  1. School of Mechanical Engineering, Southeast University, Nanjing 211100, China
  • Received: 2019-02-24  Revised: 2020-06-14  Online: 2020-04-30  Published: 2020-07-07
  • Contact: Shuai Liguo, E-mail: liguo.shuai@126.com
  • About author: Shuai Liguo, E-mail: liguo.shuai@126.com

Abstract: Training efficiency and test accuracy are important factors in judging the scalability of distributed deep learning. In this paper, we explore the impact of noise introduced into the Modified National Institute of Standards and Technology (MNIST) database and the CIFAR-10 dataset, which are selected as benchmarks for distributed deep learning. The noise in the training set is manually divided into cross-noise and random noise, and each type of noise is injected into the dataset at different ratios. To minimize the influence of parameter interactions in distributed deep learning, we choose a compressed model (SqueezeNet) together with the proposed flexible communication method, which reduces the communication frequency, and we evaluate the influence of noise on distributed deep training under both the synchronous and the asynchronous stochastic gradient descent (SGD) algorithms. On the experimental platform TensorFlowOnSpark, we obtain the training accuracy at different noise ratios and the training time for different numbers of nodes. The results show that cross-noise in the training set not only decreases the test accuracy but also increases the time required for distributed training; such noise markedly degrades the scalability of distributed deep learning.
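
The abstract does not specify how the two noise types are constructed. As a minimal sketch, assuming cross-noise means training labels flipped to a guaranteed-wrong class and random noise means labels replaced uniformly at random (so a replacement may coincide with the true label), the corruption could look like the Python fragment below. The function names, the corruption rules, and the label-level reading of "noise" are illustrative assumptions, not the authors' implementation.

    import numpy as np

    def inject_cross_noise(labels, ratio, num_classes, rng):
        # Flip a `ratio` fraction of labels to a different class: the offset
        # is drawn from 1..num_classes-1, so a corrupted label is always wrong.
        noisy = labels.copy()
        n = int(ratio * len(labels))
        idx = rng.choice(len(labels), size=n, replace=False)
        offset = rng.integers(1, num_classes, size=n)
        noisy[idx] = (noisy[idx] + offset) % num_classes
        return noisy

    def inject_random_noise(labels, ratio, num_classes, rng):
        # Replace a `ratio` fraction of labels with uniformly random classes;
        # by chance a replacement may equal the true label.
        noisy = labels.copy()
        n = int(ratio * len(labels))
        idx = rng.choice(len(labels), size=n, replace=False)
        noisy[idx] = rng.integers(0, num_classes, size=n)
        return noisy

    rng = np.random.default_rng(0)
    labels = rng.integers(0, 10, size=60000)   # MNIST-sized label vector
    noisy = inject_cross_noise(labels, ratio=0.2, num_classes=10, rng=rng)
    print((noisy != labels).mean())            # close to the requested 0.2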

Key words: distributed deep learning, stochastic gradient descent, parameter server (PS), dataset noise
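
As context for the synchronous and asynchronous SGD schemes named above, here is a minimal NumPy sketch of the two parameter server (PS) update rules. It assumes the textbook formulation of each scheme and does not reflect the paper's TensorFlowOnSpark implementation; all names are hypothetical.

    import numpy as np

    def sync_sgd_step(params, worker_grads, lr):
        # Synchronous SGD: the PS waits for gradients from all workers,
        # averages them, and applies one update per global step.
        return params - lr * np.mean(worker_grads, axis=0)

    def async_sgd_step(params, grad, lr):
        # Asynchronous SGD: the PS applies each worker's gradient as soon
        # as it arrives, so a gradient may have been computed from stale
        # parameters (no barrier, higher throughput, noisier updates).
        return params - lr * grad

    params = np.zeros(4)
    worker_grads = [np.ones(4), 2 * np.ones(4), 3 * np.ones(4)]
    params = sync_sgd_step(params, worker_grads, lr=0.1)   # one averaged update
    for g in worker_grads:                                  # three staggered updates
        params = async_sgd_step(params, g, lr=0.1)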
