Impact of data set noise on distributed deep learning

doi:10.19682/j.cnki.1005-8885.2020.1005

The Journal of China Universities of Posts and Telecommunications ›› 2020, Vol. 27 ›› Issue (2): 37-45.doi: 10.19682/j.cnki.1005-8885.2020.1005

• Artificial intelligence • Previous Articles Next Articles

Impact of data set noise on distributed deep learning

Guo Qinghao, Shuai Liguo, Hu Sunying

School of Mechanical Engineering, Southeast University, Nanjing 211100, China

Received:2019-02-24 Revised:2020-06-14 Online:2020-04-30 Published:2020-07-07
Contact: Shuai Liguo, E-mail: liguo.shuai@126.com E-mail:liguo.shuai@126.com
About author:Shuai Liguo, E-mail: liguo.shuai@126.com
Supported by:

Abstract

Abstract: The training efficiency and test accuracy are important factors in judging the scalability of distributed deep learning. In this dissertation, the impact of noise introduced in the mixed national institute of standards and technology database (MNIST) and CIFAR-10 datasets is explored, which are selected as benchmark in distributed deep learning. The noise in the training set is manually divided into cross-noise and random noise, and each type of noise has a different ratio in the dataset. Under the premise of minimizing the influence of parameter interactions in distributed deep learning, we choose a compressed model (SqueezeNet) based on the proposed flexible communication method. It is used to reduce the communication frequency and we evaluate the influence of noise on distributed deep training in the synchronous and asynchronous stochastic gradient descent algorithms. Focusing on the experimental platform TensorFlowOnSpark, we obtain the training accuracy rate at different noise ratios and the training time for different numbers of nodes. The existence of cross-noise in the training set not only decreases the test accuracy and increases the time for distributed training. The noise has positive effect on destroying the scalability of distributed deep learning.

Key words: distributed deep learning, stochastic gradient descent, parameter server (PS), dataset noise

CLC Number:

Guo Qinghao, Shuai Liguo, Hu Sunying. Impact of data set noise on distributed deep learning[J]. The Journal of China Universities of Posts and Telecommunications, 2020, 27(2): 37-45.

References

References
1. Zhao J H, Gao H B, Liu Y C, et al. Speech recognition algorithm based on neural network and hidden Markov model. Journal of China Universities of Posts and Telecommunications, 2018, 25(4): 28 -37
2. Gao H B, Cheng B, Wang J Q, et al. Object classification using CNN-based fusion of vision and LIDAR in autonomous vehicle environment. IEEE Transactions on Industrial Informatics, 2018, 14(9): 4224 -4231
3. Coates A, Huval B, Wang T, et al. Deep learning with COTS HPC systems. 30th International Conference on Machine Learning, June 16 -21, 2013, Atlanta, 2013: 1337 -1345
4. Wang Y X, Zhao S H, Yu Y Y, et al. Speech bandwidth extension based on restricted boltzmann machines. Journal of Electronics and Information Technology, 2016, 38(7): 1717 -1723
5. Wen W, Xu C, Yan F, et al. Terngrad: ternary gradients to reduce communication in distributed deep learning. 31st Annual Conference on Neural Information Processing Systems, Dec 4 -9, 2017, Long Beach, CA, USA, 2017: 1510 -1520
6. Campos V, Sastre F, Yagues M, et al. Distributed training strategies for a computer vision deep learning algorithms on a distributed GPU cluster. 18th IEEE/ACM International Conference on Computational Science, June 12 -14, 2017, Zurich, 2017: 315 -324
7. Ren Y, Wu X, Zhang L, et al. IRDMA: efficient use of RDMA in distributed deep learning systems. 19th IEEE Intl Conference on High Performance Computing and Communications, Dec 18 -20, 2017, Bangkok, 2018: 231 -238
8. Ichinose A, Nakada H, Takefusa A, et al. Pipeline-based processing of the deep learning framework caffe. 11th International Conference on Ubiquitous Information Management and Communication, Jan 5 -7, 2017, Beppu, 2017: 1 -8
9. Ma T, Wang F, Tian Y S, et al. A novel ensemble forecasting algorithm based on distributed deep learning network. International Journal of Performability Engineering, 2019, 15(11): 2927 -2935
10. Gao H B, Xie G T, Liu H Z, et al. Lateral control of autonomous vehicle based on learning driver behaviour via cloud model. Journal of China Universities of Posts and Telecommunications, 2017, 24(2): 10 -17
11. Zhou Q, Zhou W A, Yang B, et al. Deep cycle autoencoder for unsupervised domain adaptation with generative adversarial networks. IET Computer Vision, 2019, 13(7): 659 -665
12. Gao H B, Zhang X Y, Zhang T L, et al. Research of intelligent vehicle variable granularity evaluation based on cloud model. Acta Electronica Sinica, 2016, 44(2): 365 -374 (in Chinese)
13. Li D Y, Gao H B. A hardware platform framework for an intelligent vehicle based on a driving brain. Engineering, 2018, 4(4): 464 -470
14. Li M, Andersen D G, Smola A, et al. Communication efficient distributed machine learning with the parameter server. 28th Annual Conference on Neural Information Processing Systems 2014, Dec 8 -13, 2014, Montreal, 2014: 19 -27
15. Khumoyun A, Cui Y, Hanku L. Spark based distributed deep learning framework for big data applications. 2016 International Conference on Information Science and Communications Technologies, Nov 2 -4, 2016, Tashkent, Uzbekistan, 2016: 1 -5
16. Langer M, Hall A, He Z, et al. MPCA SGD-a method for distributed training of deep learning models on spark. IEEE Transactions on Parallel and Distributed Systems, 2018, 29(11):
2540 -2556
17. Mikyoung L, Sungho S, Seungkyun H, et al. BAIPAS: distributed deep learning platform with data locality and shuffling. 2017 European Conference on Electrical Engineering and Computer Science, Nov 17 -19, 2017, Bern, 2017: 5 -8
18. Guo Q, Liu Q L, Wang W M. Fast SqueezeNet algorithm with application in metro crowd density estimation. Control Theory and Applications, 2019, 36(7): 1036 -1046 (in Chinese)

Metrics

Comments

Copyright © 2020 The Journal of China Universities of Posts and Telecommunications
　 Adress: P.O. Box 231,Beijing University of Posts and Telecommunications,10 Xi Tucheng Road,Beijing 100876,P.R.China　Post Code: 100081
Tel：86-010-62282493　Fax： 86-010-62283461　E-mail: jchupt@bupt.edu.cn
Support by: Beijing Magtech Co.Ltd

[1]	He Mingshu, Jin Lei, Wang Xiaojuan, Li Yuan. Web log classification framework with data augmentation based on GANs [J]. The Journal of China Universities of Posts and Telecommunications, 2020, 27(5): 34-46.
[2]	. Joint multi-QoS and energy saving routing for LEO satellite network [J]. The Journal of China Universities of Posts and Telecommunications, 2019, 26(3): 25-34.
[3]	. Location aided probabilistic broadcast algorithm for mobile Ad-hoc network routing [J]. JOURNAL OF CHINA UNIVERSITIES OF POSTS AND TELECOM, 2017, 24(2): 66-71.

Impact of data set noise on distributed deep learning

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 3

Recommended Articles

Metrics

Comments