中国邮电高校学报(英文) ›› 2018, Vol. 25 ›› Issue (4): 66-74.doi: 10.19682/j.cnki.1005-8885.2018.1018

• Networks • 上一篇    下一篇

IDRMS: an improved dynamic replication management scheme for cloud storage

Wang Jun, Wang Yue, Wang Menglin, Liu Junjie   

  1. Department of Communication and Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing 210003, China
  • 收稿日期:2017-12-07 修回日期:2018-09-25 出版日期:2018-08-30 发布日期:2018-11-02
  • 通讯作者: Wang Jun,E-mail:wang_jun@njupt.edu.cn E-mail:wang_jun@njupt.edu.cn
  • 作者简介:Wang Jun,E-mail:wang_jun@njupt.edu.cn
  • 基金资助:
    This work was supported by the National Natural Science Foundation of China (61401234).

IDRMS: an improved dynamic replication management scheme for cloud storage

Wang Jun, Wang Yue, Wang Menglin, Liu Junjie   

  1. Department of Communication and Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing 210003, China
  • Received:2017-12-07 Revised:2018-09-25 Online:2018-08-30 Published:2018-11-02
  • Contact: Wang Jun,E-mail:wang_jun@njupt.edu.cn E-mail:wang_jun@njupt.edu.cn
  • About author:Wang Jun,E-mail:wang_jun@njupt.edu.cn
  • Supported by:
    This work was supported by the National Natural Science Foundation of China (61401234).

摘要: Cloud storage is getting more and more popular as a new trend of data management. Data replication has been widely used as a means of increasing the data availability in large-scale cloud storage systems where failures are normal. However, most data replication schemes do not fully consider cost and latency issues when users need large amounts of remote replicas. We present animproved dynamic replication management scheme (IDRMS). By adding a prediction model, the optimal allocation of replicas among the cloud storage nodes is determined that the total communication cost and network delay are minimal. When the local data block is frequently requested, the data replicas can be moved to a closer or cheaper node for cost reduction and increased efficiency. Moreover, we replace
the B+ tree with the B*tree to speed up the search speed and reduce workload with the lowest blocking probability. We define the value of popularity to adjust the placement of replicas dynamically. We divide the data nodes in the network into hot nodes and cool nodes. By changing to visit cool nodes instead of hot nodes, we can balance the workload in the network. Finally, we implement IDRMS in Matlab simulation platform and simulation results demonstrate that IDRMS outperforms other replication management schemes in terms of communication cost and load balancing for large-scale cloud storage.

关键词: cloud storage, dynamic replication management, blocking probability, prediction, load balancing

Abstract: Cloud storage is getting more and more popular as a new trend of data management. Data replication has been widely used as a means of increasing the data availability in large-scale cloud storage systems where failures are normal. However, most data replication schemes do not fully consider cost and latency issues when users need large amounts of remote replicas. We present animproved dynamic replication management scheme (IDRMS). By adding a prediction model, the optimal allocation of replicas among the cloud storage nodes is determined that the total communication cost and network delay are minimal. When the local data block is frequently requested, the data replicas can be moved to a closer or cheaper node for cost reduction and increased efficiency. Moreover, we replace
the B+ tree with the B*tree to speed up the search speed and reduce workload with the lowest blocking probability. We define the value of popularity to adjust the placement of replicas dynamically. We divide the data nodes in the network into hot nodes and cool nodes. By changing to visit cool nodes instead of hot nodes, we can balance the workload in the network. Finally, we implement IDRMS in Matlab simulation platform and simulation results demonstrate that IDRMS outperforms other replication management schemes in terms of communication cost and load balancing for large-scale cloud storage.

Key words: cloud storage, dynamic replication management, blocking probability, prediction, load balancing