JOURNAL OF CHINA UNIVERSITIES OF POSTS AND TELECOM ›› 2018, Vol. 25 ›› Issue (4): 66-74.doi: 10.19682/j.cnki.1005-8885.2018.1018

• Networks • Previous Articles     Next Articles

IDRMS: an improved dynamic replication management scheme for cloud storage

Wang Jun, Wang Yue, Wang Menglin, Liu Junjie   

  1. Department of Communication and Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing 210003, China
  • Received:2017-12-07 Revised:2018-09-25 Online:2018-08-30 Published:2018-11-02
  • Contact: Wang Jun,E-mail:wang_jun@njupt.edu.cn E-mail:wang_jun@njupt.edu.cn
  • About author:Wang Jun,E-mail:wang_jun@njupt.edu.cn
  • Supported by:
    This work was supported by the National Natural Science Foundation of China (61401234).

Abstract: Cloud storage is getting more and more popular as a new trend of data management. Data replication has been widely used as a means of increasing the data availability in large-scale cloud storage systems where failures are normal. However, most data replication schemes do not fully consider cost and latency issues when users need large amounts of remote replicas. We present animproved dynamic replication management scheme (IDRMS). By adding a prediction model, the optimal allocation of replicas among the cloud storage nodes is determined that the total communication cost and network delay are minimal. When the local data block is frequently requested, the data replicas can be moved to a closer or cheaper node for cost reduction and increased efficiency. Moreover, we replace
the B+ tree with the B*tree to speed up the search speed and reduce workload with the lowest blocking probability. We define the value of popularity to adjust the placement of replicas dynamically. We divide the data nodes in the network into hot nodes and cool nodes. By changing to visit cool nodes instead of hot nodes, we can balance the workload in the network. Finally, we implement IDRMS in Matlab simulation platform and simulation results demonstrate that IDRMS outperforms other replication management schemes in terms of communication cost and load balancing for large-scale cloud storage.

Key words: cloud storage, dynamic replication management, blocking probability, prediction, load balancing