
Research and Realization on Distributed Replica Location in Data Grids (original title: 数据网格中分布式副本定位技术研究与实现)

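【Author】 Wang Fuye (王福业)

【Advisor】 Gao Jingyang (高敬阳)

【Author information】 Beijing University of Chemical Technology, Computer Application Technology, 2008, Master's degree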

【Abstract】 A data grid is a large-scale, scalable architecture for sharing and managing storage resources and distributed data resources in a grid environment. It meets the data sharing and processing needs of data-intensive applications and gives users a mechanism for transparent access to remote, heterogeneous data resources. Replica management is an important component of a data grid. Creating replicas reduces the network latency and bandwidth consumption of remote data access, improves network load balance, and improves data security and reliability as well as the fault tolerance of the system. A good replica management strategy is therefore an important factor in the quality of service of a data grid. Such a strategy covers replica creation, replica selection, and replica location, and replica location is a key link in improving system performance. This thesis studies the replica location strategy within replica management. The main work is as follows:
1. Based on a study of existing replica location techniques, the thesis proposes the PM-Chord (Prefix Matching-Chord) algorithm, which improves and extends the Chord-based P2P replica location mechanism. PM-Chord has the following new features:
(1) It changes how data is stored: data is stored according to a prefix matching principle, node lookup is separated from data lookup, and data is queried by applying prefix matching on top of binary search.
(2) It adds a predecessor replica mechanism to address query "hot spots" in data grids, balance the query load, and improve the stability and reliability of the system.
2. On top of the Globus grid middleware, and with PM-Chord as the core algorithm, a basic implementation of the replica location system PM-RLS was built. The performance of the system was tested and analyzed and compared with the Chord algorithm. The results show that PM-RLS locates replicas efficiently and offers good stability and reliability.
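For readers unfamiliar with the baseline that PM-Chord extends, the following is a minimal sketch of a standard Chord lookup over finger tables, plus one possible reading of a predecessor replica rule. It is not the thesis's implementation: the 8-bit identifier space, the `Ring` class, the `replica_holders` helper, and the choice to place the extra copy on the owner's immediate predecessor are all illustrative assumptions, and the prefix matching storage rule is not reproduced because the abstract does not specify it.

```python
import hashlib
from bisect import bisect_left

M = 8            # identifier bits; illustrative assumption (Chord itself uses 160-bit SHA-1 ids)
RING = 2 ** M


def chord_id(key: str) -> int:
    """Hash a name onto the identifier ring (SHA-1 reduced modulo 2**M)."""
    digest = hashlib.sha1(key.encode("utf-8")).digest()
    return int.from_bytes(digest, "big") % RING


def in_open(x: int, a: int, b: int) -> bool:
    """True if x lies in the ring interval (a, b); a == b means everything except a."""
    return a < x < b if a < b else (x > a or x < b)


def in_half_open(x: int, a: int, b: int) -> bool:
    """True if x lies in the ring interval (a, b]; a == b means the whole ring."""
    return a < x <= b if a < b else (x > a or x <= b)


class Ring:
    """Static Chord ring with globally correct finger tables (no joins or failures)."""

    def __init__(self, node_ids):
        self.nodes = sorted(set(node_ids))
        # Finger i of node n points at successor(n + 2**i).
        self.fingers = {
            n: [self.successor((n + 2 ** i) % RING) for i in range(M)]
            for n in self.nodes
        }

    def successor(self, ident: int) -> int:
        """First node at or clockwise after ident (ground truth via sorted list)."""
        return self.nodes[bisect_left(self.nodes, ident) % len(self.nodes)]

    def predecessor(self, node: int) -> int:
        """Node immediately counter-clockwise of the given node."""
        return self.nodes[self.nodes.index(node) - 1]

    def closest_preceding(self, node: int, ident: int) -> int:
        """Farthest finger of node that still lies strictly before ident."""
        for f in reversed(self.fingers[node]):
            if in_open(f, node, ident):
                return f
        return node

    def lookup(self, start: int, ident: int):
        """Route from start to the node responsible for ident; returns (owner, hops)."""
        node, hops = start, 0
        while not in_half_open(ident, node, self.fingers[node][0]):
            nxt = self.closest_preceding(node, ident)
            if nxt == node:      # safety guard; not expected with correct fingers
                break
            node, hops = nxt, hops + 1
        return self.fingers[node][0], hops


def replica_holders(ring: Ring, key: str):
    """Assumed rule: keep the mapping on the owner and a copy on its predecessor."""
    owner = ring.successor(chord_id(key))
    return [owner, ring.predecessor(owner)]


if __name__ == "__main__":
    ring = Ring([5, 20, 87, 130, 200, 250])
    print(ring.lookup(5, 100))
    print(replica_holders(ring, "lfn://example/file"))
```

Each hop jumps to the farthest finger that does not overshoot the target, which is the halving behaviour the abstract refers to as binary search. Under the assumed replica rule, a query that reaches the predecessor of a heavily requested owner could be answered one hop early, which is one way a predecessor copy might relieve a hot spot.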

【Key words】 Data Grid; Replica; Replica Location; Chord
  • 【Classification code】 TP393.02
  • 【Times cited】 2
  • 【Downloads】 77