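Design and Implementation of High-Availability Management Software for Multi-Node Cluster Systems
【Author】 Zhang Wensheng;
【Advisor】 Xu Zhiwei;
【Author information】 Graduate School of the Chinese Academy of Sciences (Institute of Computing Technology), Computer Organization and System Architecture, 2000, Master's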
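【Abstract (translated from the Chinese)】 Research on highly available computer systems has long been an important topic in computer science and engineering. As commercial services are increasingly delivered over the Internet, this research has become more important still, because the availability of a service system has a major effect on the service provider's commercial interests. At the same time, the content and reach of the services delivered through computer service systems keep expanding, so the systems themselves must keep growing, and existing small-scale high-availability systems can no longer meet the high-availability needs of computer systems at this scale. Research on scalable high-availability cluster systems is therefore very important. One main subject of this thesis is the key problems that arise in designing and implementing high-availability management software for multi-node high-availability cluster systems, together with their solutions. We first study the relationship between the architecture of the high-availability management software and system scalability, and analyze and compare two typical architectures, the "peer-peer" architecture and the "structural" architecture. We then study the design of the interface between the high-availability management software and applications, comparing three strategies: the "black box" strategy, the "cluster-aware application" strategy, and the "virtual cluster-aware application" strategy. The concept of server consolidation has drawn growing attention in recent years, and a cluster system with a single login point is an architecture well suited to implementing it. The other aim of this thesis is to present and evaluate the design and implementation of the high-availability management software of the Dawning Server Consolidation (DSC) system, which is built on the Dawning2000 cluster system. The software implements the basic functions of high-availability management software for a multi-node cluster system.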
【Abstract】 Research on highly available computer systems has been active in recent years. With the rapid growth of commercial services delivered through the Internet, this field has become more important, mainly because the availability of computer services has a great effect on the profits of service vendors. Another effect of this trend is that larger-scale highly available computer systems are in great demand. As a result, research on scalable, highly available computer clusters has become necessary, and this thesis addresses that issue. One major part of this thesis concerns the essential issues in designing HA (high availability) management software for multi-node clusters. We first focus on the relationship between the architecture of the HA management software and the scalability of the cluster; two typical architectures, "peer-peer architecture" and "structural architecture", are analyzed and compared. We then turn to the interface between the HA management software and applications; three interface strategies, "black box", "cluster-aware applications" and "virtual cluster-aware applications", are analyzed and compared. The other major part of this thesis is the design and implementation of DSC's HA management software. DSC (Dawning Server Consolidation) is a server consolidation system built on the Dawning2000 computer cluster. DSC's HA management software implements the functions that are essential to the HA management software of a multi-node HA cluster.
【Key words】 high availability; highly available system; cluster; high availability management software; scalability;
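- 【Online publication contributor】 Graduate School of the Chinese Academy of Sciences (Institute of Computing Technology) 【Online publication year and issue】 2007, Issue 02
- 【Classification code】 TP311.52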
- 【Times cited】 1
- 【Downloads】 86