节点文献

系统仿真数据分布式计算环境的研究与实现

Research and Implementation of System Simulation Data Distributed Computing Environment

【作者】 陈亮

【导师】 吴继娟;

【作者基本信息】 哈尔滨工程大学 , 计算机应用技术, 2008, 硕士

【摘要】 目前,系统仿真数据的计算都是在一台机器上进行计算的。系统仿真数据是用来评价支撑业务系统的网络系统的性能的,这就需要实时的计算系统仿真数据,以便向用户展现业务系统运行时的各个网络设备的负载情况,方便用户判断所选设备是否合理。然而由于仿真数据量大,在单机上很难满足向用户实时交互的需求。这就有必要通过分布式计算技术把地理上广泛分布的各种资源连成一个整体,共同完成计算任务。利用网络主机的资源开展分布式计算已经成为一种重要的高性能计算方式,它提供了更好的性能价格比,更容易解决一些在单台计算机上难以解决的复杂问题。本文着重分析了系统仿真数据,针对系统仿真数据要求实时性、计算量大、单机处理速度慢等特点,采用了基于Master—Worker模型的三层体系结构的分布式计算处理方案;在实现技术上,采用了WebServices技术,在计算节点端采用Web Services技术将系统仿真数据的计算应用发布为Web服务。首先论文给出了系统架构和各个模块的功能。由于任务计算需要很长的时间,计算节点发生错误的可能性增加,针对单个计算节点失效引起的计算上的巨大损失问题,采用了检查点功能。接着对系统中的任务调度算法进行了研究,对蚁群算法进行改进,即性能高的机器分配的任务比较多,性能低的机器分配的任务比较少,通过合理分配减少任务的运算时间。

【Abstract】 At present, system simulation data is calculated on one computer. System simulation data is used to evaluate the performance of the network system which supports transaction system. This requires real-time computing system simulation data, in order to demonstrate network equipment’s load to users when transaction system is running. Simulation data is so much as it is difficult of meeting real-time interaction to users when it is on one computer. It is necessary to connect different resources which are scattered on the internet into integrated one with the distributed computing technology. The new one will accomplish computing tasks together.Distributed computation using free resource on the network has been an important way of high-performance computation. It is more convenient than one computer to solve complex problems with providing better performance-price ratio. It analyzes system simulation data in thesis In view of the characteristics of system simulation data, such as real-time, the amount of computation, low processing speed of one computer, it adopts a three-layer structure based on the Master-Worker. In the realization of technology, it adopts Web Services in thesis. The computation application of system simulation data is deployed as a computation Web Services on the Worker.Firstly, it introduces the frameworks and the function of each module in thesis. Because it is a task which needs a long time to run, checkpoint function is adopted in order to avoid a great loss caused by one compute node’s failure. Secondly, job assignment algorithms in the system are researched and an improved ant colony optimization is introduced which means that more tasks are assigned to high-performance computer and less tasks are assigned to low-performance computer and it will reduce the computing time by this way.

  • 【分类号】TP391.9
  • 【下载频次】56
节点文献中: