节点文献

集群环境下TCP连接迁移的研究与实现

【作者】 黄翊

【导师】 张民选;

【作者基本信息】 国防科学技术大学 , 计算机科学与技术, 2002, 硕士

【摘要】 随着微处理器技术和高性能网络技术的飞速发展,集群计算逐渐成为一种有成本效益的并行/分布式计算资源。集群计算系统的核心问题是资源的共享及有效利用,只有平衡的负载才能达到最大的资源使用率。为了充分有效的利用整个集群资源,进程必须能够在集群中自由的迁移以达到有效的利用底层资源的能力和适应负载的动态变化。进程间的网络连接不仅直接影响迁移的透明性,带来了迁移限制,而且导致极大的剩余相关性,对迁移后的性能和系统的健壮性有很大的负面影响。因此,必须能够提供对网络连接迁移的支持,保持进程迁移前打开的网络连接的能力。我们将其称为套接字迁移。 本文研究了TCP/IP协议及其在LINUX系统下的实现,分析了网络连接的状态,给出了网络连接状态的提取、网络连接迁移和恢复的解决方案,解决了集群环境下进程迁移如何透明的保持TCP网络连接这一问题。本文还给出了套接字迁移在MOSIX集群系统上的一个原型实现。 本文对网络连接的迁移进行了有意义的探索,对集群系统设计和进程迁移机制研究有一定的参考价值。

【Abstract】 Improvements in microprocessor and networking performance have made cluster system a very attractive platform for high-end parallel and distributed computing. The core problem in cluster system is how to share the cluster resources and fully utilize them. The best performance can not be achieved when loads among nodes are unbalanced. Achieving good performance requires that processes be able to migrate between cluster resources and efficiently adapt to the underlying resource capabilities. However,the network communication between processes not only affects the transparency of migration,but also puts limit on some processes which may cause them unfit for migration . Another drawback is that it may cause residual dependency on the previous host which will impacts on the performance and robustness of the system. Therefore we must provide a mechanism to support the migration of network connections and maintain network connectivity after process migration. We call this mechanism "Socket Migration".In the thesis,the TCP/IP protocol and it’s implementation on Linux is studied. The thesis presents how to retrieve the state of network connection and how to migrate established TCP connection. The problem that how to maintain network connection after process migration can be solved perfectly with our methods. We also implement a prototype of "Socket Migration" on the basis of MOSIX system.The thesis makes a series of researches on migration of network connection and can help in the design of cluster system and study of process migration mechanism.

  • 【分类号】TP393.03
  • 【被引频次】4
  • 【下载频次】169
节点文献中: 

本文链接的文献网络图示:

本文的引文网络