节点文献

P2P网络流量控制管理若干关键技术研究

Research on Key Technologies of Peer-to-Peer Network Traffic Control Management

【作者】 吴敏

【导师】 王汝传;

【作者基本信息】 南京邮电大学 , 信息网络, 2011, 博士

【摘要】 最近几年,P2P技术作为一种新的互联网应用模式迅速风靡全球,作为改变互联网应用模式的一项新技术,P2P业务己悄然占据了互联网业务流量总量的70%以上。P2P流量的无序性在一定程度上打破了网络运营商原有的运营和商业模式,并且巨大的流量也给运营商的流量管理带来了巨大的压力。提供一个可运营可管理的P2P基础平台,协调好用户、P2P服务提供商、网络提供商各方的利益,使整个P2P产业链健康协调地发展,将是未来的发展方向。为应对P2P流量对网络带宽消耗带来的挑战,运营商一方面通过加大投入,增加带宽来解决问题;另一方面,希望找到合适的技术方法实现P2P流量的有效管理,通过限制或引导P2P流量,避免带宽被P2P流量过多地消耗。因此要实现可靠的数据传输以及合理的网络流量资源分配,就需要根据P2P网络流量的行为特性建立相应的P2P流量识别、预测与控制模型,形成完善的P2P流量控制管理机制。本文在深入分析P2P流量管理技术研究的现状、现有的P2P协议及相关技术的基础上,对P2P流量控制管理研究中的P2P流量检测技术、预测算法和控制机制这些关键技术分别进行了深入研究。为验证本文提出的算法和模型的正确性,最后设计并实现了一个基于移动代理的P2P流量控制管理系统,系统演示结果表明该流量管理系统能对网络中的P2P流量进行有效的检测、控制和管理,从而能保证网络中其他应用业务的带宽使用,为网络运营商提高网络管理水平提供良好的参考应用价值和实际应用意义。本文的主要贡献如下:(1)针对现有P2P流量识别方法的不足,吸收现有方法的优点,提出了一种基于流量属性特征选择的P2P流量分类方法,通过实验表明基于FCBF选择的SVM算法对P2P流量能进行良好的识别;为了提高P2P流量识别的敏感度和精度还提出了一种基于损失函数的支持向量机P2P流量检测模型,使用模糊隶属度概念对SVM算法的结果进行解释,结合损失函数的概念对SVM算法进行改进,并应用于P2P流量识别,能够对新的未知的P2P协议和加密传输的P2P流进行一定的识别。但是由于基于属性特征选择的P2P流量识别方法速度慢,只能进行离线识别,为此本文提出了一种基于流传输特性的实时P2P流量检测方法,该方法是一种轻量级的实时在线识别方法,能够高效快速的实时在线进行P2P协议的识别。实验结果表明所提出的方法是可行的。(2)提出了基于小波分析的P2P业务流量预测模型。该模型结合卡尔曼滤波和小波分析技术,使用小波重构实现对一类非平稳随机过程的平稳部分的各个分量进行更新估计,并且使用卡尔曼滤波方法可以在一个周期内利用某个时刻及其以前的信息对其后所有时刻进行预报,即实现了周期内实时的动态多步预报。同时,与小波分析方法相比,该方法具有实时性和递归性,可在时域中对网络流量进行实时的动态估计和分析。并且具有与真实网络流量重合度高,预测误差小的特点,能达到良好的跟踪和预测性能,同传统的非线性预测模型如神经网络等相比,其复杂度较低便于实时预测,可满足网络运营商对P2P流量管理的要求。(3)为了对P2P流量进行有效的控制,提出了两个流量本地化的策略:即P2P缓存系统和一种基于CDN路由的轻量级节点距离测量方案。前者既为内网用户提供可管理控制的P2P服务,保证用户的P2P应用体验,又能减轻主干网的带宽压力。因此本文对缓存系统部署架构及各个模块的功能和校园网内的实测结果进行了深入研究。后者在节点选择机制上利用网络路由机制,将逻辑距离和实际地理距离结合,有利于实现流量本地化的。仿真实验证明这种节点选择方案对于P2P流量本地化的意义。(4)为了验证上述部分算法和方法的有效性,本文实现了一个基于移动代理的P2P网络流量管理原型系统。在原型系统的P2P流量检测模块中,结合了流的实时传输特征和传统的DPI方法实现了对P2P流量的实时在线识别;在其P2P流量控制模块中,我们主要结合了传统的流量控制手段。原型系统中的两个模块相互补充,可以单独供网络管理者使用。通过实现的系统,把本文的相关研究成果应用其中,良好的展示了本文的研究极具有使用价值和实际应用意义。

【Abstract】 As a new model of Internet applications, Peer-to-Peer (P2P) technology has rapidly become a hot topic in the world with its traffic having occupied more than 70% of the total internet traffic. Meanwhile,the huge P2P random traffic has not only broken the original network operator’s operation and business model but also brings more challenge for ISPs to achieve more effective traffic management.The future development direction is to provide a P2P foundation platform which is operatable and managable, and can coordinate all benefits of users, P2P serve providers as well as network providers to promote the healthy development of the entire P2P industry chain. In order to deal with the challenge of huge P2P traffic, on the one hand the operator expands the investment and increases the band width. On the other hand, they are looking for appropriate technique to reliaze effective traffic management which can limit and guide the P2P traffic so as to avoid the excessive band comsumption by the P2P traffic. Therefore in order to realize reliable data transmission and reasonable network taffic resource distribution, it is impretive to establish corresponding control management mechanism for P2P taffic detection, forecast and control model according to the behavioral characteristics of P2P network traffic.Based on the deep research of P2P traffic control management product, current p2p protocols as well as the relevant techqiues, the thesis studies the key mechanisms for P2P control management including P2P taffic detection technology, P2P traffic forecast algorithm and its control mechanism.The main contribution of this thesis is as follows:(1) Aiming at the existing deficiencies of P2P traffic identification methods, and absorbing the advantages of existing methods, the thesis proposes a P2P traffic classification based on feature selection and flow properties of traffic. A large number of experiments show that SVM algorithm based on FCBF method does well in P2P traffic identification. To improve the sensitivity and accuracy of SVM algorithm , a P2P traffic detection model based on loss function of SVM is also proposed, which uses the concept of fuzzy theory and loss function of SVM algorithm to explain the results of SVM algorithm. Analysis shows that the improved algorithm is more accurate and can be applied to identify those unknown, new P2P protocol or those which adopt encrypted transmission. However, due to the P2P traffic identification method based on feature selection is slow which can only be used for offline recognition, the thesis presents a lightweight stream characteristics based online identification method which is fast and efficient in real-time P2P traffic identification. Experimental results show that the proposed method is feasible.(2) A novel P2P traffic forecasting model based on wavelet analysis is proposed. Combined with Kalman filter and wavelet analysis techniques, the proposed model uses wavelet reconstruction to update and estimate the various components of the smooth part of a class of non-stationary random process, meanwhile the Kalman filtering method is used to achieve a dynamic multi-step prediction . The method uses all their previous information to predict the traffic in a cycle. Compared with the wavelet analysis, the proposed method is real-time and recursive and can be used for estimation and analysis of dynamic real-time network traffic. Simulations show that it achieves good tracking and forecasting performance for its high degree of overlap with the real network traffic and its small forecasting errors. Besides, compared with the traditional non-linear prediction models such as neural networks, the proposed methods has the low complexity and is suitable for real time prediction, which can meet the traffic management requirements of P2P network operators.(3) To realize P2P traffic localization, two methods are thoroughly studied, namely , a P2P caching system and a lightweight node distance measurement based on the CDN routing. The former scheme can not only provide users within the network the control of P2P services and ensure their P2P applications experience, but also can reduce the backbone bandwidth pressure. Therefore the structure and deployment of P2P caching system as well as its measurement in the campus network environment is thoroughly studied in the thesis.The latter node selection mechanism combined the logical distance with the actual geographical distance which is conducive to flow localization. Finally, some simulation results show its significance in P2P traffic localization.(4) In order to verify the effectiveness of the proposed algorithms and methods, we implemented a mobile agent-based P2P network traffic management system. In the P2P traffic detection module of the realized system, we combine the stream-based real-time transmission characteristics and the traditional method of DPI for on-line identification of P2P traffic. In its P2P traffic control module, it is mainly a combination of the traditional methods. The realized system includes two modules which complements each other, and can be used separately for network administrators. By applying research results to the realisitic management system of P2P traffic, the thesis exhibits the significance in practical application.

节点文献中: 

本文链接的文献网络图示:

本文的引文网络