节点文献

流媒体集群系统复制存储策略研究

Research on Replication Policy for Clustered Streaming Media System

【作者】 卫星

【导师】 奚宏生;

【作者基本信息】 中国科学技术大学 , 控制理论与控制工程, 2009, 博士

【摘要】 宽带化高速网络技术、动态影像压缩解码技术与大容量存储技术的成熟和日益增长的互联网多媒体需求,促成了流媒体的诞生和发展。流媒体就是应用流技术在网络上传输的多媒体文件,而流技术就是把连续的影像和声音信息经过压缩处理后放入网络服务器,让用户一边下载一边观看、收听,而不需要等整个压缩文件下载后才可观看的网络传输技术。集群是由一组网络互联且独立的分布式存储、流化服务器构成,和集中式结构相比具有高可扩展性、高可用性、高性价比三方面的优势。副本放置问题是指在分布式环境下,为了优化特定的系统性能指标而对目标数据(文件)进行副本生成、副本放置、副本替换等一系列操作的问题,通常分为静态和动态两种典型方式。本文研究复制存储方式下的流媒体服务器集群的副本放置/内容部署问题。首先研究了在固定节目流行度的情况下,如何进行内容优化部署以最小化流媒体集群系统拒绝率和降低复制存储消耗的问题。首先运用排队理论知识分析得出优化目标和服务器访问概率之间的数值联系,并且通过某些数值方法确定出系统最小拒绝率情况下的最优服务器访问概率。由于内容部署属于NP-Hard问题且完全决定每台服务器的访问概率,本文设计了初始放置、副本交换和对等副本访问概率调整三种启发式策略来进行内容部署,以满足在优化内容分布下每台服务器访问概率和最优值之间的差异最小,从而实现降低系统拒绝率和存储代价的目标。最后分别采用数值分析和离散事件仿真验证了模型的正确性和算法的有效性。其次研究了同构流媒体集群服务器环境下的动态副本放置问题,提出一种请求调度和动态内容部署相结合的新颖策略。首先采用滑动窗的方式预测各文件的点播率,以平衡服务器之间的实时累积访问概率为目标,在不进行内容部署的情况下,预分配相同文件不同副本的访问概率;同时进行零迁移代价的动态内容更新,从而能够提前反映出请求的实时倾向性,在降低系统请求拒绝率的同时,有效的减少了请求被分发到服务器后产生负载迁移的频度和代价。仿真分析的结果表明这种策略对于流媒体服务系统的研究和实践具有指导意义。最后结合离散事件和排队理论,独创性地搭建了一个普遍适用的服务器集群系统仿真平台。本文提出的流媒体仿真平台,符合实际运行系统特点,各组成单元均具有相对独立性,可以通过自由组合的方式,适应于各种不同参数要求、优化策略、运行环境的仿真。而且由于对每个功能严格细化,保证了不同策略下仿真结果的公平性、透明性。

【Abstract】 Recent advances in high-speed network technology,dynamic video data compression and data decompression technology,bulk storage technology,and the increasing demand of network multimedia,accelerates the birth and development of streaming media.Streaming media is the multimedia transferred over network by stream technology. Stream technology,which compressing continuous audio or video program and placing them to the web server,can make the clints have no use for downloading the whole program from the server.The program can be listened and watched when it is being downloaded.The clustered server system,consisting a group of distributed streaming servers,is superior to the centralized structure in three aspects:good scalability, high availability,and competitive performance-to-price ratio.Replica Placement Problem(RPP),which involving how to produce the replicas of data or files,how to distribute the replicas to the servers and how to replace the old replicas with demanding replicas,et.,has the goal of enhancing the performance of the system.Typically, RPP formulations fall into two categories:static and dynamic.Our research focuses on replication policy or content distribution for clustered server system.Firstly,the optimizing problem of content distribution which minimizes the blocking probability and storage consumption on clustered streaming media system is discussed, in the case of knowing every program’ s unchanged popularity.The queuing theory is adopted to analysis the relationship between the server’ s access probability and the optimizing goal.The ideal access probability of every server can be obtained by some numerical methods,under the circumstance of minimal blocking probability. Content distribution determining each server’ s access probability,has been proved to be NP-Hard.The whole content distribution process consists of three strategies,i.e. initial allocating,duplicate swapping and peer duplicate’ s access probability adjusting. All the heuristic arithmetic is designed to perform the content distribution in order to minimize the distance between the result of optimization and the ideal one,minimize the storage consumption and reduce the blocking probability.Lastly the correctness of system modeling and the efficiency of proposed arithmetic are verified by numerical analysis and discrete event simulation.Secondly,the problem of dynamic content deployment for clustered streaming media system consisting of homogeneous servers is also discussed in this paper.We propose a dynamic storage balancing(DSB) based content updating strategy to lower the service blocking probability,to balance the load of media servers and to reduce the update cost.A relatively short time window is adopted to predict the changing popularity of all multimedia files.Unlike other exiting policies applying the predicted result to replicating and de-replicating,we assign it properly to each replica in the cluster for the purpose of unifying every server’ s access probability(AP),in other words,making the system be in a condition of DSB.Then we gradually replicate the newly popular objects to enhance the ability of DSB.Meanwhile,we adopt a dereplication strategy with zero in-service request migration cost to avoid storage space being exhausted by replicas of previously popular objects.The simulation analysis shows that the proposed policy can achieve an outstanding system performance.Lastly,a novel simulation platform for server cluster is brought forward which is based on queueing theory.The component elements of the platform may be assembled easily to execute various simulation because of their relatively independence.Since the function of each sanction is subdivided strictly,the whole platform which is in accordance with the actual system can guarantee the clarity and fairness of different policies.

节点文献中: 

本文链接的文献网络图示:

本文的引文网络