节点文献
粗粒度数据流网络处理器体系结构研究
Research on the Architecture of Coarse-grain Dataflow Network Processor
【作者】 李韬;
【导师】 卢锡城;
【作者基本信息】 国防科学技术大学 , 计算机科学与技术, 2006, 硕士
【摘要】 为了满足日益增长网络带宽的处理要求,网络处理器体系结构设计必须更多地考虑匹配网络报文流的处理特性,基于数据流的网络处理器体系结构可以很好地利用网络应用的流处理特征。本文针对控制流网络处理器在指令级并行性开发和相对固定拓扑结构两方面的不足,将粗粒度数据流模型的设计思想引入到网络处理器体系结构的设计中,提出了一种新型粗粒度数据流网络处理器体系结构CDNP,并深入研究了CDNP中若干关键技术。本文的主要工作和创新点包括:(1)针对控制流网络处理器存在的问题和细粒度同步数据流网络处理器在可编程性方面的缺陷,提出了一种新型的粗粒度数据流网络处理器体系结构CDNP(Coarse-grainDataflow Network Processor)。CDNP在数据流模型的基础上,不仅通过将控制流结构引入到处理单元PE的设计中,提高了整个网络处理器的可编程性;还利用了数据流模型在指令级并行性开发上的优势,有效地开发了工作负载中的任务级并行性,从而获得较高的处理性能和灵活性。(2)针对CDNP报文的数据流处理特点,研究了CDNP中处理单元PE(ProcessingElement)的关键技术。首先,提出了PE中微核指令集的选取和基本逻辑功能的实现方案。其次,针对CDNP令牌处理的数据流驱动特性,确定了PE的令牌处理机制,要求PE的令牌处理模块在满足令牌接收、缓冲、转换、封装、发送等基本功能的基础上,充分考虑令牌处理的数据流驱动特性。最后,针对帧缓冲FB的管理工作,提出以硬件链表方式对帧缓冲FB进行设计,从而支持不同类型工作负载的报文保序。(3)基于CDNP令牌处理路径的软配置机制,论文提出了一种动态令牌处理路径调度算法DTPPS(Dynamic Token Processing Path Scheduling)。DTPPS算法监视CDNP各PE的负载状况,当负载不平衡时,调整工作负载的令牌处理路径,对重负载PE上的任务进行重新映射。模拟结果表明,该算法可较好地平衡各PE的工作负载,有效提高CDNP的系统流量。论文还介绍了基于SoPC(System on Programmable Chip)技术的CDNP原型系统设计。该原型系统使用片上高速通信网络连接4个PE和多个功能模块,可以对CDNP的功能以及令牌处理路径调度等关键技术和算法进行验证。本文的工作对网络处理器的设计具有重要的指导意义。
【Abstract】 To meet the tremendous increasing of the network bandwidth, Network Processors (NPs) design should fully consider the matching with features of packet processing. The dataflow-based NP architecture can take advantage of flow-based characteristic of network applications.Aiming at the limitation of ILP exploitation and the fixed topology of control-flow NP, this dissertation proposes a new scheme of Coarse-grain Dataflow NP architecture (CDNP), by introducing the idea of coarse-grain dataflow design method. Several key techniques of CDNP are also investigated in-depth in this dissertation. As follows, the main work and contributions of the dissertation are:(1) Aiming at the problems of control-flow NP and the shortage in programmable ability of fine-grain synchronous dataflow NP architecture, a new scheme of CDNP architecture is proposed. Based on the data-flow model, CDNP not only improves the programmable ability of the entire NP by introducing the idea of control-flow structure into the design of Processing Element (PE), but also effectively exploits the task-level parallelism by making full use of the advantage of ILP exploitation in data-flow model. So it can get relatively high performance and flexibility in packet processing.(2) Aiming at the data-flow feature in packet processing of CDNP, the key techniques of PE design are researched. Firstly, implementation scheme of uCore ISA chosen and basic logic function are proposed. Secondly, mechanism of token processing in PE is researched, which demands the token processing module should implement the basic functions of token receiving, buffering, transition, encapsulation and sending, and match the characteristic of data-flow driven in token processing. Finally, aiming at management in frame buffer, the idea of hardware linklist is brought forward so as to provide good support in packet ordering of the same workload in the design of frame buffer.(3) Based on the mechanism of soft configuration for token processing path, a dynamic token processing path scheduling algorithm (DTPPS) is proposed. The algorithm monitors workload on each PE in CDNP. When the workloads among PEs become unbalanced, the algorithm prefers to adapt the token processing path of the workload and remap the task of heavy-loaded PE. The simulation shows that this algorithm can well balance the load of each PE and improve the overall throughput of CDNP effectively.Furthermore, the design of CDNP prototype system based on SoPC (System on Programmable Chip) is introduced. Four PEs and several functional modules are connected by communication network on the chip. The basic function of CDNP and some key techniques such as DTPPS can be analyzed and evaluated in depth in the prototype. The work in this dissertation can serve as an important guideline for the design of NPs.
- 【网络出版投稿人】 国防科学技术大学 【网络出版年期】2008年 07期
- 【分类号】TP393.05
- 【被引频次】1
- 【下载频次】75