Research on Some Key Techniques in the Design of Network Processors

【Author】 Zhang Xiaoming (张晓明)

【Author information】 National University of Defense Technology, Computer Science and Technology, 2006, Ph.D.

【Summary】 To support ever-growing network application services, network devices increasingly exhibit intelligent processing characteristics. Intelligent processing requires network devices not only to provide functions such as multi-layer switching, security processing, and traffic management, but also to have powerful protocol processing capability and flexible programmability, so that they can quickly accommodate the addition and configuration of new services. Network processors based on ASIP (Application Specific Instruction Processor) technology have therefore been widely adopted in network devices and have become core components driving the development of the next-generation Internet. Targeting the system design and implementation of network processors, this dissertation studies early-stage design and performance evaluation methods from a system design perspective, and investigates in depth several key implementation techniques of network processors. The main innovations are:

(1) To address the difficulty of design evaluation and design selection for network processors, and drawing on MPSoC (Multi-Processor System on Chip) system design and the characteristics of network packet processing, the dissertation proposes YH-NPDF (YinHe Network Processor Design Framework), a design space development framework for network processors. Following the platform-based design approach, the framework describes network applications with the Reactive Dataflow Process Network (RDPN), combines this description with a parameterized hardware architecture model to evaluate processing performance, and uses a global annealing genetic algorithm to search the design space quickly and select the preferred system design. In network application modeling, hardware resource modeling, and design selection, YH-NPDF adapts well to the intelligent packet processing requirements of network processor design and development.

(2) For network processors with a parallel structure, the dissertation proposes a packet parallel scheduling algorithm based on a Fuzzy Feedback Control Loop (F2CL). The algorithm uses the F2CL mechanism to improve system load balance. It uses a flow cache to hold the scheduling information of packet flows, preferentially adjusts heavily loaded flows when the load is unbalanced, and allows subsequent packets of the same flow to be remapped once the flow has timed out, thereby effectively controlling packet reordering. Experimental results show that the algorithm preserves packet order well while maintaining load balance, and that its overall performance is better than that of existing comparable algorithms.

(3) Based on the characteristics of packet buffering in network processors, the dissertation proposes a multi-channel packet buffer structure based on Pipelining Input and Parallel Output (PIPO). PIPO uses pipelined input scheduling to handle the write request sequence at the input, parallel output scheduling to handle the read request sequence at the output, and an access policy to optimize the memory access efficiency of the input and output ports. Compared with the traditional FCFS scheduling method, PIPO achieves higher bandwidth utilization and lower instantaneous bandwidth jitter at the input and output ports.

The dissertation also describes a network processor prototype based on SoPC (System on Programmable Chip) and implemented on an Altera FPGA. The prototype contains four microprocessor cores and, through software control and co-processor acceleration, supports four Gigabit Ethernet interfaces. Using this prototype, the dissertation analyzes and discusses in depth the instruction set extension and co-processor sharing mechanisms in the parallel processing structure, and verifies key techniques proposed in the dissertation, such as the F2CL scheduling algorithm. This work offers important guidance for the design of network processors.

【Abstract】 With the development of network applications, network devices need more intelligent processing capability. This requires network devices to provide various functions (e.g., multi-layer switching, security processing, and traffic management) as well as powerful protocol processing capability and programmability, so that novel network services can be quickly deployed and configured in these devices. Network processors (NPs) based on Application Specific Instruction Processor (ASIP) technology have emerged to meet these requirements and are widely used in network devices. NPs have already become one of the core devices of the next-generation Internet.

This dissertation focuses on the system design and implementation of NPs. It presents an early-stage design method and a performance evaluation method for NPs from the standpoint of system design, and investigates several key implementation technologies of NPs in depth. The main contributions of the dissertation are as follows:

(1) To support design selection and performance evaluation in NP system design, the YinHe Network Processor Design Framework (YH-NPDF) is constructed according to the characteristics of Multi-Processor System on Chip (MPSoC) design and the requirements of network packet processing. YH-NPDF follows the idea of platform-based design. It adopts the Reactive Dataflow Process Network (RDPN) model to describe network applications and establishes a parameterized model of NP hardware resources; the application model is mapped onto the parameterized architecture model to evaluate NP performance. A global annealing genetic algorithm is used to accelerate the search of the design space and to optimize the design decisions for the NP system. YH-NPDF can be used to model network applications and hardware resources and to support design selection, meeting the requirements of intelligent packet processing in the early system design of NPs.

(2) For network processors based on parallel processing elements (PEs), a packet parallel scheduling algorithm based on a Fuzzy Feedback Control Loop (F2CL) is proposed. The algorithm uses the F2CL scheme to improve load balancing among multiple PEs, and deploys a flow cache to hold the scheduling information of packet flows. Packet reordering is controlled by two methods: when the workloads among PEs become unbalanced, the algorithm preferentially adjusts the heavily loaded flows; and successive packets belonging to the same flow may be remapped to another PE only after a flow timeout. Simulation results show that, with well-chosen design parameters, the algorithm preserves packet order well while maintaining load balance, and achieves better overall performance on load balancing and packet ordering than other algorithms.

(3) Based on the characteristics of the packet buffer memory in NPs, a multi-channel packet buffer memory system using the Pipelining Input and Parallel Output (PIPO) scheme is proposed. PIPO schedules the write request sequence in a pipelined manner at the input and processes the read request sequence in parallel at the output. Both use a memory access policy to improve the efficiency of memory access. The effectiveness of PIPO, its adaptability to variable packet lengths, and the extensibility of its buffer bandwidth are evaluated by theoretical analysis and by simulation experiments with extrapolated workloads. Compared with traditional memory scheduling schemes for packet buffering such as FCFS, PIPO achieves more efficient memory access and higher utilization of buffer bandwidth, while incurring less jitter in instantaneous bandwidth on both inputs and outputs.

Furthermore, a prototype network processor based on SoPC (System on Programmable Chip) is implemented on an Altera FPGA. Four soft processor cores (Altera Nios II) are embedded in the prototype chip, which supports four 1000 Mbps Ethernet interfaces through co-processor acceleration under software control. Instruction set extension and co-processor sharing schemes for the parallel processing architecture of NPs are analyzed and evaluated in depth on the prototype, and the F2CL-based packet scheduling algorithm is verified. The work in this dissertation can serve as an important guideline for the design of NPs.
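
The flow-cache idea in contribution (2) can be illustrated with a minimal sketch. It is not the dissertation's F2CL algorithm: the fuzzy feedback controller and the adjustment of heavily loaded flows are omitted, and the class names, the `flow_timeout` parameter, and the least-loaded selection rule are all assumptions made for illustration. It only shows how pinning an active flow to one PE preserves order, while a flow that has been idle longer than a timeout may be remapped.

```python
from dataclasses import dataclass, field


@dataclass
class FlowEntry:
    pe: int           # processing element the flow is currently mapped to
    last_seen: float  # arrival time of the flow's most recent packet


@dataclass
class FlowCacheDispatcher:
    """Hypothetical dispatcher; names and policy are illustrative assumptions."""
    num_pes: int
    flow_timeout: float  # assumed idle gap after which remapping is allowed
    load: list = field(init=False)
    cache: dict = field(init=False, default_factory=dict)

    def __post_init__(self):
        # Cumulative packet count per PE, used here as a crude load measure.
        self.load = [0] * self.num_pes

    def dispatch(self, flow_id, now):
        """Return the PE index for a packet of flow_id arriving at time now."""
        entry = self.cache.get(flow_id)
        if entry is None or now - entry.last_seen > self.flow_timeout:
            # New or timed-out flow: (re)map it to the least-loaded PE.
            # Earlier packets of an idle flow are assumed to have drained,
            # so remapping here should not reorder the flow.
            pe = min(range(self.num_pes), key=self.load.__getitem__)
            self.cache[flow_id] = FlowEntry(pe, now)
        else:
            # Active flow: stay on the cached PE to keep packets in order.
            pe = entry.pe
            entry.last_seen = now
        self.load[pe] += 1
        return pe
```

With two PEs and a timeout of 1.0, packets of an active flow keep returning the same PE index from `dispatch`, and the flow becomes eligible for a different PE only after it has been silent for more than the timeout.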

【Key words】 Network processor; Design space development; Packet parallel scheduling; Packet buffering; Co-Processor

【Online publication contributor】 National University of Defense Technology

【Classification number】 TP332
【Times cited】 13
【Downloads】 471