节点文献

面向图像处理的可配置处理器设计与实现

Design and Implement a Configurable Processor for Image Processing

【作者】 谭星亮

【导师】 吴顺华;

【作者基本信息】 天津大学 , 微电子学与固体电子学, 2010, 硕士

【摘要】 可配置处理器可以针对具体应用做出配置,得到不同运算性能的硬件电路,并且具有可编程性,在SoC设计中,用可配置处理器完成数据密集型的运算任务时,比通用微处理器具有更强的计算能力,比ASIC(Application Specific Integrated Circuit)具有更大的灵活性,可加快开发周期。本论文设计了一个面向图像处理的可配置处理器模板——T*Core,该处理器是根据传输触发架构(TTA,Transport-Triggered Architecture)设计。使用时,根据特定图像处理的应用,配置相关参数就可生成一款具体的T*Core处理器硬件电路。本论文对图像处理中几种基本的算法进行了分析,包括图像增强常用的卷积滤波和中值滤波算法,图像压缩常用的离散余弦变换,以及图像的缩放算法,并总结出这些算法的特点,为T*Core处理器功能单元的定制提供依据。本论文对T*Core处理器的设计进行了详细说明,包括T*Core内部结构、指令格式与流水线、数据通路的构成、各个功能单元的设计、立即数存放机制,以及T*Core内部资源的编址等。功能单元作为T*Core处理中的核心运算组件,直接影响着处理器计算性能,其结构根据图像处理算法的特点来设计,如带排序功能的加减法功能单元、浮点乘累加功能单元、带二维寻址功能的存储器访问功能单元以及无延时跳转控制功能单元,都将提升图像处理程序运行的速度。硬件验证时,本论文采用C*Core C310作为主处理器,T*Core作为从处理器搭建了一个硬件SoC验证平台,系统工作主频为30MHz,图像处理的结果最终显示在QVGA屏幕上,结果说明T*Core能正确地完成图像处理的功能。并将T*Core可配置处理器与通用型微处理器C*Core C310和ARM926EJ进行速度对比,结果说明在完成同等计算量的任务时,T*Core执行速度比通用微处理器要高出很多。

【Abstract】 Configurable processor is a special processor, which will be implemented in hardware circuit by the configuration according to the application. The computing performance of configurable processor is better than general-purpose processor. And configurable processor is more flexible than ASIC (Application Specific Integrated Circuit) because of its programmability. In this thesis, a template of configurable processor named T*Core is designed for image processing. T*Core processor is designed based on Transport Triggered Architecture. A specific processor will be implemented by the configuration according to the application of image processing.This thesis analyzes several algorithms of image processing, including image convolution filtering, median filtering, discrete cosine transform (DCT), and image scaling. Details of design of T*Core processor are illustrated in this thesis, including the architecture of the processor, instruction format and pipeline, data path, function units (FU), immediate data, and addressing of internal resources. Function units are the most important computing components of T*Core processor, which influence the performance of the processor a lot. The circuit construction of function unit is customized according to the algorithms of image processing. Several function units are customized in this thesis, such as addition and subtraction FU with sorting function, multiply-accumulate FU of floating point operation, load-store FU with two dimension accessing of data memory, jump control FU without delay. When image processing programs are running, the speed will be increased by using the customized function units.In the hardware verification, a SoC platform is built, which works under the frequency of 30MHz. In this SoC platform, C*Core C310 is used as the main processor and T*Core as the co-processor. The image after processing by T*Core is displayed on the QVGA screen, and the result shows that T*Core can execute the image processing program correctly. The same image processing programs are also run at general-purpose microprocessors C*Core C310 and ARM926EJ, and the result shows that T*Core is faster than general-purpose microprocessors when executing the same computing program.

【关键词】 可配置处理器图像处理传输触发架构SoC
【Key words】 Configurable ProcessorImage ProcessingTTASoC
  • 【网络出版投稿人】 天津大学
  • 【网络出版年期】2012年 02期
  • 【分类号】TN47
  • 【被引频次】1
  • 【下载频次】39
节点文献中: