节点文献

GPU通用计算与基于SIFT特征的图像匹配并行算法研究

GPGPU and Image Matching Parallel Algorithm Based on SIFT

【作者】 年华

【导师】 刘西洋;

【作者基本信息】 西安电子科技大学 , 计算机软件与理论, 2010, 硕士

【摘要】 目前,将GPU用于通用计算一直是国内外研究的热点,早期的GPGPU (GPU for general purpose)开发直接使用图形学API编程。这种开发方式难度大,成本高。NVIDIA公司推出的CUDA(Compute Unified Device Architecture,统一计算设备架构)是专门用于GPU通用计算的平台,其简单的编程风格,高效的多线程并行处理模式使得人们在面对计算密集型任务时能够更好地利用到GPU庞大的并行计算资源。另一方面,在图像匹配领域,SIFT算法作为一种基于特征点的匹配算法,能够很好的处理两幅图像之间发生平移、旋转、仿射变换情况下的匹配问题。较强的匹配能力和良好的健壮性使得SIFT算法在图像匹配领域有着广泛的应用。本文深入地分析了CUDA多线程编程模型的硬件架构和软件体系,详细描述了CUDA程序任务的划分,性能的评估以及CUDA程序的优化策略,并仔细分析对比了CUDA GT200架构和新一代架构Fermi的特点,指出了Fermi架构在设计结构上的改进和性能上的优势。在本文的实现部分,详细描述了基于SIFT特征的图像匹配算法在CUDA平台的并行设计与实现,并与CPU上的实现做了对比。实验结果表明,本文所实现的基于CUDA平台的SIFT算法相对于CPU上的实现在性能上能够取得很好的加速效果。

【Abstract】 Currently, using GPU for general-purpose computing has been a hot research topic of the world. The early GPGPU programming is used in graphics API development. The disadvantage of this development method is very difficult and costly. CUDA(Compute Unified Device Architecture) is a tool introduced by NVIDIA which is designed for GPGPU. Its simple programming style and efficient multi-threaded processing model make it has more efficiency in using of GPU hardware resources when processing computing-intensive tasks.On the other side, as a feature point based matching algorithm, SIFT can process the matching problem between two images with translation, rotation and affine transformation. Strong matching ability and good robustness make the SIFT algorithm has a wide application in image match area.In this paper, the hardware architecture and software systems of CUDA are deeply analyzed first; Secondly, tasks dividing, performance and optimization strategy of CUDA program are described; Then GT200 architecture and the next-generation architecture Fermi are compared, and improvements and advantages of Fermi are pointing out.Finally, the implementation of SIFT algorithm on CUDA are described. Compared with its implementation on CPU, implementation on CUDA is able to achieve a good speedup.

  • 【分类号】TP391.41
  • 【被引频次】10
  • 【下载频次】511
节点文献中: