3D Kirchhoff integral prestack migration based on GPUs

【Author】 Liu Weifeng (1, 2), Zhao Gaishan (3), Kong Xiangning (3), Cai Jiexiong (3), Zhang Bing (3)

【Affiliations】 1. Sinopec Exploration and Production Research Institute, Beijing 100083, China; 2. Sinopec Key Laboratory of Multi-Components Seismic Technology, Beijing 100083, China; 3. Sinopec Geophysics Research Institute, Nanjing 210014, China

【Abstract】 Three approaches are proposed to expose the parallelism of 3D Kirchhoff integral prestack migration on many-core GPUs (graphics processing units). First, a pipeline-parallel framework is constructed from two separate host threads: a data transfer thread and a GPU compute context thread. Within this framework, asynchronous input/output (I/O) is realized directly, reducing the time spent transferring data between the GPUs and network-attached storage. Second, a fully loaded GPU thread arrangement is used to maximize instruction throughput. Third, the texture cache and constant cache are used to reduce off-chip memory accesses, and fixed-function units are used to compute transcendental functions. Experimental results show that the optimized implementation on an nVidia Tesla C1060 GPU achieves a speedup of about 20 times over the sequential version of the algorithm on an Intel Xeon E5430 CPU. Finally, the performance of the algorithm is compared on three different GPU architectures, and the floating-point absolute error between the CPU and GPU results is shown to be only 0.3×10⁻⁵, within an error threshold of 0.5×10⁻⁴.
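
The two-thread pipeline described in the abstract can be illustrated with a minimal host-side sketch. This is not the authors' implementation: it is written in Python rather than CUDA, the names `run_pipeline`, `transfer_stage`, and `compute_stage` are hypothetical, and the `compute` callback is only a stand-in for a GPU kernel launch. It shows one assumed way a data transfer thread and a compute thread can overlap through a bounded buffer.

```python
import queue
import threading


def run_pipeline(blocks, compute, depth=2):
    """Overlap loading and computing with two host threads.

    blocks  : iterable yielding input blocks (stand-in for reads from storage)
    compute : function applied to each block (stand-in for a GPU kernel launch)
    depth   : number of blocks that may be buffered between the two stages
    """
    buffer = queue.Queue(maxsize=depth)  # bounded, so the loader cannot run far ahead
    results = []
    done = object()                      # sentinel marking the end of input

    def transfer_stage():
        # Data transfer thread: fetch blocks and hand them to the compute stage.
        for block in blocks:
            buffer.put(block)            # waits while the buffer is full
        buffer.put(done)

    def compute_stage():
        # Compute thread: process blocks in arrival order until the sentinel.
        while True:
            block = buffer.get()
            if block is done:
                break
            results.append(compute(block))

    threads = [threading.Thread(target=transfer_stage),
               threading.Thread(target=compute_stage)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return results


if __name__ == "__main__":
    # Hypothetical input: four small "trace blocks"; the compute step just sums each one.
    trace_blocks = [[1.0, 2.0], [3.0, 4.0], [5.0], [6.0, 7.0, 8.0]]
    print(run_pipeline(iter(trace_blocks), sum))
```

While the compute thread works on one block, the transfer thread is free to fetch the next, which is the overlap that hides I/O latency. The bounded buffer (`depth`) limits how much host memory the prefetched blocks can occupy.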

【Key words】 parallel computing; graphics processing unit; Kirchhoff integral prestack migration; pipeline parallelism; asynchronous input/output; compute unified device architecture (CUDA)

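【Funding】 National High Technology Research and Development Program of China (2009AA01A140); Sinopec Group Science and Technology Development Project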

【Source】 Journal of Huazhong University of Science and Technology (Natural Science Edition), 2011, Issue S1

【Classification code】 TP391.41
【Online publication time】 2011-06-17 12:45
【Times cited】 2
【Downloads】 123
