Department of Computer Science
Implementation of a lattice boltzmann method for large eddy simulation on multiple gpus
Recently, the Graphic Processor Unit (GPU) has evolved into a highly parallel, multithreaded, many-core processor with tremendous computational horsepower and very high memory bandwidth. To improve the simulation efficiency of complex flow phenomena in the field of computational fluid dynamics, a CUDA-based simulation algorithm of large eddy simulation using multiple GPUs is proposed. Our implementation adopted the "collision after propagation" scheme and performed the propagation process by global memory reading transactions. The working set is split up into equal sub-domains and assigned to each GPU for simplicity. Using recently released hardware, up to four GPUs can be controlled by a single CPU thread and run in parallel. The results show that our multi-GPU implementation could perform simulations on a rather large scale (meshes: 10240x10240) even using double-precision floating point calculation and achieved 190X speedup over the sequential implementation on CPU. © 2012 IEEE.
Large eddy simulation, Lattice Boltzmann method, Multi-GPU Computing, Parellel computing
Source Publication Title
Proceedings of the The 14th IEEE International Conference on High Performance Computing and Communications/ The 9th IEEE International Conference on Embedded Software and Systems
Liverpool, United Kingdom
Link to Publisher's Edition
Li, Qinjian, Chengwen Zhong, Kai Li, Guangyong Zhang, Xiaowei Lu, Qing Zhang, Kaiyong Zhao, and Xiaowen Chu. "Implementation of a lattice boltzmann method for large eddy simulation on multiple gpus." Proceedings of the The 14th IEEE International Conference on High Performance Computing and Communications/ The 9th IEEE International Conference on Embedded Software and Systems (2012): 818-823.