Automatically Optimized GPU Acceleration of Element Subroutines in Finite Element Method
Authors | |
---|---|
Year of publication | 2012 |
Type | Article in Proceedings |
Conference | Symposium on Application Accelerators in High Performance Computing |
MU Faculty or unit | |
Citation | |
Field | Informatics |
Keywords | GPGPU; code optimization; kernel fusion; FEM |
Description | The element subroutines in finite element method (FEM) provides enough parallelism to be successfully accelerated by contemporary GPUs. However, their efficient implementation is not straightforward and requires time-consuming exploration of numerous implementation variants. In this paper, we present optimization by kernel fusion for element subroutines. Moreover, we show how the optimization is automated using our source-to-source compiler. We demonstrate the optimization of the element subroutines for FEM model using St.\,Venant-Kirchhoff material. The performance of code generated by our compiler outperforms our previously published hand-tuned implementation by factor of 1.32 -- 1.54 depending on used GPU architecture. Although the optimization technique is demonstrated on element subroutines for using St.\,Venant-Kirchhoff material, it is generally usable for wider area of computationally-demanding problems. |
Related projects: |