Automatically Optimized GPU Acceleration of Element Subroutines in Finite Element Method
| Authors | |
|---|---|
| Year of publication | 2012 |
| Type | Article in Proceedings |
| Conference | Symposium on Application Accelerators in High Performance Computing |
| MU Faculty or unit | |
| Citation | |
| Field | Informatics |
| Keywords | GPGPU; code optimization; kernel fusion; FEM |
| Description | The element subroutines in finite element method (FEM) provides enough parallelism to be successfully accelerated by contemporary GPUs. However, their efficient implementation is not straightforward and requires time-consuming exploration of numerous implementation variants. In this paper, we present optimization by kernel fusion for element subroutines. Moreover, we show how the optimization is automated using our source-to-source compiler. We demonstrate the optimization of the element subroutines for FEM model using St.\,Venant-Kirchhoff material. The performance of code generated by our compiler outperforms our previously published hand-tuned implementation by factor of 1.32 -- 1.54 depending on used GPU architecture. Although the optimization technique is demonstrated on element subroutines for using St.\,Venant-Kirchhoff material, it is generally usable for wider area of computationally-demanding problems. |
| Related projects: |