Solution to PDEs using radial basis function finite-differences (RBF-FD) on multiple GPUs

Evan F. Bollig; Natasha Flyer; Gordon Erlebacher

doi:10.1016/j.jcp.2012.06.030

Research output: Contribution to journal › Article › peer-review

73 Scopus citations

Abstract

This paper presents parallelization strategies for the radial basis function-finite difference (RBF-FD) method. As a generalized finite differencing scheme, the RBF-FD method functions without the need for underlying meshes to structure nodes. It offers high-order accurate approximation and scales as O(N) per time step, where N is the total number of nodes. To our knowledge, this is the first implementation of the RBF-FD method to leverage GPU accelerators for the solution of PDEs. Additionally, this implementation is the first to span both multiple CPUs and multiple GPUs. OpenCL kernels target the GPUs, and inter-processor communication and synchronization are managed by the Message Passing Interface (MPI). We verify our implementation of the RBF-FD method with two hyperbolic PDEs on the sphere, and demonstrate up to 9x speedup on a commodity GPU with unoptimized kernel implementations. On a high-performance cluster, the method achieves up to 7x speedup for the maximum problem size of 27,556 nodes.
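
To make the phrase "generalized finite differencing scheme" concrete, the following is a minimal sketch of how RBF-FD differentiation weights can be computed for a single stencil. It is not the authors' implementation: the paper targets PDEs on the sphere with OpenCL and MPI, whereas this sketch assumes a 1D stencil, a Gaussian basis, and NumPy. The function name rbf_fd_weights_1d and the shape parameter eps are hypothetical choices made for illustration.

```python
import numpy as np

def rbf_fd_weights_1d(nodes, center, eps=1.0):
    """Return weights w so that sum(w * f(nodes)) approximates f'(center).

    Illustrative assumption: Gaussian RBF phi(r) = exp(-(eps * r)**2),
    with no polynomial augmentation.
    """
    nodes = np.asarray(nodes, dtype=float)
    # Pairwise differences between the stencil nodes.
    diff = nodes[:, None] - nodes[None, :]
    # Interpolation matrix: A[i, j] = phi(|x_i - x_j|).
    A = np.exp(-(eps * diff) ** 2)
    # Right-hand side: d/dx of phi(|x - x_j|), evaluated at x = center.
    d = center - nodes
    b = -2.0 * eps**2 * d * np.exp(-(eps * d) ** 2)
    # The weights differentiate every basis function exactly at the center.
    return np.linalg.solve(A, b)

# Usage: approximate d/dx sin(x) at x_c from five scattered-data values.
h = 0.1
x_c = 0.3
stencil = x_c + h * np.arange(-2, 3)
w = rbf_fd_weights_1d(stencil, x_c)
approx = w @ np.sin(stencil)
```

Because each node uses a stencil of fixed size, applying precomputed weights costs a fixed amount of work per node, which is consistent with the O(N) per-time-step scaling stated in the abstract.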

Original language	English (US)
Pages (from-to)	7133-7151
Number of pages	19
Journal	Journal of Computational Physics
Volume	231
Issue number	21
DOIs	https://doi.org/10.1016/j.jcp.2012.06.030
State	Published - Aug 30 2012
Externally published	Yes

Bibliographical note

Funding Information:
This work is supported by NSF awards DMS-#0934331 (FSU), DMS-#0934317 (NCAR) and ATM-#0602100 (NCAR).

Keywords

High-order finite differencing
Multi-GPU computing
OpenCL
Parallel computing
RBF-FD
Radial basis functions

Cite this

@article{7c8f58bacc2f40598ed023902c50d8c8,
title = "Solution to PDEs using radial basis function finite-differences (RBF-FD) on multiple GPUs",
abstract = "This paper presents parallelization strategies for the radial basis function-finite difference (RBF-FD) method. As a generalized finite differencing scheme, the RBF-FD method functions without the need for underlying meshes to structure nodes. It offers high-order accurate approximation and scales as O(N) per time step, where N is the total number of nodes. To our knowledge, this is the first implementation of the RBF-FD method to leverage GPU accelerators for the solution of PDEs. Additionally, this implementation is the first to span both multiple CPUs and multiple GPUs. OpenCL kernels target the GPUs, and inter-processor communication and synchronization are managed by the Message Passing Interface (MPI). We verify our implementation of the RBF-FD method with two hyperbolic PDEs on the sphere, and demonstrate up to 9x speedup on a commodity GPU with unoptimized kernel implementations. On a high-performance cluster, the method achieves up to 7x speedup for the maximum problem size of 27,556 nodes.",
keywords = "High-order finite differencing, Multi-GPU computing, OpenCL, Parallel computing, RBF-FD, Radial basis functions",
author = "Bollig, {Evan F.} and Natasha Flyer and Gordon Erlebacher",
note = "Funding Information: This work is supported by NSF awards DMS-\#0934331 (FSU), DMS-\#0934317 (NCAR) and ATM-\#0602100 (NCAR).",
year = "2012",
month = aug,
day = "30",
doi = "10.1016/j.jcp.2012.06.030",
language = "English (US)",
volume = "231",
pages = "7133--7151",
journal = "Journal of Computational Physics",
issn = "0021-9991",
publisher = "Academic Press Inc.",
number = "21",
}

TY  - JOUR
T1  - Solution to PDEs using radial basis function finite-differences (RBF-FD) on multiple GPUs
AU  - Bollig, Evan F.
AU  - Flyer, Natasha
AU  - Erlebacher, Gordon
N1  - Funding Information: This work is supported by NSF awards DMS-#0934331 (FSU), DMS-#0934317 (NCAR) and ATM-#0602100 (NCAR).
PY  - 2012/8/30
Y1  - 2012/8/30
AB  - This paper presents parallelization strategies for the radial basis function-finite difference (RBF-FD) method. As a generalized finite differencing scheme, the RBF-FD method functions without the need for underlying meshes to structure nodes. It offers high-order accurate approximation and scales as O(N) per time step, where N is the total number of nodes. To our knowledge, this is the first implementation of the RBF-FD method to leverage GPU accelerators for the solution of PDEs. Additionally, this implementation is the first to span both multiple CPUs and multiple GPUs. OpenCL kernels target the GPUs, and inter-processor communication and synchronization are managed by the Message Passing Interface (MPI). We verify our implementation of the RBF-FD method with two hyperbolic PDEs on the sphere, and demonstrate up to 9x speedup on a commodity GPU with unoptimized kernel implementations. On a high-performance cluster, the method achieves up to 7x speedup for the maximum problem size of 27,556 nodes.
KW  - High-order finite differencing
KW  - Multi-GPU computing
KW  - OpenCL
KW  - Parallel computing
KW  - RBF-FD
KW  - Radial basis functions
UR  - http://www.scopus.com/inward/record.url?scp=84865400698&partnerID=8YFLogxK
UR  - http://www.scopus.com/inward/citedby.url?scp=84865400698&partnerID=8YFLogxK
U2  - 10.1016/j.jcp.2012.06.030
DO  - 10.1016/j.jcp.2012.06.030
M3  - Article
AN  - SCOPUS:84865400698
SN  - 0021-9991
VL  - 231
SP  - 7133
EP  - 7151
JO  - Journal of Computational Physics
JF  - Journal of Computational Physics
IS  - 21
ER  - 
