Withdraw
Loading…
Polynomial reduction with full domain decomposition preconditioner for spectral element poisson solvers
Bello-Maldonado, Pedro D.
Loading…
Permalink
https://hdl.handle.net/2142/117701
Description
- Title
- Polynomial reduction with full domain decomposition preconditioner for spectral element poisson solvers
- Author(s)
- Bello-Maldonado, Pedro D.
- Issue Date
- 2022-08-04
- Director of Research (if dissertation) or Advisor (if thesis)
- Fischer, Paul F
- Doctoral Committee Chair(s)
- Fischer, Paul F
- Committee Member(s)
- Olson, Luke N
- Kloeckner, Andreas
- Kolev, Tzanio
- Department of Study
- Computer Science
- Discipline
- Computer Science
- Degree Granting Institution
- University of Illinois at Urbana-Champaign
- Degree Name
- Ph.D.
- Degree Level
- Dissertation
- Keyword(s)
- Poisson
- scientific computing
- numerical analysis
- finite elements
- spectral element method
- high performance computing
- GPU
- Abstract
- Minimizing communication is central to realizing high performance for scalable execution of parallel algorithms. On GPU-based systems, iterative solvers can be prohibitively expensive without an algorithm that concentrates most of the work on the GPU devices and lightens the load on the network. This work focuses on increased local work per iteration to reduce iteration counts and thus internode communication. We present a polynomial reduction with full domain decomposition (PR+FDD) preconditioner that targets the solution of spectral-element-based Poisson problems discretized by high-order spectral elements on GPU-based exascale architectures. The algorithm constructs local composite grids by first reducing the polynomial order of the elements adjacent to the GPU-local partition, followed by progressive geometric coarsening all the way to the domain boundary. During the preconditioning step of the iterative solver, the residual is restricted to the different levels of the coarsening tree and communicated so that each processor can solve its local problem independently. Once completed, the local solutions are stitched together and the global iterative solver continues. This class of algorithms is known to achieve fast convergence at the cost of more expensive preconditioner evaluations. The added extra cost can be offset by using GPUs in order to retain an overall solver speedup. On structured domains, our method achieves a solve time of 0.39 s with 8 billion DOFs on 4096 GPUs, surpassing the 0.54 s of geometric-multigrid (GMG) for the same problem. On unstructured domains, we demonstrate the effectiveness of the preconditioner in reducing the number of outer iterations, achieving a 1.5-3 times reduction compared to low-order preconditioning on different computational domains. Strong and weak scaling results are presented along with timing data for each algorithm component.
- Graduation Semester
- 2022-12
- Type of Resource
- Thesis
- Copyright and License Information
- Copyright 2022 Pedro Bello-Maldonado
Owning Collections
Graduate Dissertations and Theses at Illinois PRIMARY
Graduate Theses and Dissertations at IllinoisManage Files
Loading…
Edit Collection Membership
Loading…
Edit Metadata
Loading…
Edit Properties
Loading…
Embargoes
Loading…