Publication
Title
A high arithmetic intensity Krylov subspace method based on stencil compiler programs
Author
Abstract
Stencil calculations and matrix-free Krylov subspace solvers represent important components of many scientific computing applications. In these solvers, stencil applications are often the dominant part of the computation; an efficient parallel implementation of the kernel is therefore crucial to reduce the time to solution. Inspired by polynomial preconditioning, we remove upper bounds on the arithmetic intensity of the Krylov subspace building block by replacing the matrix with a higher-degree matrix polynomial. Using the latest state-of-the-art stencil compiler programs with temporal blocking, reduced memory bandwidth usage and, consequently, better utilization of SIMD vectorization and thus speedup on modern hardware, we are able to obtain performance improvements for higher polynomial degrees than simpler cache-blocking approaches have yielded in the past, demonstrating the new appeal of polynomial techniques on emerging architectures. We present results in a shared-memory environment and an extension to a distributed-memory environment with local shared memory.
Language
English
Source (journal)
Lecture notes in computer science. - Berlin, 1973, currens
Source (book)
3rd International Conference on High Performance Computing in Science, and Engineering (HPCSE), MAY 22-25, 2017, Karolinka, CZECH REPUBLIC
Publication
Cham : Springer international publishing ag , 2018
ISBN
978-3-319-97136-0
978-3-319-97135-3
978-3-319-97135-3
DOI
10.1007/978-3-319-97136-0_1
Volume/pages
11087 (2018) , p. 1-18
ISI
000469334300001
Full text (Publisher's DOI)
UAntwerpen
Faculty/Department
Research group
Project info
Exascale Algorithms and Advanced Computational Techniques (EXA2CT).
Publication type
Subject
Affiliation
Publications with a UAntwerp address
External links
Web of Science
Record
Identifier
Creation 25.06.2019
Last edited 28.10.2024
To cite this reference