Using multiple graphics accelerators to solve the two-dimensional inverse heat conduction problem

In this paper, we present a novel parallel algorithm implemented on graphics accelerators to solve the two-dimensional inverse heat conduction problem (estimation of the temporo-spatial heat transfer coefficients without any prior knowledge of the thermal boundary conditions) based on Particle Swarm...

Full description

Saved in:
Bibliographic Details
Published in:Computer methods in applied mechanics and engineering Vol. 336; pp. 286 - 303
Main Authors: Szénási, Sándor, Felde, Imre
Format: Journal Article
Language:English
Published: Amsterdam Elsevier B.V 01.07.2018
Elsevier BV
Subjects:
ISSN:0045-7825, 1879-2138
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:In this paper, we present a novel parallel algorithm implemented on graphics accelerators to solve the two-dimensional inverse heat conduction problem (estimation of the temporo-spatial heat transfer coefficients without any prior knowledge of the thermal boundary conditions) based on Particle Swarm Optimisation method. In the absence of analytical solutions, there are several heuristic methods to solve the problem, but the unacceptable high runtime (several days) makes these unsuitable for practical use. This paper presents the methods on how to adapt the original sequential algorithm to an efficient data-parallel one, keeping in mind the main features of graphics processing units (launching multiple threads on all multiprocessors, storing data in fast on-chip memory, eliminating warp divergence and memory transfer latency, using the host and device together, etc.). The achieved ∼45× speed-up (without any accuracy degradation) makes the heuristic methods suitable for practical use. Some of the proposed ideas are generally useable; therefore, this paper can be considered a step-by-step guide for researchers of other fields to speed-up general purpose calculations and evaluate the results. •Our goal is to solve the two-dimensional IHCP using heuristic optimisation methods.•A multi-level parallel algorithm implemented on GPU and CPU cores is presented.•It is 100x faster than the original sequential one (the accuracy is the same).•Details of optimisation (memory transfers, warp divergence, etc.) are discussed.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:0045-7825
1879-2138
DOI:10.1016/j.cma.2018.03.024