GronOR: Massively parallel and GPU-accelerated non-orthogonal configuration interaction for large molecular systems

GronOR is a program package for non-orthogonal configuration interaction calculations for an electronic wave function built in terms of anti-symmetrized products of multi-configuration molecular fragment wave functions. The two-electron integrals that have to be processed may be expressed in terms o...

Full description

Saved in:
Bibliographic Details
Published in:The Journal of chemical physics Vol. 152; no. 6; p. 064111
Main Authors: Straatsma, T P, Broer, R, Faraji, S, Havenith, R W A, Suarez, L E Aguilar, Kathir, R K, Wibowo, M, de Graaf, C
Format: Journal Article
Language:English
Published: United States 14.02.2020
ISSN:1089-7690, 1089-7690
Online Access:Get more information
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:GronOR is a program package for non-orthogonal configuration interaction calculations for an electronic wave function built in terms of anti-symmetrized products of multi-configuration molecular fragment wave functions. The two-electron integrals that have to be processed may be expressed in terms of atomic orbitals or in terms of an orbital basis determined from the molecular orbitals of the fragments. The code has been specifically designed for execution on distributed memory massively parallel and Graphics Processing Unit (GPU)-accelerated computer architectures, using an MPI+OpenACC/OpenMP programming approach. The task-based execution model used in the implementation allows for linear scaling with the number of nodes on the largest pre-exascale architectures available, provides hardware fault resiliency, and enables effective execution on systems with distinct central processing unit-only and GPU-accelerated partitions. The code interfaces with existing multi-configuration electronic structure codes that provide optimized molecular fragment orbitals, configuration interaction coefficients, and the required integrals. Algorithm and implementation details, parallel and accelerated performance benchmarks, and an analysis of the sensitivity of the accuracy of results and computational performance to thresholds used in the calculations are presented.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1089-7690
1089-7690
DOI:10.1063/1.5141358