Výsledky vyhľadávania - "communication avoiding algorithm"
-
1
An efficient randomized QLP algorithm for approximating the singular value decomposition
ISSN: 0020-0255, 1872-6291Vydavateľské údaje: Elsevier Inc 01.11.2023Vydané v Information sciences (01.11.2023)“…The rank-revealing pivoted QLP decomposition approximates the computationally prohibitive singular value decomposition (SVD) via two consecutive column-pivoted…”
Získať plný text
Journal Article -
2
Parallel Tall-and-Skinny QR Factorization Based on LU-CholeskyQR Algorithm
ISSN: 2168-9253Vydavateľské údaje: IEEE 02.09.2025Vydané v Proceedings / IEEE International Conference on Cluster Computing (02.09.2025)“…We present optimal parallel QR factorization algorithms with reduced communication overhead. QR factorization is widely applied to solve various problems in…”
Získať plný text
Konferenčný príspevok.. -
3
A communication-avoiding implicit–explicit method for a free-surface ocean model
ISSN: 0021-9991, 1090-2716Vydavateľské údaje: United States Elsevier Inc 15.01.2016Vydané v Journal of computational physics (15.01.2016)“…We examine a nonlinear elimination method for the free-surface ocean equations based on barotropic–baroclinic decomposition. The two dimensional scalar…”
Získať plný text
Journal Article -
4
Unified Communication Optimization Strategies for Sparse Triangular Solver on CPU and GPU Clusters
ISSN: 2167-4337Vydavateľské údaje: ACM 11.11.2023Vydané v International Conference for High Performance Computing, Networking, Storage and Analysis (Online) (11.11.2023)“…This paper presents a unified communication optimization frame-work for sparse triangular solve (SpTRSV) algorithms on CPU and GPU clusters. The framework…”
Získať plný text
Konferenčný príspevok.. -
5
Efficient parallel CP decomposition with pairwise perturbation and multi-sweep dimension tree
ISSN: 1530-2075Vydavateľské údaje: IEEE 01.05.2021Vydané v Proceedings - IEEE International Parallel and Distributed Processing Symposium (01.05.2021)“…The widely used alternating least squares (ALS) algorithm for the canonical polyadic (CP) tensor decomposition is dominated in cost by the matricized-tensor…”
Získať plný text
Konferenčný príspevok.. -
6
Communication-Avoiding Cholesky-QR2 for Rectangular Matrices
ISSN: 1530-2075Vydavateľské údaje: IEEE 01.05.2019Vydané v Proceedings - IEEE International Parallel and Distributed Processing Symposium (01.05.2019)“…Scalable QR factorization algorithms for solving least squares and eigenvalue problems are critical given the increasing parallelism within modern machines. We…”
Získať plný text
Konferenčný príspevok.. -
7
A Communication-Avoiding 3D LU Factorization Algorithm for Sparse Matrices
ISSN: 1530-2075Vydavateľské údaje: IEEE 01.05.2018Vydané v 2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS) (01.05.2018)“…We propose a new algorithm to improve the strong scalability of right-looking sparse LU factorization on distributed memory systems. Our 3D sparse LU algorithm…”
Získať plný text
Konferenčný príspevok.. -
8
A Sparse Direct Solver for Distributed Memory Xeon Phi-Accelerated Systems
ISSN: 1530-2075Vydavateľské údaje: IEEE 01.05.2015Vydané v Proceedings - IEEE International Parallel and Distributed Processing Symposium (01.05.2015)“…This paper presents the first sparse direct solver for distributed memory systems comprising hybrid multicourse CPU and Intel Xeon Pico-processors. It builds…”
Získať plný text
Konferenčný príspevok.. -
9
A massively parallel tensor contraction framework for coupled-cluster computations
ISSN: 0743-7315, 1096-0848Vydavateľské údaje: Elsevier Inc 01.12.2014Vydané v Journal of parallel and distributed computing (01.12.2014)“…Precise calculation of molecular electronic wavefunctions by methods such as coupled-cluster requires the computation of tensor contractions, the cost of which…”
Získať plný text
Journal Article -
10
Multiscale high-order/low-order (HOLO) algorithms and applications
ISSN: 0021-9991, 1090-2716Vydavateľské údaje: Cambridge Elsevier Inc 01.02.2017Vydané v Journal of computational physics (01.02.2017)“…We review the state of the art in the formulation, implementation, and performance of so-called high-order/low-order (HOLO) algorithms for challenging…”
Získať plný text
Journal Article -
11
Communication Avoiding Block Low-Rank Parallel Multifrontal Triangular Solve with Many Right-Hand Sides
ISSN: 0895-4798, 1095-7162Vydavateľské údaje: Society for Industrial and Applied Mathematics 01.01.2024Vydané v SIAM journal on matrix analysis and applications (01.01.2024)“…Block low-rank (BLR) compression can significantly reduce the memory and time costs of parallel sparse direct solvers. In this paper, we investigate the…”
Získať plný text
Journal Article -
12
Parallel Fast Multipole Method accelerated FFT on HPC clusters
ISSN: 0167-8191, 1872-7336Vydavateľské údaje: Elsevier B.V 01.07.2021Vydané v Parallel computing (01.07.2021)“…With increasing sizes of distributed systems, there comes an increased risk of communication bottlenecks. In the past decade there has been a growing interest…”
Získať plný text
Journal Article -
13
Multiscale high-order/low-order (HOLO) algorithms and applications
ISSN: 0021-9991, 1090-2716Vydavateľské údaje: United States Elsevier 11.11.2016Vydané v Journal of computational physics (11.11.2016)“…Here, we review the state of the art in the formulation, implementation, and performance of so-called high-order/low-order (HOLO) algorithms for challenging…”
Získať plný text
Journal Article -
14
Communication-Avoiding Recursive Aggregation
ISSN: 2168-9253Vydavateľské údaje: IEEE 31.10.2023Vydané v Proceedings / IEEE International Conference on Cluster Computing (31.10.2023)“…Recursive aggregation has been of considerable interest due to its unifying a wide range of deductive-analytic workloads, including social-media mining and…”
Získať plný text
Konferenčný príspevok.. -
15
Translational process: Mathematical software perspective
ISSN: 1877-7503, 1877-7511Vydavateľské údaje: Netherlands Elsevier B.V 01.05.2021Vydané v Journal of computational science (01.05.2021)“…Each successive generation of computer architecture has brought new challenges to achieving high performance mathematical solvers, necessitating development…”
Získať plný text
Journal Article -
16
Accelerating solutions of one-dimensional unsteady PDEs with GPU-based swept time–space decomposition
ISSN: 0021-9991, 1090-2716Vydavateľské údaje: Cambridge Elsevier Inc 15.03.2018Vydané v Journal of computational physics (15.03.2018)“…•A GPU implementation of the swept time–space decomposition rule is presented.•Three versions of the scheme are considered.•The shared-memory implementation…”
Získať plný text
Journal Article -
17
Reconstructing Householder vectors from Tall-Skinny QR
ISSN: 0743-7315, 1096-0848Vydavateľské údaje: United States Elsevier Inc 01.11.2015Vydané v Journal of parallel and distributed computing (01.11.2015)“…The Tall-Skinny QR (TSQR) algorithm is more communication efficient than the standard Householder algorithm for QR decomposition of matrices with many more…”
Získať plný text
Journal Article -
18
Applying the swept rule for solving explicit partial differential equations on heterogeneous computing systems
ISSN: 0920-8542, 1573-0484Vydavateľské údaje: New York Springer US 01.02.2021Vydané v The Journal of supercomputing (01.02.2021)“…Applications that exploit the architectural details of high-performance computing (HPC) systems have become increasingly invaluable in academia and industry…”
Získať plný text
Journal Article -
19
Reducing Communication in Graph Neural Network Training
ISSN: 2167-4329Vydavateľské údaje: United States IEEE 01.11.2020Vydané v International Conference for High Performance Computing, Networking, Storage and Analysis (Online) (01.11.2020)“…Graph Neural Networks (GNNs) are powerful and flexible neural networks that use the naturally sparse connectivity information of the data. GNNs represent this…”
Získať plný text
Konferenčný príspevok.. Journal Article -
20
Communication-Avoiding Symmetric-Indefinite Factorization
ISSN: 0895-4798, 1095-7162Vydavateľské údaje: United States SIAM 01.01.2014Vydané v SIAM journal on matrix analysis and applications (01.01.2014)“…We describe and analyze a novel symmetric triangular factorization algorithm. The algorithm is essentially a block version of Aasen's triangular…”
Získať plný text
Journal Article