Search results - "Communication avoiding algorithm"
1
An efficient randomized QLP algorithm for approximating the singular value decomposition
ISSN: 0020-0255, 1872-6291
Published: Elsevier Inc 01.11.2023
Published in: Information sciences (01.11.2023)
“…The rank-revealing pivoted QLP decomposition approximates the computationally prohibitive singular value decomposition (SVD) via two consecutive column-pivoted…”
Journal Article
2
Parallel Tall-and-Skinny QR Factorization Based on LU-CholeskyQR Algorithm
ISSN: 2168-9253
Published: IEEE 02.09.2025
Published in: Proceedings / IEEE International Conference on Cluster Computing (02.09.2025)
“…We present optimal parallel QR factorization algorithms with reduced communication overhead. QR factorization is widely applied to solve various problems in…”
Conference paper
3
A communication-avoiding implicit–explicit method for a free-surface ocean model
ISSN: 0021-9991, 1090-2716
Published: United States Elsevier Inc 15.01.2016
Published in: Journal of computational physics (15.01.2016)
“…We examine a nonlinear elimination method for the free-surface ocean equations based on barotropic–baroclinic decomposition. The two dimensional scalar…”
Journal Article
4
Unified Communication Optimization Strategies for Sparse Triangular Solver on CPU and GPU Clusters
ISSN: 2167-4337
Published: ACM 11.11.2023
Published in: International Conference for High Performance Computing, Networking, Storage and Analysis (Online) (11.11.2023)
“…This paper presents a unified communication optimization framework for sparse triangular solve (SpTRSV) algorithms on CPU and GPU clusters. The framework…”
Conference paper
5
Efficient parallel CP decomposition with pairwise perturbation and multi-sweep dimension tree
ISSN: 1530-2075
Published: IEEE 01.05.2021
Published in: Proceedings - IEEE International Parallel and Distributed Processing Symposium (01.05.2021)
“…The widely used alternating least squares (ALS) algorithm for the canonical polyadic (CP) tensor decomposition is dominated in cost by the matricized-tensor…”
Conference paper
6
Communication-Avoiding Cholesky-QR2 for Rectangular Matrices
ISSN: 1530-2075
Published: IEEE 01.05.2019
Published in: Proceedings - IEEE International Parallel and Distributed Processing Symposium (01.05.2019)
“…Scalable QR factorization algorithms for solving least squares and eigenvalue problems are critical given the increasing parallelism within modern machines. We…”
Conference paper
7
A Communication-Avoiding 3D LU Factorization Algorithm for Sparse Matrices
ISSN: 1530-2075
Published: IEEE 01.05.2018
Published in: 2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS) (01.05.2018)
“…We propose a new algorithm to improve the strong scalability of right-looking sparse LU factorization on distributed memory systems. Our 3D sparse LU algorithm…”
Conference paper
8
A Sparse Direct Solver for Distributed Memory Xeon Phi-Accelerated Systems
ISSN: 1530-2075
Published: IEEE 01.05.2015
Published in: Proceedings - IEEE International Parallel and Distributed Processing Symposium (01.05.2015)
“…This paper presents the first sparse direct solver for distributed memory systems comprising hybrid multicore CPU and Intel Xeon Phi co-processors. It builds…”
Conference paper
9
A massively parallel tensor contraction framework for coupled-cluster computations
ISSN: 0743-7315, 1096-0848
Published: Elsevier Inc 01.12.2014
Published in: Journal of parallel and distributed computing (01.12.2014)
“…Precise calculation of molecular electronic wavefunctions by methods such as coupled-cluster requires the computation of tensor contractions, the cost of which…”
Journal Article
10
Multiscale high-order/low-order (HOLO) algorithms and applications
ISSN: 0021-9991, 1090-2716
Published: Cambridge Elsevier Inc 01.02.2017
Published in: Journal of computational physics (01.02.2017)
“…We review the state of the art in the formulation, implementation, and performance of so-called high-order/low-order (HOLO) algorithms for challenging…”
Journal Article
11
Communication Avoiding Block Low-Rank Parallel Multifrontal Triangular Solve with Many Right-Hand Sides
ISSN: 0895-4798, 1095-7162
Published: Society for Industrial and Applied Mathematics 01.01.2024
Published in: SIAM journal on matrix analysis and applications (01.01.2024)
“…Block low-rank (BLR) compression can significantly reduce the memory and time costs of parallel sparse direct solvers. In this paper, we investigate the…”
Journal Article
12
Parallel Fast Multipole Method accelerated FFT on HPC clusters
ISSN: 0167-8191, 1872-7336
Published: Elsevier B.V 01.07.2021
Published in: Parallel computing (01.07.2021)
“…With increasing sizes of distributed systems, there comes an increased risk of communication bottlenecks. In the past decade there has been a growing interest…”
Journal Article
13
Multiscale high-order/low-order (HOLO) algorithms and applications
ISSN: 0021-9991, 1090-2716
Published: United States Elsevier 11.11.2016
Published in: Journal of computational physics (11.11.2016)
“…Here, we review the state of the art in the formulation, implementation, and performance of so-called high-order/low-order (HOLO) algorithms for challenging…”
Journal Article
14
Communication-Avoiding Recursive Aggregation
ISSN: 2168-9253
Published: IEEE 31.10.2023
Published in: Proceedings / IEEE International Conference on Cluster Computing (31.10.2023)
“…Recursive aggregation has been of considerable interest due to its unifying a wide range of deductive-analytic workloads, including social-media mining and…”
Conference paper
15
Translational process: Mathematical software perspective
ISSN: 1877-7503, 1877-7511
Published: Netherlands Elsevier B.V 01.05.2021
Published in: Journal of computational science (01.05.2021)
“…Each successive generation of computer architecture has brought new challenges to achieving high performance mathematical solvers, necessitating development…”
Journal Article
16
Accelerating solutions of one-dimensional unsteady PDEs with GPU-based swept time–space decomposition
ISSN: 0021-9991, 1090-2716
Published: Cambridge Elsevier Inc 15.03.2018
Published in: Journal of computational physics (15.03.2018)
“…• A GPU implementation of the swept time–space decomposition rule is presented. • Three versions of the scheme are considered. • The shared-memory implementation…”
Journal Article
17
Reconstructing Householder vectors from Tall-Skinny QR
ISSN: 0743-7315, 1096-0848
Published: United States Elsevier Inc 01.11.2015
Published in: Journal of parallel and distributed computing (01.11.2015)
“…The Tall-Skinny QR (TSQR) algorithm is more communication efficient than the standard Householder algorithm for QR decomposition of matrices with many more…”
Journal Article
18
Applying the swept rule for solving explicit partial differential equations on heterogeneous computing systems
ISSN: 0920-8542, 1573-0484
Published: New York Springer US 01.02.2021
Published in: The Journal of supercomputing (01.02.2021)
“…Applications that exploit the architectural details of high-performance computing (HPC) systems have become increasingly invaluable in academia and industry…”
Journal Article
19
Reducing Communication in Graph Neural Network Training
ISSN: 2167-4329
Published: United States IEEE 01.11.2020
Published in: International Conference for High Performance Computing, Networking, Storage and Analysis (Online) (01.11.2020)
“…Graph Neural Networks (GNNs) are powerful and flexible neural networks that use the naturally sparse connectivity information of the data. GNNs represent this…”
Conference paper, Journal Article
20
Communication-Avoiding Symmetric-Indefinite Factorization
ISSN: 0895-4798, 1095-7162
Published: United States SIAM 01.01.2014
Published in: SIAM journal on matrix analysis and applications (01.01.2014)
“…We describe and analyze a novel symmetric triangular factorization algorithm. The algorithm is essentially a block version of Aasen's triangular…”
Journal Article