Suchergebnisse - "Communication-avoiding algorithm"

  1. 1

    Parallel Communication-Avoiding Algorithm for Triangular Matrix Inversion on Homogeneous and Heterogeneous Platforms von Mahfoudhi, Ryma, Mahjoub, Zaher, Nasri, Wahid

    ISSN: 0885-7458, 1573-7640
    Veröffentlicht: Boston Springer US 01.08.2015
    Veröffentlicht in International journal of parallel programming (01.08.2015)
    “… We address in this paper the parallelization of a recursive algorithm for large scale triangular matrix inversion based on the ‘Divide and Conquer’ (D&C) …”
    Volltext
    Journal Article
  2. 2

    An efficient randomized QLP algorithm for approximating the singular value decomposition von Kaloorazi, M.F., Liu, K., Chen, J., de Lamare, R.C.

    ISSN: 0020-0255, 1872-6291
    Veröffentlicht: Elsevier Inc 01.11.2023
    Veröffentlicht in Information sciences (01.11.2023)
    “… The rank-revealing pivoted QLP decomposition approximates the computationally prohibitive singular value decomposition (SVD) via two consecutive column-pivoted …”
    Volltext
    Journal Article
  3. 3

    Parallel Tall-and-Skinny QR Factorization Based on LU-CholeskyQR Algorithm von Uchino, Yuki, Imamura, Toshiyuki

    ISSN: 2168-9253
    Veröffentlicht: IEEE 02.09.2025
    “… We present optimal parallel QR factorization algorithms with reduced communication overhead. QR factorization is widely applied to solve various problems in …”
    Volltext
    Tagungsbericht
  4. 4

    EA4RCA:Efficient AIE accelerator design framework for Regular Communication-Avoiding Algorithm von Zhang, W B, Liu, Y Q, Zang, T H, Bao, Z S

    ISSN: 2331-8422
    Veröffentlicht: Ithaca Cornell University Library, arXiv.org 09.07.2024
    Veröffentlicht in arXiv.org (09.07.2024)
    “… With the introduction of the Adaptive Intelligence Engine (AIE), the Versal Adaptive Compute Acceleration Platform (Versal ACAP) has garnered great attention …”
    Volltext
    Paper
  5. 5

    A communication-avoiding implicit–explicit method for a free-surface ocean model von Newman, Christopher, Womeldorff, Geoffrey, Knoll, Dana A., Chacón, Luis

    ISSN: 0021-9991, 1090-2716
    Veröffentlicht: United States Elsevier Inc 15.01.2016
    Veröffentlicht in Journal of computational physics (15.01.2016)
    “… We examine a nonlinear elimination method for the free-surface ocean equations based on barotropic–baroclinic decomposition. The two dimensional scalar …”
    Volltext
    Journal Article
  6. 6

    CholeskyQR2: a simple and communication-avoiding algorithm for computing a tall-skinny QR factorization on a large-scale parallel system von Fukaya, Takeshi, Nakatsukasa, Yuji, Yanagisawa, Yuka, Yamamoto, Yusaku

    ISBN: 1479975621, 9781479975624
    Veröffentlicht: Piscataway, NJ, USA IEEE Press 01.11.2014
    “… Designing communication-avoiding algorithms is crucial for high performance computing on a large-scale parallel system …”
    Volltext
    Tagungsbericht
  7. 7

    Unified Communication Optimization Strategies for Sparse Triangular Solver on CPU and GPU Clusters von Liu, Yang, Ding, Nan, Sao, Piyush, Williams, Samuel, Li, Xiaoye Sherry

    ISSN: 2167-4337
    Veröffentlicht: ACM 11.11.2023
    “… This paper presents a unified communication optimization frame-work for sparse triangular solve (SpTRSV) algorithms on CPU and GPU clusters. The framework …”
    Volltext
    Tagungsbericht
  8. 8

    Efficient parallel CP decomposition with pairwise perturbation and multi-sweep dimension tree von Ma, Linjian, Solomonik, Edgar

    ISSN: 1530-2075
    Veröffentlicht: IEEE 01.05.2021
    “… The widely used alternating least squares (ALS) algorithm for the canonical polyadic (CP) tensor decomposition is dominated in cost by the matricized-tensor …”
    Volltext
    Tagungsbericht
  9. 9

    Communication-Avoiding Cholesky-QR2 for Rectangular Matrices von Hutter, Edward, Solomonik, Edgar

    ISSN: 1530-2075
    Veröffentlicht: IEEE 01.05.2019
    “… Scalable QR factorization algorithms for solving least squares and eigenvalue problems are critical given the increasing parallelism within modern machines. We …”
    Volltext
    Tagungsbericht
  10. 10

    A Communication-Avoiding 3D LU Factorization Algorithm for Sparse Matrices von Sao, Piyush, Li, Xiaoye Sherry, Vuduc, Richard

    ISSN: 1530-2075
    Veröffentlicht: IEEE 01.05.2018
    “… We propose a new algorithm to improve the strong scalability of right-looking sparse LU factorization on distributed memory systems. Our 3D sparse LU algorithm …”
    Volltext
    Tagungsbericht
  11. 11

    A Sparse Direct Solver for Distributed Memory Xeon Phi-Accelerated Systems von Sao, Piyush, Xing Liu, Vuduc, Richard, Xiaoye Li

    ISSN: 1530-2075
    Veröffentlicht: IEEE 01.05.2015
    “… This paper presents the first sparse direct solver for distributed memory systems comprising hybrid multicourse CPU and Intel Xeon Pico-processors. It builds …”
    Volltext
    Tagungsbericht
  12. 12

    Communication Avoiding Algorithms: Analysis and Code Generation for Parallel Systems von Murthy, Karthik, Mellor-Crummey, John

    ISSN: 1089-795X
    Veröffentlicht: IEEE 01.10.2015
    “… The class of .5D communication-avoiding algorithms were developed to address this bottleneck …”
    Volltext
    Tagungsbericht
  13. 13

    A massively parallel tensor contraction framework for coupled-cluster computations von Solomonik, Edgar, Matthews, Devin, Hammond, Jeff R., Stanton, John F., Demmel, James

    ISSN: 0743-7315, 1096-0848
    Veröffentlicht: Elsevier Inc 01.12.2014
    Veröffentlicht in Journal of parallel and distributed computing (01.12.2014)
    “… Precise calculation of molecular electronic wavefunctions by methods such as coupled-cluster requires the computation of tensor contractions, the cost of which …”
    Volltext
    Journal Article
  14. 14

    A Class of Communication-avoiding Algorithms for Solving General Dense Linear Systems on CPU/GPU Parallel Machines von Baboulin, Marc, Donfack, Simplice, Dongarra, Jack, Grigori, Laura, Rémy, Adrien, Tomov, Stanimire

    ISSN: 1877-0509, 1877-0509
    Veröffentlicht: Elsevier B.V 2012
    Veröffentlicht in Procedia computer science (2012)
    “… We study several solvers for the solution of general linear systems where the main objective is to reduce the communication overhead due to pivoting. We first …”
    Volltext
    Journal Article
  15. 15

    Communication-Avoiding Algorithms for a High-Performance Hyperbolic Pde Engine von Charrier, Dominic Etienne

    Veröffentlicht: ProQuest Dissertations & Theses 01.01.2020
    “… The study of waves has always been an important subject of research. Earthquakes, for example, have a direct impact on the daily lives of millions of people …”
    Volltext
    Dissertation
  16. 16

    Multiscale high-order/low-order (HOLO) algorithms and applications von Chacón, L., Chen, G., Knoll, D.A., Newman, C., Park, H., Taitano, W., Willert, J.A., Womeldorff, G.

    ISSN: 0021-9991, 1090-2716
    Veröffentlicht: Cambridge Elsevier Inc 01.02.2017
    Veröffentlicht in Journal of computational physics (01.02.2017)
    “… We review the state of the art in the formulation, implementation, and performance of so-called high-order/low-order (HOLO) algorithms for challenging …”
    Volltext
    Journal Article
  17. 17

    Communication Avoiding Block Low-Rank Parallel Multifrontal Triangular Solve with Many Right-Hand Sides von Amestoy, Patrick, Boiteau, Olivier, Buttari, Alfredo, Gerest, Matthieu, Jézéquel, Fabienne, L’Excellent, Jean-Yves, Mary, Theo

    ISSN: 0895-4798, 1095-7162
    Veröffentlicht: Society for Industrial and Applied Mathematics 01.01.2024
    Veröffentlicht in SIAM journal on matrix analysis and applications (01.01.2024)
    “… Block low-rank (BLR) compression can significantly reduce the memory and time costs of parallel sparse direct solvers. In this paper, we investigate the …”
    Volltext
    Journal Article
  18. 18

    Parallel Fast Multipole Method accelerated FFT on HPC clusters von Mehta, Chahak, Karthi, Amarnath, Jetly, Vishrut, Chaudhury, Bhaskar

    ISSN: 0167-8191, 1872-7336
    Veröffentlicht: Elsevier B.V 01.07.2021
    Veröffentlicht in Parallel computing (01.07.2021)
    “… In the past decade there has been a growing interest in communication-avoiding algorithms. The distributed memory Fast Fourier Transform is an important algorithm which suffers from major communication bottlenecks …”
    Volltext
    Journal Article
  19. 19

    Communication avoiding algorithms von Demmel, Jim

    ISBN: 9781467362184, 1467362182
    Veröffentlicht: IEEE 01.11.2012
    “… This article consists of a collection of slides from the author's conference presentation. Some of the specific areas/topics discussed include: To redesign …”
    Volltext
    Tagungsbericht
  20. 20

    Multiscale high-order/low-order (HOLO) algorithms and applications von Chacon, Luis, Chen, Guangye, Knoll, Dana Alan, Newman, Christopher Kyle, Park, HyeongKae, Taitano, William, Willert, Jeff A., Womeldorff, Geoffrey Alan

    ISSN: 0021-9991, 1090-2716
    Veröffentlicht: United States Elsevier 11.11.2016
    Veröffentlicht in Journal of computational physics (11.11.2016)
    “… Here, we review the state of the art in the formulation, implementation, and performance of so-called high-order/low-order (HOLO) algorithms for challenging …”
    Volltext
    Journal Article