Search Results - Communication-avoiding algorithm*

Refine Results
  1. 1

    A communication-avoiding 3D algorithm for sparse LU factorization on heterogeneous systems by Sao, Piyush, Li, Xiaoye S., Vuduc, Richard

    ISSN: 0743-7315, 1096-0848
    Published: United States Elsevier Inc 01.09.2019
    “…We propose a new algorithm to improve the strong scalability of right-looking sparse LU factorization on distributed memory systems…”
    Get full text
    Journal Article
  2. 2

    Communication Avoiding Algorithms: Analysis and Code Generation for Parallel Systems by Murthy, Karthik, Mellor-Crummey, John

    ISSN: 1089-795X
    Published: IEEE 01.10.2015
    “… The class of .5D communication-avoiding algorithms were developed to address this bottleneck…”
    Get full text
    Conference Proceeding
  3. 3

    Communication avoiding algorithms by Demmel, Jim

    ISBN: 9781467362184, 1467362182
    Published: IEEE 01.11.2012
    “… Some of the specific areas/topics discussed include: To redesign algorithms to avoid communication between all memory hierarchy levels…”
    Get full text
    Conference Proceeding
  4. 4

    A communication-avoiding 3D algorithm for sparse LU factorization on heterogeneous systems by Sao, Piyush, Li, Xiaoye Sherry, Vuduc, Richard

    ISSN: 0743-7315, 1096-0848
    Published: United States Elsevier 19.08.2019
    “…We propose a new algorithm to improve the strong scalability of right-looking sparse LU factorization on distributed memory systems…”
    Get full text
    Journal Article
  5. 5

    An efficient randomized QLP algorithm for approximating the singular value decomposition by Kaloorazi, M.F., Liu, K., Chen, J., de Lamare, R.C.

    ISSN: 0020-0255, 1872-6291
    Published: Elsevier Inc 01.11.2023
    Published in Information sciences (01.11.2023)
    “…The rank-revealing pivoted QLP decomposition approximates the computationally prohibitive singular value decomposition (SVD) via two consecutive column-pivoted…”
    Get full text
    Journal Article
  6. 6

    Parallel Communication-Avoiding Algorithm for Triangular Matrix Inversion on Homogeneous and Heterogeneous Platforms by Mahfoudhi, Ryma, Mahjoub, Zaher, Nasri, Wahid

    ISSN: 0885-7458, 1573-7640
    Published: Boston Springer US 01.08.2015
    “… Afterwards, we develop in the second part of the paper, an optimal parallel avoiding-communication algorithm for a given number of available homogeneous and heterogeneous processors…”
    Get full text
    Journal Article
  7. 7

    Multiscale high-order/low-order (HOLO) algorithms and applications by Chacón, L., Chen, G., Knoll, D.A., Newman, C., Park, H., Taitano, W., Willert, J.A., Womeldorff, G.

    ISSN: 0021-9991, 1090-2716
    Published: Cambridge Elsevier Inc 01.02.2017
    Published in Journal of computational physics (01.02.2017)
    “…) algorithms for challenging multiscale problems. HOLO algorithms attempt to couple one or several high-complexity physical models…”
    Get full text
    Journal Article
  8. 8

    Communication Avoiding Block Low-Rank Parallel Multifrontal Triangular Solve with Many Right-Hand Sides by Amestoy, Patrick, Boiteau, Olivier, Buttari, Alfredo, Gerest, Matthieu, Jézéquel, Fabienne, L’Excellent, Jean-Yves, Mary, Theo

    ISSN: 0895-4798, 1095-7162
    Published: Society for Industrial and Applied Mathematics 01.01.2024
    “…Block low-rank (BLR) compression can significantly reduce the memory and time costs of parallel sparse direct solvers. In this paper, we investigate the…”
    Get full text
    Journal Article
  9. 9

    A massively parallel tensor contraction framework for coupled-cluster computations by Solomonik, Edgar, Matthews, Devin, Hammond, Jeff R., Stanton, John F., Demmel, James

    ISSN: 0743-7315, 1096-0848
    Published: Elsevier Inc 01.12.2014
    “…Precise calculation of molecular electronic wavefunctions by methods such as coupled-cluster requires the computation of tensor contractions, the cost of which…”
    Get full text
    Journal Article
  10. 10

    A Class of Communication-avoiding Algorithms for Solving General Dense Linear Systems on CPU/GPU Parallel Machines by Baboulin, Marc, Donfack, Simplice, Dongarra, Jack, Grigori, Laura, Rémy, Adrien, Tomov, Stanimire

    ISSN: 1877-0509, 1877-0509
    Published: Elsevier B.V 2012
    Published in Procedia computer science (2012)
    “… Then we introduce a solver where the panel factorization is performed using a communication-avoiding pivoting heuristic while the update of the trailing submatrix is performed by the GPU…”
    Get full text
    Journal Article
  11. 11

    Parallel Tall-and-Skinny QR Factorization Based on LU-CholeskyQR Algorithm by Uchino, Yuki, Imamura, Toshiyuki

    ISSN: 2168-9253
    Published: IEEE 02.09.2025
    “…We present optimal parallel QR factorization algorithms with reduced communication overhead…”
    Get full text
    Conference Proceeding
  12. 12

    Communication-Avoiding Recursive Aggregation by Sun, Yihao, Kumar, Sidharth, Gilray, Thomas, Micinski, Kristopher

    ISSN: 2168-9253
    Published: IEEE 31.10.2023
    “… Implementing recursive aggregation has posed a serious algorithmic challenge, with state-of-the-art work identifying sufficient conditions (e.g., pre-mappability…”
    Get full text
    Conference Proceeding
  13. 13

    Communication-Avoiding Algorithms for a High-Performance Hyperbolic Pde Engine by Charrier, Dominic Etienne

    Published: ProQuest Dissertations & Theses 01.01.2020
    “…The study of waves has always been an important subject of research. Earthquakes, for example, have a direct impact on the daily lives of millions of people…”
    Get full text
    Dissertation
  14. 14

    A communication-avoiding implicit–explicit method for a free-surface ocean model by Newman, Christopher, Womeldorff, Geoffrey, Knoll, Dana A., Chacón, Luis

    ISSN: 0021-9991, 1090-2716
    Published: United States Elsevier Inc 15.01.2016
    Published in Journal of computational physics (15.01.2016)
    “… The method is second-order accurate and scales algorithmically, with allowed timesteps much larger than fully explicit methods…”
    Get full text
    Journal Article
  15. 15

    Multiscale high-order/low-order (HOLO) algorithms and applications by Chacon, Luis, Chen, Guangye, Knoll, Dana Alan, Newman, Christopher Kyle, Park, HyeongKae, Taitano, William, Willert, Jeff A., Womeldorff, Geoffrey Alan

    ISSN: 0021-9991, 1090-2716
    Published: United States Elsevier 11.11.2016
    Published in Journal of computational physics (11.11.2016)
    “…) algorithms for challenging multiscale problems. HOLO algorithms attempt to couple one or several high-complexity physical models…”
    Get full text
    Journal Article
  16. 16

    Parallel Fast Multipole Method accelerated FFT on HPC clusters by Mehta, Chahak, Karthi, Amarnath, Jetly, Vishrut, Chaudhury, Bhaskar

    ISSN: 0167-8191, 1872-7336
    Published: Elsevier B.V 01.07.2021
    Published in Parallel computing (01.07.2021)
    “… In the past decade there has been a growing interest in communication-avoiding algorithms. The distributed memory Fast Fourier Transform is an important algorithm which suffers from major communication bottlenecks…”
    Get full text
    Journal Article
  17. 17

    Communication-Avoiding Symmetric-Indefinite Factorization by Ballard, Grey, Becker, Dulceneia, Demmel, James, Dongarra, Jack, Druinsky, Alex, Peled, Inon, Schwartz, Oded, Toledo, Sivan, Yamazaki, Ichitaro

    ISSN: 0895-4798, 1095-7162
    Published: United States SIAM 01.01.2014
    “…$ is lower triangular, and $T$ is block tridiagonal and banded. The algorithm is the first symmetric-indefinite communication-avoiding factorization…”
    Get full text
    Journal Article
  18. 18

    EA4RCA:Efficient AIE accelerator design framework for Regular Communication-Avoiding Algorithm by Zhang, W B, Liu, Y Q, Zang, T H, Bao, Z S

    ISSN: 2331-8422
    Published: Ithaca Cornell University Library, arXiv.org 09.07.2024
    Published in arXiv.org (09.07.2024)
    “…) algorithm is considered a typical application within the AIE architecture. Nevertheless, the effective utilization of AIE in CA applications remains an area that requires further exploration…”
    Get full text
    Paper
  19. 19

    Accelerating solutions of one-dimensional unsteady PDEs with GPU-based swept time–space decomposition by Magee, Daniel J., Niemeyer, Kyle E.

    ISSN: 0021-9991, 1090-2716
    Published: Cambridge Elsevier Inc 15.03.2018
    Published in Journal of computational physics (15.03.2018)
    “…•A GPU implementation of the swept time–space decomposition rule is presented.•Three versions of the scheme are considered.•The shared-memory implementation…”
    Get full text
    Journal Article
  20. 20

    Translational process: Mathematical software perspective by Dongarra, Jack, Gates, Mark, Luszczek, Piotr, Tomov, Stanimire

    ISSN: 1877-7503, 1877-7511
    Published: Netherlands Elsevier B.V 01.05.2021
    Published in Journal of computational science (01.05.2021)
    “…Each successive generation of computer architecture has brought new challenges to achieving high performance mathematical solvers, necessitating development and analysis of new algorithms…”
    Get full text
    Journal Article