Search results - Communication-avoiding algorithm~

  1.

    A communication-avoiding 3D algorithm for sparse LU factorization on heterogeneous systems by Sao, Piyush, Li, Xiaoye S., Vuduc, Richard

    ISSN: 0743-7315, 1096-0848
    Published: United States Elsevier Inc 01.09.2019
    Published in Journal of parallel and distributed computing (01.09.2019)
    “… We propose a new algorithm to improve the strong scalability of right-looking sparse LU factorization on distributed memory systems …”
    Full text
    Journal Article
  2.

    Communication Avoiding Algorithms: Analysis and Code Generation for Parallel Systems by Murthy, Karthik, Mellor-Crummey, John

    ISSN: 1089-795X
    Published: IEEE 01.10.2015
    “… The class of .5D communication-avoiding algorithms were developed to address this bottleneck …”
    Full text
    Conference Proceeding
  3.

    A communication-avoiding 3D algorithm for sparse LU factorization on heterogeneous systems by Sao, Piyush, Li, Xiaoye Sherry, Vuduc, Richard

    ISSN: 0743-7315, 1096-0848
    Published: United States Elsevier 19.08.2019
    Published in Journal of parallel and distributed computing (19.08.2019)
    “… We propose a new algorithm to improve the strong scalability of right-looking sparse LU factorization on distributed memory systems …”
    Full text
    Journal Article
  4.

    Communication-Avoiding Recursive Aggregation by Sun, Yihao, Kumar, Sidharth, Gilray, Thomas, Micinski, Kristopher

    ISSN: 2168-9253
    Published: IEEE 31.10.2023
    “… Recursive aggregation has been of considerable interest due to its unifying a wide range of deductive-analytic workloads, including social-media mining and …”
    Full text
    Conference Proceeding
  5.

    Communication avoiding algorithms by Demmel, Jim

    ISBN: 9781467362184, 1467362182
    Published: IEEE 01.11.2012
    “… Some of the specific areas/topics discussed include: To redesign algorithms to avoid communication between all memory hierarchy levels …”
    Full text
    Conference Proceeding
  6.

    Parallel Communication-Avoiding Algorithm for Triangular Matrix Inversion on Homogeneous and Heterogeneous Platforms by Mahfoudhi, Ryma, Mahjoub, Zaher, Nasri, Wahid

    ISSN: 0885-7458, 1573-7640
    Published: Boston Springer US 01.08.2015
    Published in International journal of parallel programming (01.08.2015)
    “… Afterwards, we develop in the second part of the paper, an optimal parallel avoiding-communication algorithm for a given number of available homogeneous and heterogeneous processors …”
    Full text
    Journal Article
  7.

    Communication Avoiding Block Low-Rank Parallel Multifrontal Triangular Solve with Many Right-Hand Sides by Amestoy, Patrick, Boiteau, Olivier, Buttari, Alfredo, Gerest, Matthieu, Jézéquel, Fabienne, L’Excellent, Jean-Yves, Mary, Theo

    ISSN: 0895-4798, 1095-7162
    Published: Society for Industrial and Applied Mathematics 01.01.2024
    Published in SIAM journal on matrix analysis and applications (01.01.2024)
    “… Block low-rank (BLR) compression can significantly reduce the memory and time costs of parallel sparse direct solvers. In this paper, we investigate the …”
    Full text
    Journal Article
  8.

    An efficient randomized QLP algorithm for approximating the singular value decomposition by Kaloorazi, M.F., Liu, K., Chen, J., de Lamare, R.C.

    ISSN: 0020-0255, 1872-6291
    Published: Elsevier Inc 01.11.2023
    Published in Information sciences (01.11.2023)
    “… The rank-revealing pivoted QLP decomposition approximates the computationally prohibitive singular value decomposition (SVD) via two consecutive column-pivoted …”
    Full text
    Journal Article
  9.

    A communication-avoiding implicit–explicit method for a free-surface ocean model by Newman, Christopher, Womeldorff, Geoffrey, Knoll, Dana A., Chacón, Luis

    ISSN: 0021-9991, 1090-2716
    Published: United States Elsevier Inc 15.01.2016
    Published in Journal of computational physics (15.01.2016)
    “… Moreover, the hierarchical nature of the algorithm lends itself readily to emerging architectures …”
    Full text
    Journal Article
  10.

    Multiscale high-order/low-order (HOLO) algorithms and applications by Chacón, L., Chen, G., Knoll, D.A., Newman, C., Park, H., Taitano, W., Willert, J.A., Womeldorff, G.

    ISSN: 0021-9991, 1090-2716
    Published: Cambridge Elsevier Inc 01.02.2017
    Published in Journal of computational physics (01.02.2017)
    “… ) algorithms for challenging multiscale problems. HOLO algorithms attempt to couple one or several high-complexity physical models …”
    Full text
    Journal Article
  11.

    A massively parallel tensor contraction framework for coupled-cluster computations by Solomonik, Edgar, Matthews, Devin, Hammond, Jeff R., Stanton, John F., Demmel, James

    ISSN: 0743-7315, 1096-0848
    Published: Elsevier Inc 01.12.2014
    Published in Journal of parallel and distributed computing (01.12.2014)
    “… Precise calculation of molecular electronic wavefunctions by methods such as coupled-cluster requires the computation of tensor contractions, the cost of which …”
    Full text
    Journal Article
  12.

    Communication-Avoiding Symmetric-Indefinite Factorization by Ballard, Grey, Becker, Dulceneia, Demmel, James, Dongarra, Jack, Druinsky, Alex, Peled, Inon, Schwartz, Oded, Toledo, Sivan, Yamazaki, Ichitaro

    ISSN: 0895-4798, 1095-7162
    Published: United States SIAM 01.01.2014
    Published in SIAM journal on matrix analysis and applications (01.01.2014)
    “… $ is lower triangular, and $T$ is block tridiagonal and banded. The algorithm is the first symmetric-indefinite communication-avoiding factorization …”
    Full text
    Journal Article
  13.

    A Class of Communication-avoiding Algorithms for Solving General Dense Linear Systems on CPU/GPU Parallel Machines by Baboulin, Marc, Donfack, Simplice, Dongarra, Jack, Grigori, Laura, Rémy, Adrien, Tomov, Stanimire

    ISSN: 1877-0509
    Published: Elsevier B.V 2012
    Published in Procedia computer science (2012)
    “… Then we introduce a solver where the panel factorization is performed using a communication-avoiding pivoting heuristic while the update of the trailing submatrix is performed by the GPU …”
    Full text
    Journal Article
  14.

    Parallel Tall-and-Skinny QR Factorization Based on LU-CholeskyQR Algorithm by Uchino, Yuki, Imamura, Toshiyuki

    ISSN: 2168-9253
    Published: IEEE 02.09.2025
    “… We present optimal parallel QR factorization algorithms with reduced communication overhead …”
    Full text
    Conference Proceeding
  15.

    Communication-Avoiding Algorithms for a High-Performance Hyperbolic PDE Engine by Charrier, Dominic Etienne

    Published: ProQuest Dissertations & Theses 01.01.2020
    “… The study of waves has always been an important subject of research. Earthquakes, for example, have a direct impact on the daily lives of millions of people …”
    Full text
    Dissertation
  16.

    Multiscale high-order/low-order (HOLO) algorithms and applications by Chacon, Luis, Chen, Guangye, Knoll, Dana Alan, Newman, Christopher Kyle, Park, HyeongKae, Taitano, William, Willert, Jeff A., Womeldorff, Geoffrey Alan

    ISSN: 0021-9991, 1090-2716
    Published: United States Elsevier 11.11.2016
    Published in Journal of computational physics (11.11.2016)
    “… ) algorithms for challenging multiscale problems. HOLO algorithms attempt to couple one or several high-complexity physical models …”
    Full text
    Journal Article
  17.

    EA4RCA: Efficient AIE accelerator design framework for Regular Communication-Avoiding Algorithm by Zhang, W B, Liu, Y Q, Zang, T H, Bao, Z S

    ISSN: 2331-8422
    Published: Ithaca Cornell University Library, arXiv.org 09.07.2024
    Published in arXiv.org (09.07.2024)
    “… ) algorithm is considered a typical application within the AIE architecture. Nevertheless, the effective utilization of AIE in CA applications remains an area that requires further exploration …”
    Full text
    Paper
  18.

    Parallel Fast Multipole Method accelerated FFT on HPC clusters by Mehta, Chahak, Karthi, Amarnath, Jetly, Vishrut, Chaudhury, Bhaskar

    ISSN: 0167-8191, 1872-7336
    Published: Elsevier B.V 01.07.2021
    Published in Parallel computing (01.07.2021)
    “… In the past decade there has been a growing interest in communication-avoiding algorithms. The distributed memory Fast Fourier Transform is an important algorithm which suffers from major communication bottlenecks …”
    Full text
    Journal Article
  19.

    Translational process: Mathematical software perspective by Dongarra, Jack, Gates, Mark, Luszczek, Piotr, Tomov, Stanimire

    ISSN: 1877-7503, 1877-7511
    Published: Netherlands Elsevier B.V 01.05.2021
    Published in Journal of computational science (01.05.2021)
    “… Each successive generation of computer architecture has brought new challenges to achieving high performance mathematical solvers, necessitating development and analysis of new algorithms …”
    Full text
    Journal Article
  20.

    Accelerating solutions of one-dimensional unsteady PDEs with GPU-based swept time–space decomposition by Magee, Daniel J., Niemeyer, Kyle E.

    ISSN: 0021-9991, 1090-2716
    Published: Cambridge Elsevier Inc 15.03.2018
    Published in Journal of computational physics (15.03.2018)
    “… • A GPU implementation of the swept time–space decomposition rule is presented. • Three versions of the scheme are considered. • The shared-memory implementation …”
    Full text
    Journal Article