Výsledky vyhledávání - "Communication-avoiding algorithms"

  1. 1

    A massively parallel tensor contraction framework for coupled-cluster computations Autor Solomonik, Edgar, Matthews, Devin, Hammond, Jeff R., Stanton, John F., Demmel, James

    ISSN: 0743-7315, 1096-0848
    Vydáno: Elsevier Inc 01.12.2014
    “…Precise calculation of molecular electronic wavefunctions by methods such as coupled-cluster requires the computation of tensor contractions, the cost of which…”
    Získat plný text
    Journal Article
  2. 2

    Multiscale high-order/low-order (HOLO) algorithms and applications Autor Chacón, L., Chen, G., Knoll, D.A., Newman, C., Park, H., Taitano, W., Willert, J.A., Womeldorff, G.

    ISSN: 0021-9991, 1090-2716
    Vydáno: Cambridge Elsevier Inc 01.02.2017
    Vydáno v Journal of computational physics (01.02.2017)
    “…We review the state of the art in the formulation, implementation, and performance of so-called high-order/low-order (HOLO) algorithms for challenging…”
    Získat plný text
    Journal Article
  3. 3

    Communication Avoiding Block Low-Rank Parallel Multifrontal Triangular Solve with Many Right-Hand Sides Autor Amestoy, Patrick, Boiteau, Olivier, Buttari, Alfredo, Gerest, Matthieu, Jézéquel, Fabienne, L’Excellent, Jean-Yves, Mary, Theo

    ISSN: 0895-4798, 1095-7162
    Vydáno: Society for Industrial and Applied Mathematics 01.01.2024
    “…Block low-rank (BLR) compression can significantly reduce the memory and time costs of parallel sparse direct solvers. In this paper, we investigate the…”
    Získat plný text
    Journal Article
  4. 4

    Parallel Fast Multipole Method accelerated FFT on HPC clusters Autor Mehta, Chahak, Karthi, Amarnath, Jetly, Vishrut, Chaudhury, Bhaskar

    ISSN: 0167-8191, 1872-7336
    Vydáno: Elsevier B.V 01.07.2021
    Vydáno v Parallel computing (01.07.2021)
    “…With increasing sizes of distributed systems, there comes an increased risk of communication bottlenecks. In the past decade there has been a growing interest…”
    Získat plný text
    Journal Article
  5. 5

    Multiscale high-order/low-order (HOLO) algorithms and applications Autor Chacon, Luis, Chen, Guangye, Knoll, Dana Alan, Newman, Christopher Kyle, Park, HyeongKae, Taitano, William, Willert, Jeff A., Womeldorff, Geoffrey Alan

    ISSN: 0021-9991, 1090-2716
    Vydáno: United States Elsevier 11.11.2016
    Vydáno v Journal of computational physics (11.11.2016)
    “…Here, we review the state of the art in the formulation, implementation, and performance of so-called high-order/low-order (HOLO) algorithms for challenging…”
    Získat plný text
    Journal Article
  6. 6

    Communication-Avoiding Recursive Aggregation Autor Sun, Yihao, Kumar, Sidharth, Gilray, Thomas, Micinski, Kristopher

    ISSN: 2168-9253
    Vydáno: IEEE 31.10.2023
    “…Recursive aggregation has been of considerable interest due to its unifying a wide range of deductive-analytic workloads, including social-media mining and…”
    Získat plný text
    Konferenční příspěvek
  7. 7

    Translational process: Mathematical software perspective Autor Dongarra, Jack, Gates, Mark, Luszczek, Piotr, Tomov, Stanimire

    ISSN: 1877-7503, 1877-7511
    Vydáno: Netherlands Elsevier B.V 01.05.2021
    Vydáno v Journal of computational science (01.05.2021)
    “…Each successive generation of computer architecture has brought new challenges to achieving high performance mathematical solvers, necessitating development…”
    Získat plný text
    Journal Article
  8. 8

    Accelerating solutions of one-dimensional unsteady PDEs with GPU-based swept time–space decomposition Autor Magee, Daniel J., Niemeyer, Kyle E.

    ISSN: 0021-9991, 1090-2716
    Vydáno: Cambridge Elsevier Inc 15.03.2018
    Vydáno v Journal of computational physics (15.03.2018)
    “…•A GPU implementation of the swept time–space decomposition rule is presented.•Three versions of the scheme are considered.•The shared-memory implementation…”
    Získat plný text
    Journal Article
  9. 9

    Reconstructing Householder vectors from Tall-Skinny QR Autor Ballard, G., Demmel, J., Grigori, L., Jacquelin, M., Knight, N., Nguyen, H.D.

    ISSN: 0743-7315, 1096-0848
    Vydáno: United States Elsevier Inc 01.11.2015
    “…The Tall-Skinny QR (TSQR) algorithm is more communication efficient than the standard Householder algorithm for QR decomposition of matrices with many more…”
    Získat plný text
    Journal Article
  10. 10

    Applying the swept rule for solving explicit partial differential equations on heterogeneous computing systems Autor Magee, Daniel J., Walker, Anthony S., Niemeyer, Kyle E.

    ISSN: 0920-8542, 1573-0484
    Vydáno: New York Springer US 01.02.2021
    Vydáno v The Journal of supercomputing (01.02.2021)
    “…Applications that exploit the architectural details of high-performance computing (HPC) systems have become increasingly invaluable in academia and industry…”
    Získat plný text
    Journal Article
  11. 11

    Reducing Communication in Graph Neural Network Training Autor Tripathy, Alok, Yelick, Katherine, Buluc, Aydin

    ISSN: 2167-4329
    Vydáno: United States IEEE 01.11.2020
    “…Graph Neural Networks (GNNs) are powerful and flexible neural networks that use the naturally sparse connectivity information of the data. GNNs represent this…”
    Získat plný text
    Konferenční příspěvek Journal Article
  12. 12

    Communication-Avoiding Symmetric-Indefinite Factorization Autor Ballard, Grey, Becker, Dulceneia, Demmel, James, Dongarra, Jack, Druinsky, Alex, Peled, Inon, Schwartz, Oded, Toledo, Sivan, Yamazaki, Ichitaro

    ISSN: 0895-4798, 1095-7162
    Vydáno: United States SIAM 01.01.2014
    “…We describe and analyze a novel symmetric triangular factorization algorithm. The algorithm is essentially a block version of Aasen's triangular…”
    Získat plný text
    Journal Article
  13. 13

    A Class of Communication-avoiding Algorithms for Solving General Dense Linear Systems on CPU/GPU Parallel Machines Autor Baboulin, Marc, Donfack, Simplice, Dongarra, Jack, Grigori, Laura, Rémy, Adrien, Tomov, Stanimire

    ISSN: 1877-0509, 1877-0509
    Vydáno: Elsevier B.V 2012
    Vydáno v Procedia computer science (2012)
    “…We study several solvers for the solution of general linear systems where the main objective is to reduce the communication overhead due to pivoting. We first…”
    Získat plný text
    Journal Article
  14. 14

    Cyclops Tensor Framework: Reducing Communication and Eliminating Load Imbalance in Massively Parallel Contractions Autor Solomonik, Edgar, Matthews, Devin, Hammond, Jeff R., Demmel, James

    ISBN: 146736066X, 9781467360661
    ISSN: 1530-2075
    Vydáno: IEEE 01.05.2013
    “…Cyclops (cyclic-operations) Tensor Framework (CTF) 1 is a distributed library for tensor contractions. CTF aims to scale high-dimensional tensor contractions…”
    Získat plný text
    Konferenční příspěvek
  15. 15

    Distributed-Memory Sparse Kernels for Machine Learning Autor Bharadwaj, Vivek, Buluc, Aydin, Demmel, James

    ISSN: 1530-2075
    Vydáno: IEEE 01.05.2022
    “…Sampled Dense Times Dense Matrix Multiplication (SDDMM) and Sparse Times Dense Matrix Multiplication (SpMM) appear in diverse settings, such as collaborative…”
    Získat plný text
    Konferenční příspěvek
  16. 16

    Communication-Avoiding Parallel Sparse-Dense Matrix-Matrix Multiplication Autor Koanantakool, Penporn, Azad, Ariful, Buluc, Aydin, Morozov, Dmitriy, Sang-Yun Oh, Oliker, Leonid, Yelick, Katherine

    ISSN: 1530-2075
    Vydáno: IEEE 01.05.2016
    “…Multiplication of a sparse matrix with a dense matrix is a building block of an increasing number of applications in many areas such as machine learning and…”
    Získat plný text
    Konferenční příspěvek
  17. 17

    Event-Triggered Communication in Parallel Computing Autor Ghosh, Soumyadip, Saha, Kamal K., Gupta, Vijay, Tryggvason, Gretar

    Vydáno: IEEE 01.11.2018
    “…Communication overhead in parallel systems can be a significant bottleneck in scaling up parallel computation. In this paper, we propose event-triggered…”
    Získat plný text
    Konferenční příspěvek
  18. 18

    Write-Avoiding Algorithms Autor Carson, Erin, Demmel, James, Grigori, Laura, Knight, Nicholas, Koanantakool, Penporn, Schwartz, Oded, Simhadri, Harsha Vardhan

    ISSN: 1530-2075
    Vydáno: IEEE 01.05.2016
    “…Communication, i.e., moving data between levels of a memory hierarchy or between processors over a network, is much more expensive (in time or energy) than…”
    Získat plný text
    Konferenční příspěvek
  19. 19

    Minimizing Communication in All-Pairs Shortest Paths Autor Solomonik, Edgar, Buluç, Aydın, Demmel, James

    ISBN: 146736066X, 9781467360661
    ISSN: 1530-2075
    Vydáno: IEEE 01.05.2013
    “…We consider distributed memory algorithms for the all-pairs shortest paths (APSP) problem. Scaling the APSP problem to high concurrencies requires both…”
    Získat plný text
    Konferenční příspěvek
  20. 20

    Recent Developments in Iterative Methods for Reducing Synchronization Autor Zou, Qinmeng, Magoules, Frederic

    ISSN: 2473-3636
    Vydáno: IEEE 01.11.2019
    “…On modern parallel architectures, the cost of synchronization among processors can often dominate the cost of floating-point computation. Several modifications…”
    Získat plný text
    Konferenční příspěvek