Výsledky vyhledávání - distributed-memory parallelism

Upřesnit hledání
  1. 1

    3D DFT by block tensor-matrix multiplication via a modified Cannon's algorithm: Implementation and scaling on distributed-memory clusters with fat tree networks Autor Malapally, Nitin, Bolnykh, Viacheslav, Suarez, Estela, Carloni, Paolo, Lippert, Thomas, Mandelli, Davide

    ISSN: 0743-7315
    Vydáno: Elsevier Inc 01.11.2024
    “…A known scalability bottleneck of the parallel 3D FFT is its use of all-to-all communications. Here, we present S3DFT, a library that circumvents this by using…”
    Získat plný text
    Journal Article
  2. 2

    A framework for exploiting task and data parallelism on distributed memory multicomputers Autor Ramaswamy, S., Sapatnekar, S., Banerjee, P.

    ISSN: 1045-9219
    Vydáno: IEEE 01.11.1997
    “… compiler and run-time support for distributed memory machines. In this paper, we explore a new compiler optimization for regular scientific applications-the simultaneous exploitation of task and data parallelism…”
    Získat plný text
    Journal Article
  3. 3

    Axially-deformed solution of the Skyrme-Hartree-Fock-Bogoliubov equations using the transformed harmonic oscillator basis (IV) hfbtho (v4.0): A new version of the program Autor Marević, P., Schunck, N., Ney, E.M., Navarro Pérez, R., Verriere, M., O'Neal, J.

    ISSN: 0010-4655, 1879-2944
    Vydáno: United States Elsevier B.V 01.07.2022
    Vydáno v Computer physics communications (01.07.2022)
    “…We describe the new version 4.0 of the code hfbtho that solves the nuclear Hartree-Fock-Bogoliubov problem by using the deformed harmonic oscillator basis in…”
    Získat plný text
    Journal Article
  4. 4
  5. 5

    Iterators, Schedulers, and Distributed-memory Parallelism Autor GRAEFE, GOETZ

    ISSN: 0038-0644, 1097-024X
    Vydáno: New York John Wiley & Sons, Ltd 01.04.1996
    Vydáno v Software, practice & experience (01.04.1996)
    “…’ for sequential and parallel query evaluation. Unfortunately, those earlier models have a severe drawback with respect to resource allocation in distributedmemory systems…”
    Získat plný text
    Journal Article
  6. 6

    Massively parallel implementation and approaches to simulate quantum dynamics using Krylov subspace techniques Autor Brenes, Marlon, Varma, Vipin Kerala, Scardicchio, Antonello, Girotto, Ivan

    ISSN: 0010-4655, 1879-2944
    Vydáno: Elsevier B.V 01.02.2019
    Vydáno v Computer physics communications (01.02.2019)
    “…We have developed an application and implemented parallel algorithms in order to provide a computational framework suitable for massively parallel…”
    Získat plný text
    Journal Article
  7. 7

    Leveraging HPC accelerator architectures with modern techniques — hydrologic modeling on GPUs with ParFlow Autor Hokkanen, Jaro, Kollet, Stefan, Kraus, Jiri, Herten, Andreas, Hrywniak, Markus, Pleiter, Dirk

    ISSN: 1420-0597, 1573-1499
    Vydáno: Cham Springer International Publishing 01.10.2021
    Vydáno v Computational geosciences (01.10.2021)
    “…Rapidly changing heterogeneous supercomputer architectures pose a great challenge to many scientific communities trying to leverage the latest technology in…”
    Získat plný text
    Journal Article
  8. 8

    MPI+X: task-based parallelisation and dynamic load balance of finite element assembly Autor Garcia-Gasulla, Marta, Houzeaux, Guillaume, Ferrer, Roger, Artigues, Antoni, López, Victor, Labarta, Jesús, Vázquez, Mariano

    ISSN: 1061-8562, 1029-0257
    Vydáno: Abingdon Taylor & Francis 16.03.2019
    “… of the MPI partitions to compute element matrices and vectors and then of their assemblies. In a MPI+X hybrid parallelism context, X has consisted traditionally of loop…”
    Získat plný text
    Journal Article
  9. 9

    Parallelization of a distributed ecohydrological model Autor Liu, Ning, Shaikh, Mohsin Ahmed, Kala, Jatin, Harper, Richard J., Dell, Bernard, Liu, Shirong, Sun, Ge

    ISSN: 1364-8152, 1873-6726
    Vydáno: Oxford Elsevier Ltd 01.03.2018
    “… High resolution simulations at a large scale are therefore computationally expensive and cause a run-time memory burden. Using distributed (MPI) and shared (OpenMP…”
    Získat plný text
    Journal Article
  10. 10

    A scalable scheduling scheme for functional parallelism on distributed memory multiprocessor systems Autor Pande, S., Agrawal, D.P., Mauney, J.

    ISSN: 1045-9219
    Vydáno: Los Alamitos, CA IEEE 01.04.1995
    “… and partially at run time. Assuming infinite number of processors, the compile time schedule is found using a new concept of the threshold of a task that quantifies a trade-off between the schedule-length and the degree of parallelism…”
    Získat plný text
    Journal Article
  11. 11

    A shared compilation stack for distributed-memory parallelism in stencil DSLs Autor Bisbas, George, Lydike, Anton, Bauer, Emilien, Brown, Nick, Fehr, Mathieu, Mitchell, Lawrence, Rodriguez-Canal, Gabriel, Jamieson, Maurice, Kelly, Paul H J, Steuwer, Michel, Grosser, Tobias

    ISSN: 2331-8422
    Vydáno: Ithaca Cornell University Library, arXiv.org 02.04.2024
    Vydáno v arXiv.org (02.04.2024)
    “…Domain Specific Languages (DSLs) increase programmer productivity and provide high performance. Their targeted abstractions allow scientists to express…”
    Získat plný text
    Paper
  12. 12

    A Robust Compile Time Method for Scheduling Task Parallelism on Distributed Memory Machines Autor Darbha, Sekhar, Pande, Santosh

    ISSN: 0920-8542, 1573-0484
    Vydáno: 01.10.1998
    Vydáno v The Journal of supercomputing (01.10.1998)
    “…A compile time scheduling algorithm for a variable number of available processors is introduced and the impact of the change of computation and communication…”
    Získat plný text
    Journal Article
  13. 13

    On the Test Particle Monte-Carlo method to solve the steady state Boltzmann equation, the congruity of its results with experiments and its potential for shared memory parallelism Autor Rondeau, Maxime, Arès, R.

    ISSN: 0021-9991, 1090-2716
    Vydáno: Cambridge Elsevier Inc 01.11.2021
    Vydáno v Journal of computational physics (01.11.2021)
    “…The Test Particle Monte Carlo is a known method to solve the steady state Boltzmann particle transport equation in rarefied gas systems. A description of the…”
    Získat plný text
    Journal Article
  14. 14

    High-Performance Sorting-Based k-mer Counting in Distributed Memory with Flexible Hybrid Parallelism Autor Li, Yifan, Guidi, Giulia

    ISSN: 2331-8422
    Vydáno: Ithaca Cornell University Library, arXiv.org 10.07.2024
    Vydáno v arXiv.org (10.07.2024)
    “… Due to the growing volume of data, the scaling of the counting process is critical. In the literature, distributed memory software uses hash tables, which exhibit poor cache friendliness and consume excessive memory…”
    Získat plný text
    Paper
  15. 15

    CAPTURE: Memory-Centric Partitioning for Distributed DNN Training with Hybrid Parallelism Autor Dreuning, Henk, Verstoep, Kees, Bal, Henri E., van Nieuwpoort, Rob V.

    ISSN: 2640-0316
    Vydáno: IEEE 18.12.2023
    “… Hybrid-parallel training approaches have emerged that combine pipelining with data and tensor parallelism to facilitate the training of large DL models on distributed hardware setups…”
    Získat plný text
    Konferenční příspěvek
  16. 16

    A study of shared-memory parallelism in a multifrontal solver Autor L’Excellent, Jean-Yves, Sid-Lakhdar, Wissam M.

    ISSN: 0167-8191, 1872-7336
    Vydáno: Elsevier B.V 01.03.2014
    Vydáno v Parallel computing (01.03.2014)
    “… We introduce shared-memory parallelism in a parallel distributed-memory solver, targeting multi-core architectures…”
    Získat plný text
    Journal Article
  17. 17

    A robust compile time method for scheduling task parallelism on distributed memory machines Autor Darbha, S., Pande, S.

    ISBN: 9780818676338, 0818676337
    ISSN: 1089-795X
    Vydáno: IEEE 1996
    “…A desirable property of a compile time scheduling algorithm is robustness against the variations in the computation and communication costs so that the run…”
    Získat plný text
    Konferenční příspěvek
  18. 18

    Reservoir Echo State Network for Classification of Multivariate Time Series Autor Purkayastha, Basab Bijoy, Barma, Shovan

    ISSN: 2770-0135
    Vydáno: IEEE 18.12.2023
    “… It leverages both CPU-shared memory and parallel distributed memory architecture to efficiently capture reservoir state's optimal model space representation, addressing computational challenges in MTS analysis…”
    Získat plný text
    Konferenční příspěvek
  19. 19

    Automated MPI-X Code Generation for Scalable Finite-Difference Solvers Autor Bisbas, George, Nelson, Rhodri, Louboutin, Mathias, Luporini, Fabio, Kelly, Paul H.J., Gorman, Gerard

    ISSN: 1530-2075
    Vydáno: IEEE 03.06.2025
    “… This paper introduces automated codegeneration techniques specifically tailored for distributed memory parallelism (DMP…”
    Získat plný text
    Konferenční příspěvek
  20. 20

    Scalable Adaptive PDE Solvers in Arbitrary Domains Autor Kumar, Saurabh, Ishii, Masado, Fernando, Milinda, Gao, Boshun, Tan, Kendrick, Hsu, Ming-Chen, Krishnamurthy, Adarsh, Sundar, Hari, Ganapathysubramanian, Baskar

    ISSN: 2167-4337
    Vydáno: ACM 14.11.2021
    “…Efficiently and accurately simulating partial differential equations (PDEs) in and around arbitrarily defined geometries, especially with high levels of…”
    Získat plný text
    Konferenční příspěvek