Search results - "distributed-memory parallelism"

  1.

    Axially-deformed solution of the Skyrme-Hartree-Fock-Bogoliubov equations using the transformed harmonic oscillator basis (IV) hfbtho (v4.0): A new version of the program by Marević, P., Schunck, N., Ney, E.M., Navarro Pérez, R., Verriere, M., O'Neal, J.

    ISSN: 0010-4655, 1879-2944
    Published: United States Elsevier B.V 01.07.2022
    Published in Computer physics communications (01.07.2022)
    “… We describe the new version 4.0 of the code hfbtho that solves the nuclear Hartree-Fock-Bogoliubov problem by using the deformed harmonic oscillator basis in …”
    Journal Article
  2.
  3.

    3D DFT by block tensor-matrix multiplication via a modified Cannon's algorithm: Implementation and scaling on distributed-memory clusters with fat tree networks by Malapally, Nitin, Bolnykh, Viacheslav, Suarez, Estela, Carloni, Paolo, Lippert, Thomas, Mandelli, Davide

    ISSN: 0743-7315
    Published: Elsevier Inc 01.11.2024
    Published in Journal of parallel and distributed computing (01.11.2024)
    “… A known scalability bottleneck of the parallel 3D FFT is its use of all-to-all communications. Here, we present S3DFT, a library that circumvents this by using …”
    Journal Article
  4.

    Massively parallel implementation and approaches to simulate quantum dynamics using Krylov subspace techniques by Brenes, Marlon, Varma, Vipin Kerala, Scardicchio, Antonello, Girotto, Ivan

    ISSN: 0010-4655, 1879-2944
    Published: Elsevier B.V 01.02.2019
    Published in Computer physics communications (01.02.2019)
    “… We have developed an application and implemented parallel algorithms in order to provide a computational framework suitable for massively parallel …”
    Journal Article
  5.

    Leveraging HPC accelerator architectures with modern techniques: hydrologic modeling on GPUs with ParFlow by Hokkanen, Jaro, Kollet, Stefan, Kraus, Jiri, Herten, Andreas, Hrywniak, Markus, Pleiter, Dirk

    ISSN: 1420-0597, 1573-1499
    Published: Cham Springer International Publishing 01.10.2021
    Published in Computational geosciences (01.10.2021)
    “… Rapidly changing heterogeneous supercomputer architectures pose a great challenge to many scientific communities trying to leverage the latest technology in …”
    Journal Article
  6.

    MPI+X: task-based parallelisation and dynamic load balance of finite element assembly by Garcia-Gasulla, Marta, Houzeaux, Guillaume, Ferrer, Roger, Artigues, Antoni, López, Victor, Labarta, Jesús, Vázquez, Mariano

    ISSN: 1061-8562, 1029-0257
    Published: Abingdon Taylor & Francis 16.03.2019
    “… The main computing phases of numerical methods for solving partial differential equations are the algebraic system assembly and the iterative solver. This work …”
    Journal Article
  7.

    A shared compilation stack for distributed-memory parallelism in stencil DSLs by Bisbas, George, Lydike, Anton, Bauer, Emilien, Brown, Nick, Fehr, Mathieu, Mitchell, Lawrence, Rodriguez-Canal, Gabriel, Jamieson, Maurice, Kelly, Paul H J, Steuwer, Michel, Grosser, Tobias

    ISSN: 2331-8422
    Published: Ithaca Cornell University Library, arXiv.org 02.04.2024
    Published in arXiv.org (02.04.2024)
    “… Domain Specific Languages (DSLs) increase programmer productivity and provide high performance. Their targeted abstractions allow scientists to express …”
    Paper
  8.

    Parallelization of a distributed ecohydrological model by Liu, Ning, Shaikh, Mohsin Ahmed, Kala, Jatin, Harper, Richard J., Dell, Bernard, Liu, Shirong, Sun, Ge

    ISSN: 1364-8152, 1873-6726
    Published: Oxford Elsevier Ltd 01.03.2018
    “… WaSSI-C is an ecohydrological model which couples water and carbon cycles with water use efficiency (WUE) derived from global eddy flux observations. However, …”
    Journal Article
  9.

    Iterators, Schedulers, and Distributed-memory Parallelism by Graefe, Goetz

    ISSN: 0038-0644, 1097-024X
    Published: New York John Wiley & Sons, Ltd 01.04.1996
    Published in Software, practice & experience (01.04.1996)
    “… In previous work, we demonstrated the advantages of encapsulating query evaluation algorithms as ‘iterators’ for sequential and parallel query evaluation …”
    Journal Article
  10.

    Automated MPI-X Code Generation for Scalable Finite-Difference Solvers by Bisbas, George, Nelson, Rhodri, Louboutin, Mathias, Luporini, Fabio, Kelly, Paul H.J., Gorman, Gerard

    ISSN: 1530-2075
    Published: IEEE 03.06.2025
    “… This paper introduces automated codegeneration techniques specifically tailored for distributed memory parallelism (DMP …”
    Conference Proceeding
  11.

    Scalable Adaptive PDE Solvers in Arbitrary Domains by Kumar, Saurabh, Ishii, Masado, Fernando, Milinda, Gao, Boshun, Tan, Kendrick, Hsu, Ming-Chen, Krishnamurthy, Adarsh, Sundar, Hari, Ganapathysubramanian, Baskar

    ISSN: 2167-4337
    Published: ACM 14.11.2021
    “… Efficiently and accurately simulating partial differential equations (PDEs) in and around arbitrarily defined geometries, especially with high levels of …”
    Conference Proceeding
  12.

    QuIDS: A Large-Scale Distributed Framework for Quantum Irregular Dynamics Simulations by Touzet, Joseph, Kaya, Oguz, Arrighi, Pablo, Durbec, Amelia

    ISSN: 2995-066X
    Published: IEEE 03.06.2025
    “… In traditional quantum computing, e.g. in the quantum circuit model, the size of the data structure describing basis elements is well known, because the …”
    Conference Proceeding
  13.

    Reservoir Echo State Network for Classification of Multivariate Time Series by Purkayastha, Basab Bijoy, Barma, Shovan

    ISSN: 2770-0135
    Published: IEEE 18.12.2023
    “… Multivariate time series (MTS) classification has been tackled using various methods, including Reservoir Computing (RC), which generates efficient vectorized …”
    Conference Proceeding
  14.

    CAFe: Coarray Fortran Extensions for Heterogeneous Computing by Rasmussen, Craig, Sottile, Matthew, Rasmussen, Soren, Nagle, Dan, Dumas, William

    Published: IEEE 01.05.2016
    “… Emerging hybrid accelerator architectures are often proposed for inclusion as components in an exascale machine, not only for performance reasons but also to …”
    Conference Proceeding
  15.

    Chapter 9 - Parallel Algorithms by Thomas Sterling, Matthew Anderson, Maciej Brodowicz

    ISBN: 9780124202153, 9780124201583, 0124202152, 012420158X
    Published: Elsevier Inc 2018
    Published in High Performance Computing (2018)
    “… Given a specific algorithm or numerical method, there are several ways to express it in parallel computation. The choice of way will almost certainly be …”
    Book Chapter
  16.

    Algorithms for high-throughput disk-to-disk sorting by Sundar, Hari, Malhotra, Dhairya, Schulz, Karl W.

    ISBN: 9781450323789, 1450323782
    ISSN: 2167-4329
    Published: New York, NY, USA ACM 17.11.2013
    “… In this paper, we present a new out-of-core sort algorithm, designed for problems that are too large to fit into the aggregate RAM available on modern …”
    Conference Proceeding
  17.

    Poster: Preliminary Report for a High Precision Distributed Memory Parallel Eigenvalue Solver by Imamura, Toshiyuki, Yamada, Susumu, Machida, Masahiko

    ISBN: 1467362182, 9781467362184
    Published: IEEE 01.11.2012
    “… This study covers the design and implementation of a DD (double-double) extended parallel eigenvalue solver, namely QPEigenK. We extended most of underlying …”
    Conference Proceeding
  18.

    Analyzing Asynchronous Pipeline Schedules by Donaldson, Val, Ferrante, Jeanne

    ISSN: 0885-7458, 1573-7640
    Published: New York, NY Plenum Press 01.02.1998
    Published in International journal of parallel programming (01.02.1998)
    “… Asynchronous pipelining is a form of parallelism that is useful in both distributed and shared memory systems. We show that asynchronous pipeline schedules are …”
    Journal Article, Conference Proceeding
  19.

    Multi-Level Domain-Decomposition Strategy for Solving the Eikonal Equation with the Fast-Sweeping Method by Shrestha, Anup, Senocak, Inanc

    ISSN: 1045-9219, 1558-2183
    Published: New York IEEE 01.10.2018
    “… Most parallel strategies have focused on shared-memory parallelism and do not readily extend to distributed-memory parallelism to handle large-scale problems …”
    Journal Article
  20.

    TTK is Getting MPI-Ready by Le Guillou, Eve, Will, Michael, Guillou, Pierre, Lukasczyk, Jonas, Fortin, Pierre, Garth, Christoph, Tierny, Julien

    ISSN: 1077-2626, 1941-0506
    Published: United States IEEE 01.08.2024
    “… ) to distributed-memory parallelism with the Message Passing Interface (MPI). While several recent papers introduced topology-based approaches for distributed-memory …”
    Journal Article