Výsledky vyhledávání - Shared memory programming on NUMA

  1. 1

    ARS: an adaptive runtime system for locality optimization Autor Tao, Jie, Schulz, Martin, Karl, Wolfgang

    ISSN: 0167-739X, 1872-7115
    Vydáno: Elsevier B.V 01.07.2003
    Vydáno v Future generation computer systems (01.07.2003)
    “…Shared memory programs running on Non-Uniform Memory Access (NUMA) machines usually face inherent performance problems stemming from excessive remote memory accesses…”
    Získat plný text
    Journal Article
  2. 2

    Shared memory NUMA programming on I-WAY Autor Nieplocha, J., Harrison, R.J.

    ISBN: 0818675829, 9780818675829
    ISSN: 1082-8907
    Vydáno: IEEE 1996
    “…The performance of the Global Array shared-memory non-uniform memory-access programming model is explored on the I-WAY, wide-area network distributed supercomputer environment…”
    Získat plný text
    Konferenční příspěvek
  3. 3

    Programming parallel dense matrix factorizations and inversion for new-generation NUMA architectures Autor Catalán, Sandra, Igual, Francisco D., Herrero, José R., Rodríguez-Sánchez, Rafael, Quintana-Ortí, Enrique S.

    ISSN: 0743-7315, 1096-0848
    Vydáno: Elsevier Inc 01.05.2023
    “…We propose a methodology to address the programmability issues derived from the emergence of new-generation shared-memory NUMA architectures…”
    Získat plný text
    Journal Article
  4. 4

    Memory Access Behavior Analysis of NUMA‐Based Shared Memory Programs Autor Tao, Jie, Karl, Wolfgang, Schulz, Martin

    ISSN: 1058-9244, 1875-919X
    Vydáno: 01.01.2002
    Vydáno v Scientific programming (01.01.2002)
    “…Shared memory applications running transparently on top of NUMA architectures often face severe performance problems due to bad data locality and excessive remote memory accesses…”
    Získat plný text
    Journal Article
  5. 5

    Scalable task parallelism for NUMA: A uniform abstraction for coordinated scheduling and memory management Autor Drebes, Andi, Pop, Antoniu, Heydemann, Karine, Cohen, Albert, Drach, Nathalie

    Vydáno: ACM 01.09.2016
    “…Dynamic task-parallel programming models are popular on shared-memory systems, promising enhanced scalability, load balancing and locality…”
    Získat plný text
    Konferenční příspěvek
  6. 6

    Unfair Scheduling Patterns in NUMA Architectures Autor Ben-David, Naama, Scully, Ziv, Blelloch, Guy E.

    ISSN: 2641-7936
    Vydáno: IEEE 01.09.2019
    “… This begs the question: what concurrent scheduling models are realistic? This issue is complicated by the intricacies of modern hardware, such as cache coherence protocols and non-uniform memory access (NUMA…”
    Získat plný text
    Konferenční příspěvek
  7. 7

    Mitigating the NUMA effect on task-based runtime systems Autor Maroñas, Marcos, Navarro, Antoni, Ayguadé, Eduard, Beltran, Vicenç

    ISSN: 0920-8542, 1573-0484
    Vydáno: New York Springer US 01.09.2023
    Vydáno v The Journal of supercomputing (01.09.2023)
    “… However, due to hardware restrictions, they adopt a NUMA approach, where each processor accesses local memory faster than remote memories…”
    Získat plný text
    Journal Article
  8. 8

    NUMASFP: NUMA-Aware Dynamic Service Function Chain Placement in Multi-Core Servers Autor Chintapalli, Venkatarami Reddy, Korrapati, Sai Balaram, Tamma, Bheemarjuna Reddy, A, Antony Franklin

    ISSN: 2155-2509
    Vydáno: IEEE 04.01.2022
    “… However, sophisticated servers follow non-uniform memory access (NUMA) architecture in which CPU cores are distributed across different NUMA nodes to enhance scalability…”
    Získat plný text
    Konferenční příspěvek
  9. 9

    Performance prediction and evaluation of parallel processing on a NUMA multiprocessor Autor Zhang, X., Qin, X.

    ISSN: 0098-5589, 1939-3520
    Vydáno: New York, NY IEEE 01.10.1991
    “…, where network contention and memory contention are considered. Performance measurements to support the models and analyses through several numerical examples have been done on the BBN GP1000, a NUMA shared-memory multiprocessor…”
    Získat plný text
    Journal Article
  10. 10

    PNS Lock: A Portable NUMA-Aware Lock with a Standard Interface Autor Gandham, Brahmaiah, Alapati, Praveen

    Vydáno: IEEE 21.09.2024
    “…In shared memory programming, there is a need for synchronization primitives that are suitable for modern hardware to achieve high performance and reduce contention…”
    Získat plný text
    Konferenční příspěvek
  11. 11

    Empirical Installation of Linear Algebra Shared-Memory Subroutines for Auto-Tuning Autor Cámara, Jesús, Cuenca, Javier, Giménez, Domingo, García, Luis Pedro, Vidal, Antonio M.

    ISSN: 0885-7458, 1573-7640
    Vydáno: Boston Springer US 01.06.2014
    “… Medium NUMA and large cc-NUMA systems are used in the experiments. This variety of routines, libraries and systems allows us to obtain general conclusions about the methodology to use for linear algebra shared-memory routines auto-tuning…”
    Získat plný text
    Journal Article
  12. 12

    Some useful strategies for unstructured edge-based solvers on shared memory machines Autor Aubry, R., Houzeaux, G., Vázquez, M., Cela, J. M.

    ISSN: 0029-5981, 1097-0207, 1097-0207
    Vydáno: Chichester, UK John Wiley & Sons, Ltd 04.02.2011
    “…Three strategies for shared memory parallel edge‐based solvers are proposed which guarantee that nodes belonging to one thread are not accessed by other threads for vertex…”
    Získat plný text
    Journal Article
  13. 13

    NestedMP: Enabling cache-aware thread mapping for nested parallel shared memory applications Autor He, Jiangzhou, Chen, Wenguang, Tang, Zhizhong

    ISSN: 0167-8191, 1872-7336
    Vydáno: Elsevier B.V 01.01.2016
    Vydáno v Parallel computing (01.01.2016)
    “…•Task-core mapping schemas for nested-parallel applications may affect performance.•NestedMP allows programmers to declare number of threads for parallel…”
    Získat plný text
    Journal Article
  14. 14

    Analyzing the execution of sparse matrix-vector product on the Finisterrae SMP-NUMA system Autor Pichel, Juan C., Lorenzo, Juan A., Heras, Dora B., Cabaleiro, Jose C., Pena, Tomás F.

    ISSN: 0920-8542, 1573-0484
    Vydáno: Boston Springer US 01.11.2011
    Vydáno v The Journal of supercomputing (01.11.2011)
    “…In this paper, the sparse matrix-vector product (SpMV) is evaluated on the FinisTerrae SMP-NUMA supercomputer…”
    Získat plný text
    Journal Article Konferenční příspěvek
  15. 15

    An OpenMP Compiler for Efficient Use of Distributed Scratchpad Memory in MPSoCs Autor Marongiu, A., Benini, L.

    ISSN: 0018-9340, 1557-9956
    Vydáno: New York IEEE 01.02.2012
    Vydáno v IEEE transactions on computers (01.02.2012)
    “… To efficiently exploit the advantages of low-latency high-bandwidth memory modules in the hierarchy, there is the need for programming models and/or language features that expose such architectural details…”
    Získat plný text
    Journal Article
  16. 16

    Online scalability characterization of data-parallel programs on many cores Autor Younghyun Cho, Oh, Surim, Egger, Bernhard

    Vydáno: ACM 01.09.2016
    “… Reflecting the architecture of NUMA systems, contention is modeled at the last-level caches of the compute nodes and the memory nodes using a two-level queuing model to estimate the mean service time…”
    Získat plný text
    Konferenční příspěvek
  17. 17

    An evaluation of MPI and OpenMP paradigms in finite‐difference explicit methods for PDEs on sharedmemory multi‐ and manycore systems Autor Cabral, Frederico L., Gonzaga de Oliveira, Sanderson L., Osthoff, Carla, Costa, Gabriel P., Brandão, Diego N., Kischinhevsky, Mauricio

    ISSN: 1532-0626, 1532-0634
    Vydáno: Hoboken Wiley Subscription Services, Inc 25.10.2020
    Vydáno v Concurrency and computation (25.10.2020)
    “…® Scalable Processor and the coprocessor Knights Landing. In this study, the performance of a hybrid parallel programming with message passing interface (MPI) and Open Multi‐Processing (OpenMP…”
    Získat plný text
    Journal Article
  18. 18

    Zippy: A Framework for Computation and Visualization on a GPU Cluster Autor Fan, Zhe, Qiu, Feng, Kaufman, Arie E.

    ISSN: 0167-7055, 1467-8659
    Vydáno: Oxford, UK Blackwell Publishing Ltd 01.04.2008
    Vydáno v Computer graphics forum (01.04.2008)
    “…‐level parallelism hierarchy and a non‐uniform memory access (NUMA) model. Zippy preserves the advantages of both message passing and sharedmemory models…”
    Získat plný text
    Journal Article
  19. 19

    REPLICA MBTAC: multithreaded dual-mode processor Autor Forsell, Martti, Roivainen, Jussi, Leppänen, Ville

    ISSN: 0920-8542, 1573-0484
    Vydáno: New York Springer US 01.05.2018
    Vydáno v The Journal of supercomputing (01.05.2018)
    “… These include support for cost-efficient machine instruction-level synchronization and uniform shared global memory for enabling easy-to-program memory allocation of data structures and data movement…”
    Získat plný text
    Journal Article
  20. 20

    Decentralized lock-free distributed queue in MPI remote memory access model Autor Paznikov, Alexey A., Burachenko, Alexander V., Abuelsoud, Mohamed M.

    ISSN: 2267-1242, 2555-0403, 2267-1242
    Vydáno: Les Ulis EDP Sciences 01.01.2024
    Vydáno v E3S web of conferences (01.01.2024)
    “… (concurrent, distributed) data structures. In shared-memory machines (such as SMP/NUMA systems…”
    Získat plný text
    Journal Article Konferenční příspěvek