Search results - "Shared-memory algorithm"

  1.

    Parallel intersection counting on shared-memory multiprocessors and GPUs by Marzolla, Moreno, Birolo, Giovanni, D’Angelo, Gabriele, Fariselli, Piero

    ISSN: 0167-739X, 1872-7115
    Published: Elsevier B.V 01.10.2024
    Published in Future generation computer systems (01.10.2024)
    “… Computing intersections among sets of one-dimensional intervals is an ubiquitous problem in computational geometry with important applications in …”
    Full text
    Journal Article
  2.

    Mutable locks: Combining the best of spin and sleep locks by Marotta, Romolo, Tiriticco, Davide, Di Sanzo, Pierangelo, Pellegrini, Alessandro, Ciciani, Bruno, Quaglia, Francesco

    ISSN: 1532-0626, 1532-0634
    Published: Hoboken Wiley Subscription Services, Inc 25.11.2020
    Published in Concurrency and computation (25.11.2020)
    “… In this article, we present mutable locks, a synchronization construct with the same semantic of traditional locks (such as spin locks or sleep locks), …”
    Full text
    Journal Article
  3.

    PowerRTF: Power Diagram based Restricted Tangent Face for Surface Remeshing by Yao, Yuyou, Liu, Jingjing, Fei, Yue, Wu, Wenming, Zhang, Gaofeng, Yan, Dong‐Ming, Zheng, Liping

    ISSN: 0167-7055, 1467-8659
    Published: Oxford Blackwell Publishing Ltd 01.08.2023
    Published in Computer graphics forum (01.08.2023)
    “… Triangular meshes of superior quality are important for geometric processing in practical applications. Existing approximative CVT‐based remeshing methodology …”
    Full text
    Journal Article
  4.

    qblaze: An Efficient and Scalable Sparse Quantum Simulator by Venev, Hristo, Udomsrirungruang, Thien, Dimitrov, Dimitar, Gehr, Timon, Vechev, Martin

    ISSN: 2475-1421, 2475-1421
    Published: New York, NY, USA ACM 09.10.2025
    Published in Proceedings of ACM on programming languages (09.10.2025)
    “… Classical simulation of quantum circuits is critical for the development of implementations of quantum algorithms: it does not require access to specialized …”
    Full text
    Journal Article
  5.

    NBBS: A Non-Blocking Buddy System for Multi-Core Machines by Marotta, Romolo, Ianni, Mauro, Pellegrini, Alessandro, Quaglia, Francesco

    ISSN: 0018-9340, 1557-9956
    Published: New York IEEE 01.03.2022
    Published in IEEE transactions on computers (01.03.2022)
    “… Common implementations of core memory allocation components handle concurrent allocation/release requests by synchronizing threads via spin-locks. This …”
    Full text
    Journal Article
  6.

    A Family of Fast and Memory Efficient Lock- and Wait-Free Reclamation by Nikolaev, Ruslan, Ravindran, Binoy

    ISSN: 2475-1421, 2475-1421
    Published: New York, NY, USA ACM 20.06.2024
    Published in Proceedings of ACM on programming languages (20.06.2024)
    “… Historically, memory management based on lock-free reference counting was very inefficient, especially for read-dominated workloads. Thus, approaches such as …”
    Full text
    Journal Article
  7.

    Making Formulog Fast: An Argument for Unconventional Datalog Evaluation by Bembenek, Aaron, Greenberg, Michael, Chong, Stephen

    ISSN: 2475-1421, 2475-1421
    Published: New York, NY, USA ACM 08.10.2024
    Published in Proceedings of ACM on programming languages (08.10.2024)
    “… With its combination of Datalog, SMT solving, and functional programming, the language Formulog provides an appealing mix of features for implementing …”
    Full text
    Journal Article
  8.

    Dynamic Buffer Management in Massively Parallel Systems: The Power of Randomness by Pham, Minh, Yuan, Yongke, Li, Hao, Mou, Chengcheng, Tu, Yicheng, Xu, Zichen, Meng, Jinghan

    ISSN: 2329-4957, 2329-4957
    Published: 11.02.2025
    Published in ACM transactions on parallel computing (11.02.2025)
    “… Massively parallel systems, such as Graphics Processing Units (GPUs), play an increasingly crucial role in today's data-intensive computing. The unique …”
    More details
    Journal Article
  9.

    Max-PIM: Fast and Efficient Max/Min Searching in DRAM by Zhang, Fan, Angizi, Shaahin, Fan, Deliang

    Published: IEEE 05.12.2021
    “… Recently, in-DRAM computing is becoming one promising technique to address the notorious 'memory-wall' issue for big data processing. In this work, for the …”
    Full text
    Conference Proceeding
  10.

    Concurrent size by Sela, Gal, Petrank, Erez

    ISSN: 2475-1421, 2475-1421
    Published: New York, NY, USA ACM 31.10.2022
    Published in Proceedings of ACM on programming languages (31.10.2022)
    “… The size of a data structure (i.e., the number of elements in it) is a widely used property of a data set. However, for concurrent programs, obtaining a …”
    Full text
    Journal Article
  11.

    Mat2Stencil: A Modular Matrix-Based DSL for Explicit and Implicit Matrix-Free PDE Solvers on Structured Grid by Cao, Huanqi, Tang, Shizhi, Zhu, Qianchao, Yu, Bowen, Chen, Wenguang

    ISSN: 2475-1421, 2475-1421
    Published: New York, NY, USA ACM 16.10.2023
    Published in Proceedings of ACM on programming languages (16.10.2023)
    “… Partial differential equation (PDE) solvers are extensively utilized across numerous scientific and engineering fields. However, achieving high performance and …”
    Full text
    Journal Article
  12.

    SplitSync: Bank Group-Level Split-Synchronization for High-Performance DRAM PIM by Yoon, Byungkuk, Han, Sanghyeok, Park, Gyeonghwan, Kim, Jae-Joon

    Published: IEEE 22.06.2025
    “… Processing in Memory (PIM) architectures enhance memory bandwidth by utilizing bank-level parallelism, typically implemented with a SIMD structure where all …”
    Full text
    Conference Proceeding
  13.

    PIMDup: An Optimized Deduplication Design on a Real Processing-in-Memory System by Yeh, Chun-Le, Chen, Liang-Chi, Ho, Chien-Chung, Chang, Yu-Ming, Chang, Da-Wei

    Published: IEEE 22.06.2025
    “… Data deduplication enhances storage efficiency through non-destructive compression but is often hindered by the chunking process, which requires scanning the …”
    Full text
    Conference Proceeding
  14.

    3D Tiled Code Generation for Nussinov’s Algorithm by Bielecki, Włodzimierz, Błaszyński, Piotr, Pałkowski, Marek

    ISSN: 2076-3417, 2076-3417
    Published: Basel MDPI AG 01.06.2022
    Published in Applied sciences (01.06.2022)
    “… Current state-of-the-art parallel codes used to calculate the maximum number of pairs for a given RNA sequence by means of Nussinov’s algorithm do not allow …”
    Full text
    Journal Article
  15.

    Migration in Hardware Transactional Memory on Asymmetric Multiprocessor by Sustran, Zivojin, Protic, Jelica

    ISSN: 2169-3536, 2169-3536
    Published: Piscataway IEEE 2021
    Published in IEEE access (2021)
    “… In this paper, a system is presented which implements transactions migration to an asymmetric multiprocessor in order to decrease the probability of conflicts …”
    Full text
    Journal Article
  16.

    AttenPIM: Accelerating LLM Attention with Dual-mode GEMV in Processing-in-Memory by Chen, Liyan, Lyu, Dongxu, Li, Zhenyu, Jiang, Jianfei, Wang, Qin, Mao, Zhigang, Jing, Naifeng

    Published: IEEE 22.06.2025
    “… Large Language Models (LLMs) have demonstrated unprecedented generative performance across a wide range of applications. While recent heterogeneous …”
    Full text
    Conference Proceeding
  17.

    UPVSS: Jointly Managing Vector Similarity Search with Near-Memory Processing Systems by Liu, Chun-Chien, Wu, Chun-Feng, Jin, Yunho

    Published: IEEE 22.06.2025
    “… Vector similarity search plays a pivotal role in modern applications, including recommendation systems, image search, large language models (LLMs), and …”
    Full text
    Conference Proceeding
  18.

    A Shared-Memory Algorithm for Updating Tree-Based Properties of Large Dynamic Networks by Srinivasan, Sriram, Pollard, Samuel D., Norris, Boyana, Das, Sajal K., Bhowmick, Sanjukta

    ISSN: 2332-7790, 2372-2096
    Published: Piscataway IEEE 01.04.2022
    Published in IEEE transactions on big data (01.04.2022)
    “… This paper presents a network-based template for analyzing large-scale dynamic data. Specifically, we propose a novel shared-memory parallel algorithm for …”
    Full text
    Journal Article
  19.

    pSyncPIM: Partially Synchronous Execution of Sparse Matrix Operations for All-Bank PIM Architectures by Baek, Daehyeon, Hwang, Soojin, Huh, Jaehyuk

    Published: IEEE 29.06.2024
    “… Recent commercial incarnations of processing-in-memory (PIM) maintain the standard DRAM interface and employ the all-bank mode execution to maximize bank-level …”
    Full text
    Conference Proceeding
  20.

    Chasing Away RAts: Semantics and evaluation for relaxed atomics on heterogeneous systems by Sinclair, Matthew D., Alsop, Johnathan, Adve, Sarita V.

    Published: ACM 01.06.2017
    “… An unambiguous and easy-to-understand memory consistency model is crucial for ensuring correct synchronization and guiding future design of heterogeneous …”
    Full text
    Conference Proceeding