Search results - "Shared-memory algorithm"
1
Parallel intersection counting on shared-memory multiprocessors and GPUs
ISSN: 0167-739X, 1872-7115
Published: Elsevier B.V, 01.10.2024
Published in: Future generation computer systems (01.10.2024)
“… Computing intersections among sets of one-dimensional intervals is an ubiquitous problem in computational geometry with important applications in …”
Journal Article
2
Mutable locks: Combining the best of spin and sleep locks
ISSN: 1532-0626, 1532-0634
Published: Hoboken Wiley Subscription Services, Inc, 25.11.2020
Published in: Concurrency and computation (25.11.2020)
“… Summary In this article, we present mutable locks, a synchronization construct with the same semantic of traditional locks (such as spin locks or sleep locks), …”
Journal Article
3
PowerRTF: Power Diagram based Restricted Tangent Face for Surface Remeshing
ISSN: 0167-7055, 1467-8659
Published: Oxford Blackwell Publishing Ltd, 01.08.2023
Published in: Computer graphics forum (01.08.2023)
“… Triangular meshes of superior quality are important for geometric processing in practical applications. Existing approximative CVT‐based remeshing methodology …”
Journal Article
4
qblaze: An Efficient and Scalable Sparse Quantum Simulator
ISSN: 2475-1421, 2475-1421
Published: New York, NY, USA ACM, 09.10.2025
Published in: Proceedings of ACM on programming languages (09.10.2025)
“… Classical simulation of quantum circuits is critical for the development of implementations of quantum algorithms: it does not require access to specialized …”
Journal Article
5
NBBS: A Non-Blocking Buddy System for Multi-Core Machines
ISSN: 0018-9340, 1557-9956
Published: New York IEEE, 01.03.2022
Published in: IEEE transactions on computers (01.03.2022)
“… Common implementations of core memory allocation components handle concurrent allocation/release requests by synchronizing threads via spin-locks. This …”
Journal Article
6
A Family of Fast and Memory Efficient Lock- and Wait-Free Reclamation
ISSN: 2475-1421, 2475-1421
Published: New York, NY, USA ACM, 20.06.2024
Published in: Proceedings of ACM on programming languages (20.06.2024)
“… Historically, memory management based on lock-free reference counting was very inefficient, especially for read-dominated workloads. Thus, approaches such as …”
Journal Article
7
Making Formulog Fast: An Argument for Unconventional Datalog Evaluation
ISSN: 2475-1421, 2475-1421
Published: New York, NY, USA ACM, 08.10.2024
Published in: Proceedings of ACM on programming languages (08.10.2024)
“… With its combination of Datalog, SMT solving, and functional programming, the language Formulog provides an appealing mix of features for implementing …”
Journal Article
8
Dynamic Buffer Management in Massively Parallel Systems: The Power of Randomness
ISSN: 2329-4957, 2329-4957
Published: 11.02.2025
Published in: ACM transactions on parallel computing (11.02.2025)
“… Massively parallel systems, such as Graphics Processing Units (GPUs), play an increasingly crucial role in today's data-intensive computing. The unique …”
Journal Article
9
Max-PIM: Fast and Efficient Max/Min Searching in DRAM
Published: IEEE, 05.12.2021
Published in: 2021 58th ACM/IEEE Design Automation Conference (DAC) (05.12.2021)
“… Recently, in-DRAM computing is becoming one promising technique to address the notorious 'memory-wall' issue for big data processing. In this work, for the …”
Conference Proceeding
10
Concurrent size
ISSN: 2475-1421, 2475-1421
Published: New York, NY, USA ACM, 31.10.2022
Published in: Proceedings of ACM on programming languages (31.10.2022)
“… The size of a data structure (i.e., the number of elements in it) is a widely used property of a data set. However, for concurrent programs, obtaining a …”
Journal Article
11
Mat2Stencil: A Modular Matrix-Based DSL for Explicit and Implicit Matrix-Free PDE Solvers on Structured Grid
ISSN: 2475-1421, 2475-1421
Published: New York, NY, USA ACM, 16.10.2023
Published in: Proceedings of ACM on programming languages (16.10.2023)
“… Partial differential equation (PDE) solvers are extensively utilized across numerous scientific and engineering fields. However, achieving high performance and …”
Journal Article
12
SplitSync: Bank Group-Level Split-Synchronization for High-Performance DRAM PIM
Published: IEEE, 22.06.2025
Published in: 2025 62nd ACM/IEEE Design Automation Conference (DAC) (22.06.2025)
“… Processing in Memory (PIM) architectures enhance memory bandwidth by utilizing bank-level parallelism, typically implemented with a SIMD structure where all …”
Conference Proceeding
13
PIMDup: An Optimized Deduplication Design on a Real Processing-in-Memory System
Published: IEEE, 22.06.2025
Published in: 2025 62nd ACM/IEEE Design Automation Conference (DAC) (22.06.2025)
“… Data deduplication enhances storage efficiency through non-destructive compression but is often hindered by the chunking process, which requires scanning the …”
Conference Proceeding
14
3D Tiled Code Generation for Nussinov’s Algorithm
ISSN: 2076-3417, 2076-3417
Published: Basel MDPI AG, 01.06.2022
Published in: Applied sciences (01.06.2022)
“… Current state-of-the-art parallel codes used to calculate the maximum number of pairs for a given RNA sequence by means of Nussinov’s algorithm do not allow …”
Journal Article
15
Migration in Hardware Transactional Memory on Asymmetric Multiprocessor
ISSN: 2169-3536, 2169-3536
Published: Piscataway IEEE, 2021
Published in: IEEE access (2021)
“… In this paper, a system is presented which implements transactions migration to an asymmetric multiprocessor in order to decrease the probability of conflicts …”
Journal Article
16
AttenPIM: Accelerating LLM Attention with Dual-mode GEMV in Processing-in-Memory
Published: IEEE, 22.06.2025
Published in: 2025 62nd ACM/IEEE Design Automation Conference (DAC) (22.06.2025)
“… Large Language Models (LLMs) have demonstrated unprecedented generative performance across a wide range of applications. While recent heterogeneous …”
Conference Proceeding
17
UPVSS: Jointly Managing Vector Similarity Search with Near-Memory Processing Systems
Published: IEEE, 22.06.2025
Published in: 2025 62nd ACM/IEEE Design Automation Conference (DAC) (22.06.2025)
“… Vector similarity search plays a pivotal role in modern applications, including recommendation systems, image search, large language models (LLMs), and …”
Conference Proceeding
18
A Shared-Memory Algorithm for Updating Tree-Based Properties of Large Dynamic Networks
ISSN: 2332-7790, 2372-2096
Published: Piscataway IEEE, 01.04.2022
Published in: IEEE transactions on big data (01.04.2022)
“… This paper presents a network-based template for analyzing large-scale dynamic data. Specifically, we propose a novel shared-memory parallel algorithm for …”
Journal Article
19
pSyncPIM: Partially Synchronous Execution of Sparse Matrix Operations for All-Bank PIM Architectures
Published: IEEE, 29.06.2024
Published in: 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) (29.06.2024)
“… Recent commercial incarnations of processing-in-memory (PIM) maintain the standard DRAM interface and employ the all-bank mode execution to maximize bank-level …”
Conference Proceeding
20
Chasing Away RAts: Semantics and evaluation for relaxed atomics on heterogeneous systems
Published: ACM, 01.06.2017
Published in: 2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA) (01.06.2017)
“… An unambiguous and easy-to-understand memory consistency model is crucial for ensuring correct synchronization and guiding future design of heterogeneous …”
Conference Proceeding