Search Results - "shared memory algorithms"
-
1
Mutable locks: Combining the best of spin and sleep locks
ISSN: 1532-0626, 1532-0634
Published: Hoboken Wiley Subscription Services, Inc 25.11.2020
Published in: Concurrency and computation (25.11.2020)
“…Summary In this article, we present mutable locks, a synchronization construct with the same semantic of traditional locks (such as spin locks or sleep locks),…”
Journal Article -
2
PowerRTF: Power Diagram based Restricted Tangent Face for Surface Remeshing
ISSN: 0167-7055, 1467-8659
Published: Oxford Blackwell Publishing Ltd 01.08.2023
Published in: Computer graphics forum (01.08.2023)
“…Triangular meshes of superior quality are important for geometric processing in practical applications. Existing approximative CVT‐based remeshing methodology…”
Journal Article -
3
qblaze: An Efficient and Scalable Sparse Quantum Simulator
ISSN: 2475-1421, 2475-1421
Published: New York, NY, USA ACM 09.10.2025
Published in: Proceedings of ACM on programming languages (09.10.2025)
“…Classical simulation of quantum circuits is critical for the development of implementations of quantum algorithms: it does not require access to specialized…”
Journal Article -
4
NBBS: A Non-Blocking Buddy System for Multi-Core Machines
ISSN: 0018-9340, 1557-9956
Published: New York IEEE 01.03.2022
Published in: IEEE transactions on computers (01.03.2022)
“…Common implementations of core memory allocation components handle concurrent allocation/release requests by synchronizing threads via spin-locks. This…”
Journal Article -
5
A Family of Fast and Memory Efficient Lock- and Wait-Free Reclamation
ISSN: 2475-1421, 2475-1421
Published: New York, NY, USA ACM 20.06.2024
Published in: Proceedings of ACM on programming languages (20.06.2024)
“…Historically, memory management based on lock-free reference counting was very inefficient, especially for read-dominated workloads. Thus, approaches such as…”
Journal Article -
6
Making Formulog Fast: An Argument for Unconventional Datalog Evaluation
ISSN: 2475-1421, 2475-1421
Published: New York, NY, USA ACM 08.10.2024
Published in: Proceedings of ACM on programming languages (08.10.2024)
“…With its combination of Datalog, SMT solving, and functional programming, the language Formulog provides an appealing mix of features for implementing…”
Journal Article -
7
Dynamic Buffer Management in Massively Parallel Systems: The Power of Randomness
ISSN: 2329-4957, 2329-4957
Published: 11.02.2025
Published in: ACM transactions on parallel computing (11.02.2025)
“…Massively parallel systems, such as Graphics Processing Units (GPUs), play an increasingly crucial role in today's data-intensive computing. The unique…”
Journal Article -
8
Max-PIM: Fast and Efficient Max/Min Searching in DRAM
Published: IEEE 05.12.2021
Published in: 2021 58th ACM/IEEE Design Automation Conference (DAC) (05.12.2021)
“…Recently, in-DRAM computing is becoming one promising technique to address the notorious 'memory-wall' issue for big data processing. In this work, for the…”
Conference Proceeding -
9
Concurrent size
ISSN: 2475-1421, 2475-1421
Published: New York, NY, USA ACM 31.10.2022
Published in: Proceedings of ACM on programming languages (31.10.2022)
“…The size of a data structure (i.e., the number of elements in it) is a widely used property of a data set. However, for concurrent programs, obtaining a…”
Journal Article -
10
Mat2Stencil: A Modular Matrix-Based DSL for Explicit and Implicit Matrix-Free PDE Solvers on Structured Grid
ISSN: 2475-1421, 2475-1421
Published: New York, NY, USA ACM 16.10.2023
Published in: Proceedings of ACM on programming languages (16.10.2023)
“…Partial differential equation (PDE) solvers are extensively utilized across numerous scientific and engineering fields. However, achieving high performance and…”
Journal Article -
11
SplitSync: Bank Group-Level Split-Synchronization for High-Performance DRAM PIM
Published: IEEE 22.06.2025
Published in: 2025 62nd ACM/IEEE Design Automation Conference (DAC) (22.06.2025)
“…Processing in Memory (PIM) architectures enhance memory bandwidth by utilizing bank-level parallelism, typically implemented with a SIMD structure where all…”
Conference Proceeding -
12
PIMDup: An Optimized Deduplication Design on a Real Processing-in-Memory System
Published: IEEE 22.06.2025
Published in: 2025 62nd ACM/IEEE Design Automation Conference (DAC) (22.06.2025)
“…Data deduplication enhances storage efficiency through non-destructive compression but is often hindered by the chunking process, which requires scanning the…”
Conference Proceeding -
13
3D Tiled Code Generation for Nussinov’s Algorithm
ISSN: 2076-3417, 2076-3417
Published: Basel MDPI AG 01.06.2022
Published in: Applied sciences (01.06.2022)
“…Current state-of-the-art parallel codes used to calculate the maximum number of pairs for a given RNA sequence by means of Nussinov’s algorithm do not allow…”
Journal Article -
14
Migration in Hardware Transactional Memory on Asymmetric Multiprocessor
ISSN: 2169-3536, 2169-3536
Published: Piscataway IEEE 2021
Published in: IEEE access (2021)
“…In this paper, a system is presented which implements transactions migration to an asymmetric multiprocessor in order to decrease the probability of conflicts…”
Journal Article -
15
UPVSS: Jointly Managing Vector Similarity Search with Near-Memory Processing Systems
Published: IEEE 22.06.2025
Published in: 2025 62nd ACM/IEEE Design Automation Conference (DAC) (22.06.2025)
“…Vector similarity search plays a pivotal role in modern applications, including recommendation systems, image search, large language models (LLMs), and…”
Conference Proceeding -
16
AttenPIM: Accelerating LLM Attention with Dual-mode GEMV in Processing-in-Memory
Published: IEEE 22.06.2025
Published in: 2025 62nd ACM/IEEE Design Automation Conference (DAC) (22.06.2025)
“…Large Language Models (LLMs) have demonstrated unprecedented generative performance across a wide range of applications. While recent heterogeneous…”
Conference Proceeding -
17
A Shared-Memory Algorithm for Updating Tree-Based Properties of Large Dynamic Networks
ISSN: 2332-7790, 2372-2096
Published: Piscataway IEEE 01.04.2022
Published in: IEEE transactions on big data (01.04.2022)
“…This paper presents a network-based template for analyzing large-scale dynamic data. Specifically, we propose a novel shared-memory parallel algorithm for…”
Journal Article -
18
pSyncPIM: Partially Synchronous Execution of Sparse Matrix Operations for All-Bank PIM Architectures
Published: IEEE 29.06.2024
Published in: 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) (29.06.2024)
“…Recent commercial incarnations of processing-in-memory (PIM) maintain the standard DRAM interface and employ the all-bank mode execution to maximize bank-level…”
Conference Proceeding -
19
Chasing Away RAts: Semantics and evaluation for relaxed atomics on heterogeneous systems
Published: ACM 01.06.2017
Published in: 2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA) (01.06.2017)
“…An unambiguous and easy-to-understand memory consistency model is crucial for ensuring correct synchronization and guiding future design of heterogeneous…”
Conference Proceeding -
20
SDM: Sharing-Enabled Disaggregated Memory System with Cache Coherent Compute Express Link
Published: IEEE 21.10.2023
Published in: 2023 32nd International Conference on Parallel Architectures and Compilation Techniques (PACT) (21.10.2023)
“…Disaggregated memory has been gaining significant traction as a promising solution for scaling memory capacity and better utilizing memory resources in data…”
Conference Proceeding