Search Results - "Parallel Programming Languages"
-
1
Special issue on advances in techniques for assessment performance portability of HPC applications
ISSN: 0167-739XPublished: Elsevier B.V 01.10.2025Published in Future generation computer systems (01.10.2025)“…This special issue aims to present new developments and advances in techniques for assessment performance portability of high performance computing…”
Get full text
Journal Article -
2
Splitwise: Efficient Generative LLM Inference Using Phase Splitting
Published: IEEE 29.06.2024Published in 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) (29.06.2024)“…Generative large language model (LLM) applications are growing rapidly, leading to large-scale deployments of expensive and power-hungry GPUs. Our…”
Get full text
Conference Proceeding -
3
Polygeist: Raising C to Polyhedral MLIR
Published: IEEE 01.09.2021Published in 2021 30th International Conference on Parallel Architectures and Compilation Techniques (PACT) (01.09.2021)“…We present Polygeist, a new compilation flow that connects the MLIR compiler infrastructure to cutting edge polyhedral optimization tools. It consists of a C…”
Get full text
Conference Proceeding -
4
Qilin: exploiting parallelism on heterogeneous multiprocessors with adaptive mapping
ISBN: 9781605587981, 1605587982ISSN: 1072-4451Published: New York, NY, USA ACM 12.12.2009Published in 2009 42nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO) (12.12.2009)“…Heterogeneous multiprocessors are increasingly important in the multi-core era due to their potential for high performance and energy efficiency. In order for…”
Get full text
Conference Proceeding -
5
SimplePIM: A Software Framework for Productive and Efficient Processing-in-Memory
Published: IEEE 21.10.2023Published in 2023 32nd International Conference on Parallel Architectures and Compilation Techniques (PACT) (21.10.2023)“…Data movement between memory and processors is a major bottleneck in modern computing systems. The processing-in-memory (PIM) paradigm aims to alleviate this…”
Get full text
Conference Proceeding -
6
Opportunistically Parallel Lambda Calculus
ISSN: 2475-1421, 2475-1421Published: New York, NY, USA ACM 09.10.2025Published in Proceedings of ACM on programming languages (09.10.2025)“…Scripting languages are widely used to compose external calls such as native libraries and network services. In such scripts, execution time is often dominated…”
Get full text
Journal Article -
7
Dynamically Fusing Python HPC Kernels
ISSN: 2994-970X, 2994-970XPublished: New York, NY, USA ACM 22.06.2025Published in Proceedings of the ACM on software engineering (22.06.2025)“…Recent trends in high-performance computing show an increase in the adoption of performance portable frameworks such as Kokkos and interpreted languages such…”
Get full text
Journal Article -
8
Mars: A MapReduce Framework on graphics processors
Published: ACM 01.10.2008Published in PACT'08 : proceedings of the Seventeenth International Conference on Parallel Architectures and Compilation Techniques : Toronto, Ontario, Canada, October 25-29, 2008 (01.10.2008)“…We design and implement Mars, a MapReduce framework, on graphics processors (GPUs). MapReduce is a distributed programming framework originally proposed by…”
Get full text
Conference Proceeding -
9
Efficient Execution of OpenMP on GPUs
Published: IEEE 02.04.2022Published in 2022 IEEE/ACM International Symposium on Code Generation and Optimization (CGO) (02.04.2022)“…OpenMP is the preferred choice for CPU parallelism in High-Performance-Computing (HPC) applications written in C, C++, or Fortran. As HPC systems became…”
Get full text
Conference Proceeding -
10
Python shared atomic data types
ISSN: 0038-0644, 1097-024XPublished: Bognor Regis Wiley Subscription Services, Inc 01.12.2023Published in Software, practice & experience (01.12.2023)“…Although atomicity plays a key role in data operations of shared variables in parallel computation, researchers haven't treated atomicity in Python in much…”
Get full text
Journal Article -
11
PID-Comm: A Fast and Flexible Collective Communication Framework for Commodity Processing-in-DIMM Devices
Published: IEEE 29.06.2024Published in 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) (29.06.2024)“…Recent dual in-line memory modules (DIMMs) are starting to support processing-in-memory (PIM) by associating their memory banks with processing elements (PEs),…”
Get full text
Conference Proceeding -
12
Cognitive Correlative Encoding for Genome Sequence Matching in Hyperdimensional System
Published: IEEE 05.12.2021Published in 2021 58th ACM/IEEE Design Automation Conference (DAC) (05.12.2021)“…Pattern matching is one of the key algorithms in identifying and analyzing genomic data. In this paper, we propose HYPERS, a novel framework supporting highly…”
Get full text
Conference Proceeding -
13
HiSpTRSV: Exploring Tile-Level Parallelism for SpTRSV Acceleration on FPGAs
Published: IEEE 22.06.2025Published in 2025 62nd ACM/IEEE Design Automation Conference (DAC) (22.06.2025)“…Sparse Triangular Solve (SpTRSV) is a critical level2 kernel in sparse Basic Linear Algebra Subprograms (BLAS). While Field-Programmable Gate Array (FPGA)…”
Get full text
Conference Proceeding -
14
Adaptive heterogeneous scheduling for integrated GPUs
Published: ACM 01.08.2014Published in PACT '14 : proceedings of the 23rd International Conference on Parallel Architectures and Compilation Techniques : August 24-27, 2014, Edmonton, AB, Canada (01.08.2014)“…Many processors today integrate a CPU and GPU on the same die, which allows them to share resources like physical memory and lowers the cost of CPU-GPU…”
Get full text
Conference Proceeding -
15
An effective GPU implementation of breadth-first search
ISBN: 9781424466771, 1424466776ISSN: 0738-100XPublished: IEEE 13.06.2010Published in Design Automation Conference (13.06.2010)“…Breadth-first search (BFS) has wide applications in electronic design automation (EDA) as well as in other fields. Researchers have tried to accelerate BFS on…”
Get full text
Conference Proceeding -
16
Scalable, Programmable and Dense: The HammerBlade Open-Source RISC-V Manycore
Published: IEEE 29.06.2024Published in 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) (29.06.2024)“…Existing tiled manycore architectures propose to convert abundant silicon resources into general-purpose parallel processors with unmatched computational…”
Get full text
Conference Proceeding -
17
SafeRace: Assessing and Addressing WebGPU Memory Safety in the Presence of Data Races
ISSN: 2475-1421, 2475-1421Published: New York, NY, USA ACM 09.10.2025Published in Proceedings of ACM on programming languages (09.10.2025)“…In untrusted execution environments such as web browsers, code from remote sources is regularly executed. To harden these environments against attacks,…”
Get full text
Journal Article -
18
PairGraph: An Efficient Search-space-aware Accelerator for High-performance Concurrent Pairwise Queries
Published: IEEE 22.06.2025Published in 2025 62nd ACM/IEEE Design Automation Conference (DAC) (22.06.2025)“…Pairwise queries have been widely used in many applications. Although several approaches have been recently proposed to accelerate a single query, they still…”
Get full text
Conference Proceeding -
19
Automatic Parallelism Management
ISSN: 2475-1421, 2475-1421Published: New York, NY, USA ACM 02.01.2024Published in Proceedings of ACM on programming languages (02.01.2024)“…On any modern computer architecture today, parallelism comes with a modest cost, born from the creation and management of threads or tasks. Today, programmers…”
Get full text
Journal Article -
20
Evaluation of Blue Gene/Q hardware support for transactional memories
Published: ACM 01.09.2012Published in PACT'12 : proceedings of the 21st International Conference on Parallel Architectures and Compilation Techniques, September 19-23, Minneapolis, Minnesota, USA (01.09.2012)“…This paper describes an end-to-end system implementation of the transactional memory (TM) programming model on top of the hardware transactional memory (HTM)…”
Get full text
Conference Proceeding

