Suchergebnisse - software tools for parallel programming
-
1
From Design Patterns to Parallel Architectural Skeletons
ISSN: 0743-7315, 1096-0848Veröffentlicht: San Diego, CA Elsevier Inc 01.04.2002Veröffentlicht in Journal of parallel and distributed computing (01.04.2002)“… The concept of design patterns has been extensively studied and applied in the context of object-oriented software design …”
Volltext
Journal Article -
2
The Direction of Development Parallel Programming of Software Tools Software Complexes Military Purpose
ISSN: 1999-494X, 2313-6057Veröffentlicht: Krasnoyarsk Siberian Federal University 01.01.2022Veröffentlicht in Journal of Siberian Federal University. Engineering & Technologies (01.01.2022)“… В статье рассмотрены вопросы классификационного выбора предпочтительных алгоритмов распараллеливания (с минимальным временем выполнения), реализованных в …”
Volltext
Journal Article -
3
The Direction of Development Parallel Programming of Software Tools Software Complexes Military Purpose
ISSN: 1999-494X, 2313-6057Veröffentlicht: 01.02.2022Veröffentlicht in Journal of Siberian Federal University. Engineering & Technologies (01.02.2022)“… (with minimal execution time) implemented in parallel software development tools for multi-core (multiprocessor …”
Volltext
Journal Article -
4
Splitwise: Efficient Generative LLM Inference Using Phase Splitting
Veröffentlicht: IEEE 29.06.2024Veröffentlicht in 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) (29.06.2024)“… Generative large language model (LLM) applications are growing rapidly, leading to large-scale deployments of expensive and power-hungry GPUs. Our …”
Volltext
Tagungsbericht -
5
Polygeist: Raising C to Polyhedral MLIR
Veröffentlicht: IEEE 01.09.2021Veröffentlicht in 2021 30th International Conference on Parallel Architectures and Compilation Techniques (PACT) (01.09.2021)“… We present Polygeist, a new compilation flow that connects the MLIR compiler infrastructure to cutting edge polyhedral optimization tools …”
Volltext
Tagungsbericht -
6
SimplePIM: A Software Framework for Productive and Efficient Processing-in-Memory
Veröffentlicht: IEEE 21.10.2023Veröffentlicht in 2023 32nd International Conference on Parallel Architectures and Compilation Techniques (PACT) (21.10.2023)“… This paper presents a new software framework, SimplePIM, to aid programming real PIM systems …”
Volltext
Tagungsbericht -
7
UWOmppro: UWOmp++ with Point-to-Point Synchronization, Reduction and Schedules
Veröffentlicht: IEEE 21.10.2023Veröffentlicht in 2023 32nd International Conference on Parallel Architectures and Compilation Techniques (PACT) (21.10.2023)“… (for example, iterations of a parallel-for-loop). While OpenMP allows synchronization among these threads, many classes of computations can be conveniently expressed by specifying synchronization among the parallel activities …”
Volltext
Tagungsbericht -
8
LLM-Based Java Concurrent Program to ArkTS Converter
ISSN: 2643-1572Veröffentlicht: ACM 27.10.2024Veröffentlicht in IEEE/ACM International Conference on Automated Software Engineering : [proceedings] (27.10.2024)“… However, HarmonyOS utilizes ArkTS, a superset of TypeScript, as the programming language for application development …”
Volltext
Tagungsbericht -
9
Efficient Execution of OpenMP on GPUs
Veröffentlicht: IEEE 02.04.2022Veröffentlicht in 2022 IEEE/ACM International Symposium on Code Generation and Optimization (CGO) (02.04.2022)“… OpenMP is the preferred choice for CPU parallelism in High-Performance-Computing (HPC) applications written in C, C++, or Fortran. As HPC systems became …”
Volltext
Tagungsbericht -
10
PID-Comm: A Fast and Flexible Collective Communication Framework for Commodity Processing-in-DIMM Devices
Veröffentlicht: IEEE 29.06.2024Veröffentlicht in 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) (29.06.2024)“… Many highly parallel applications have been shown to benefit from these PIM-enabled DIMMs, but further speedup is often limited by the huge overhead of inter-PE collective communication …”
Volltext
Tagungsbericht -
11
Cognitive Correlative Encoding for Genome Sequence Matching in Hyperdimensional System
Veröffentlicht: IEEE 05.12.2021Veröffentlicht in 2021 58th ACM/IEEE Design Automation Conference (DAC) (05.12.2021)“… In this paper, we propose HYPERS, a novel framework supporting highly efficient and parallel pattern matching based on HyperDimensional computing (HDC …”
Volltext
Tagungsbericht -
12
Architecture-Aware Currying
Veröffentlicht: IEEE 21.10.2023Veröffentlicht in 2023 32nd International Conference on Parallel Architectures and Compilation Techniques (PACT) (21.10.2023)“… In near-data computing (NDC), computation is brought into data, as opposed to bringing data to computation. While there is prior work focusing on different NDC …”
Volltext
Tagungsbericht -
13
Scalable, Programmable and Dense: The HammerBlade Open-Source RISC-V Manycore
Veröffentlicht: IEEE 29.06.2024Veröffentlicht in 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) (29.06.2024)“… Existing tiled manycore architectures propose to convert abundant silicon resources into general-purpose parallel processors with unmatched computational density and programmability …”
Volltext
Tagungsbericht -
14
BOLT: Optimizing OpenMP Parallel Regions with User-Level Threads
ISSN: 2641-7936Veröffentlicht: IEEE 01.09.2019Veröffentlicht in Proceedings / International Conference on Parallel Architectures and Compilation Techniques (01.09.2019)“… As a result, multiple levels of the software stack use OpenMP independently of one another, often leading to nested parallel regions …”
Volltext
Tagungsbericht -
15
A Finer-Grained Blocking Analysis for Parallel Real-Time Tasks with Spin-Locks
Veröffentlicht: IEEE 05.12.2021Veröffentlicht in 2021 58th ACM/IEEE Design Automation Conference (DAC) (05.12.2021)“… Real-time synchronization is one of the essential theories in real-time systems, and the recent booming of parallel real-time tasks has brought new challenges to the synchronization analysis …”
Volltext
Tagungsbericht -
16
HiSpTRSV: Exploring Tile-Level Parallelism for SpTRSV Acceleration on FPGAs
Veröffentlicht: IEEE 22.06.2025Veröffentlicht in 2025 62nd ACM/IEEE Design Automation Conference (DAC) (22.06.2025)“… HiSpTRSV addresses these challenges through dependency graph parsing, tile-based highly parallel algorithm, filtering mechanisms, and bidirectional matching with modular indexing …”
Volltext
Tagungsbericht -
17
One Automaton to Rule Them All: Beyond Multiple Regular Expressions Execution
ISSN: 2643-2838Veröffentlicht: IEEE 02.03.2024Veröffentlicht in Proceedings / International Symposium on Code Generation and Optimization (02.03.2024)“… Regular Expressions (REs) matching is crucial to identify strings exhibiting certain morphological properties in a data stream, resulting paramount in contexts …”
Volltext
Tagungsbericht -
18
A Framework for Fine-Grained Synchronization of Dependent GPU Kernels
ISSN: 2643-2838Veröffentlicht: IEEE 02.03.2024Veröffentlicht in Proceedings / International Symposium on Code Generation and Optimization (02.03.2024)“… Machine Learning (ML) models execute several parallel computations including Generalized Matrix Multiplication, Convolution, Dropout, etc …”
Volltext
Tagungsbericht -
19
Synergically Rebalancing Parallel Execution via DCT and Turbo Boosting
Veröffentlicht: IEEE 05.12.2021Veröffentlicht in 2021 58th ACM/IEEE Design Automation Conference (DAC) (05.12.2021)“… Many dynamic concurrency throttling (DCT) techniques have successfully used to tune the number of executing threads to better balance a parallel application according to its available scalability …”
Volltext
Tagungsbericht -
20
CELLO: Compiler-Assisted Efficient Load-Load Ordering in Data-Race-Free Regions
Veröffentlicht: IEEE 21.10.2023Veröffentlicht in 2023 32nd International Conference on Parallel Architectures and Compilation Techniques (PACT) (21.10.2023)“… ) property imposed by modern programming languages. To leverage this observation …”
Volltext
Tagungsbericht