Suchergebnisse - "Task-Based Programming Models"
-
1
Callback-based completion notification using MPI Continuations
ISSN: 0167-8191, 1872-7336Veröffentlicht: Netherlands Elsevier B.V 01.09.2021Veröffentlicht in Parallel computing (01.09.2021)“… Asynchronous programming models (APM) are gaining more and more traction, allowing applications to expose the available concurrency to a runtime system tasked …”
Volltext
Journal Article -
2
OmpSs@FPGA Framework for High Performance FPGA Computing
ISSN: 0018-9340, 1557-9956Veröffentlicht: New York IEEE 01.12.2021Veröffentlicht in IEEE transactions on computers (01.12.2021)“… This article presents the new features of the OmpSs@FPGA framework. OmpSs is a data-flow programming model that supports task nesting and dependencies to …”
Volltext
Journal Article -
3
Storage-Heterogeneity Aware Task-based Programming Models to Optimize I/O Intensive Applications
ISSN: 1045-9219, 1558-2183Veröffentlicht: New York IEEE 01.12.2022Veröffentlicht in IEEE transactions on parallel and distributed systems (01.12.2022)“… Task-based programming models have enabled the optimized execution of the computation workloads of applications. These programming models can take advantage of …”
Volltext
Journal Article -
4
A Hardware Runtime for Task-Based Programming Models
ISSN: 1045-9219, 1558-2183Veröffentlicht: New York IEEE 01.09.2019Veröffentlicht in IEEE transactions on parallel and distributed systems (01.09.2019)“… Task-based programming models such as OpenMP 5.0 and OmpSs are simple to use and powerful enough to exploit task parallelism of applications over multicore, …”
Volltext
Journal Article -
5
Towards enabling I/O awareness in task-based programming models
ISSN: 0167-739X, 1872-7115Veröffentlicht: Elsevier B.V 01.08.2021Veröffentlicht in Future generation computer systems (01.08.2021)“… Storage systems have not kept the same technology improvement rate as computing systems. As applications produce more and more data, I/O becomes the limiting …”
Volltext
Journal Article -
6
DDS: integrating data analytics transformations in task-based workflows
ISSN: 2732-5121, 2732-5121Veröffentlicht: F1000 Research Ltd 2022Veröffentlicht in Open research Europe (2022)“… High-performance data analytics (HPDA) is a current trend in e-science research that aims to integrate traditional HPC with recent data analytic frameworks …”
Volltext
Journal Article -
7
DDS: integrating data analytics transformations in task-based workflows
ISSN: 2732-5121, 2732-5121Veröffentlicht: London, UK F1000 Research Limited 01.01.2022Veröffentlicht in Open research Europe (01.01.2022)“… High-performance data analytics (HPDA) is a current trend in e-science research that aims to integrate traditional HPC with recent data analytic frameworks …”
Volltext
Journal Article -
8
Reducing Cache Coherence Traffic with a NUMA-Aware Runtime Approach
ISSN: 1045-9219, 1558-2183Veröffentlicht: New York IEEE 01.05.2018Veröffentlicht in IEEE transactions on parallel and distributed systems (01.05.2018)“… Cache Coherent NUMA (ccNUMA) architectures are a widespread paradigm due to the benefits they provide for scaling core count and memory capacity. Also, the …”
Volltext
Journal Article -
9
Architectural Support for Task Dependence Management with Flexible Software Scheduling
ISSN: 2378-203XVeröffentlicht: IEEE 01.02.2018Veröffentlicht in Proceedings - International Symposium on High-Performance Computer Architecture (01.02.2018)“… The growing complexity of multi-core architectures has motivated a wide range of software mechanisms to improve the orchestration of parallel executions. Task …”
Volltext
Tagungsbericht -
10
Runtime-Guided Management of Scratchpad Memories in Multicore Architectures
ISSN: 1089-795XVeröffentlicht: IEEE 01.10.2015Veröffentlicht in 2015 International Conference on Parallel Architecture and Compilation (PACT) (01.10.2015)“… The increasing number of cores and the anticipated level of heterogeneity in upcoming multicore architectures cause important problems in traditional cache …”
Volltext
Tagungsbericht -
11
A Linux Kernel Scheduler Extension for Multi-core Systems
ISSN: 2640-0316Veröffentlicht: IEEE 01.12.2019Veröffentlicht in Proceedings - International Conference on High Performance Computing (01.12.2019)“… The Linux kernel is mostly designed for multi-programed environments, but high-performance applications have other requirements. Such applications are run …”
Volltext
Tagungsbericht -
12
Adaptive Runtime-Assisted Block Prefetching on Chip-Multiprocessors
ISSN: 0885-7458, 1573-7640Veröffentlicht: New York Springer US 01.06.2017Veröffentlicht in International journal of parallel programming (01.06.2017)“… Memory stalls are a significant source of performance degradation in modern processors. Data prefetching is a widely adopted and well studied technique used to …”
Volltext
Journal Article -
13
Enhancing OmpSs-2 Suspendable Tasks by Combining Operating System and User-Level Threads with C++ Coroutines
ISSN: 1530-2075Veröffentlicht: IEEE 03.06.2025Veröffentlicht in Proceedings - IEEE International Parallel and Distributed Processing Symposium (03.06.2025)“… This paper explores three methods for implementing suspendable tasks within task-based programming models: OS threads (pthreads), User-Level Threads (ULTs), …”
Volltext
Tagungsbericht -
14
Boosting Earth System Model Outputs And Saving PetaBytes in Their Storage Using Exascale Climate Emulators
Veröffentlicht: IEEE 17.11.2024Veröffentlicht in SC24: International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)“… We present the design and scalable implementation of an exascale climate emulator for addressing the escalating computational and storage requirements of …”
Volltext
Tagungsbericht -
15
Reducing cache coherence traffic with hierarchical directory cache and NUMA-aware runtime scheduling
Veröffentlicht: ACM 01.09.2016Veröffentlicht in 2016 International Conference on Parallel Architecture and Compilation Techniques (PACT) (01.09.2016)“… Cache Coherent NUMA (ccNUMA) architectures are a widespread paradigm due to the benefits they provide for scaling core count and memory capacity. Also, the …”
Volltext
Tagungsbericht -
16
On the Application Task Granularity and the Interplay with the Scheduling Overhead in Many-Core Shared Memory Systems
ISSN: 1552-5244Veröffentlicht: IEEE 01.09.2015Veröffentlicht in Proceedings / IEEE International Conference on Cluster Computing (01.09.2015)“… Task-based programming models are considered one of the most promising programming model approaches for exascale supercomputers because of their ability to …”
Volltext
Tagungsbericht -
17
Available Task-Level Parallelism on the Cell BE
ISSN: 1058-9244, 1875-919XVeröffentlicht: 2009Veröffentlicht in Scientific programming (2009)“… There is a clear industrial trend towards chip multiprocessors (CMP) as the most power efficient way of further increasing performance. Heterogeneous CMP …”
Volltext
Journal Article -
18
An Integrated MPI and OpenMP Approach for Plasma Dynamics Simulations
Veröffentlicht: IEEE 22.11.2024Veröffentlicht in 2024 International Conference on Computing, Semiconductor, Mechatronics, Intelligent Systems and Communications (COSMIC) (22.11.2024)“… Plasma dynamics is the behavior exhibited by two or more charged species with respect to electric or magnetic fields. In high-performance computing (HPC) …”
Volltext
Tagungsbericht -
19
Reshaping Geostatistical Modeling and Prediction for Extreme-Scale Environmental Applications
ISSN: 2167-4337Veröffentlicht: IEEE 01.01.2022Veröffentlicht in International Conference for High Performance Computing, Networking, Storage and Analysis (Online) (01.01.2022)“… We extend the capability of space-time geostatistical modeling using algebraic approximations, illustrating application-expected accuracy worthy of double …”
Volltext
Tagungsbericht -
20
Evaluating Data Redistribution in PaRSEC
ISSN: 1045-9219, 1558-2183Veröffentlicht: New York IEEE 01.08.2022Veröffentlicht in IEEE transactions on parallel and distributed systems (01.08.2022)“… Data redistribution aims to reshuffle data to optimize some objective for an algorithm. The objective can be multi-dimensional, such as improving computational …”
Volltext
Journal Article