Search results - parallel amd distributed computing
-
1
Automated parallel execution of distributed task graphs with FPGA clusters
ISSN: 0167-739X
Publication details: Elsevier B.V 01.11.2024
Published in: Future generation computer systems (01.11.2024)
“…Over the years, Field Programmable Gate Arrays (FPGA) have been gaining popularity in the High Performance Computing (HPC…”
Journal Article -
2
PRNGine: Massively Parallel Pseudo-Random Number Generation and Probability Distribution Approximations on AMD AI Engines
ISSN: 2995-066X
Publication details: IEEE 03.06.2025
Published in: 2025 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) (03.06.2025)
“…Generating large volumes of random numbers is essential for high-performance computing applications such as Monte Carlo simulations, machine learning, and dynamic game-play…”
Conference Paper -
3
StreamMR: An Optimized MapReduce Framework for AMD GPUs
ISBN: 1457718758, 9781457718755
ISSN: 1521-9097
Publication details: IEEE 01.12.2011
Published in: 2011 IEEE 17th International Conference on Parallel and Distributed Systems (01.12.2011)
“…MapReduce is a programming model from Google that facilitates parallel processing on a cluster of thousands of commodity computers…”
Conference Paper -
4
Optimization and Portability of a Fusion OpenACC-based FORTRAN HPC Code from NVIDIA to AMD GPUs
ISSN: 2331-8422
Publication details: Ithaca Cornell University Library, arXiv.org 17.05.2023
Published in: arXiv.org (17.05.2023)
“… Recent exascale HPC systems are, however, introducing GPUs from other vendors, e.g. with the AMD GPU-based OLCF Frontier system just becoming available…”
Paper -
5
A method for decompilation of AMD GCN kernels to OpenCL
ISSN: 2331-8422
Publication details: Ithaca Cornell University Library, arXiv.org 16.07.2021
Published in: arXiv.org (16.07.2021)
“… They are available for many hardware architectures and programming languages. However, none of the existing decompilers support modern AMD GPU architectures such as AMD GCN and RDNA. Purpose…”
Paper -
6
Efficient and Distributed Computation of Electron Repulsion Integrals on AMD AI Engines
ISSN: 2576-2621
Publication details: IEEE 04.05.2025
Published in: Proceedings ... Annual IEEE Symposium on Field-Programmable Custom Computing Machines (Online) (04.05.2025)
“…Computing electron repulsion integrals (ERIs) is the major computational bottleneck of many quantum mechanical simulation methods, requiring trillions of ERI evaluations per time step…”
Conference Paper -
7
Distributed computation of the critical path from execution traces
ISSN: 0038-0644, 1097-024X
Publication details: Bognor Regis Wiley Subscription Services, Inc 01.08.2023
Published in: Software, practice & experience (01.08.2023)
“…Due to the ever‐increasing number of computer nodes in distributed systems, efficient and effective tools have become crucial for their analysis…”
Journal Article -
8
Performance portable Vlasov code with C++ parallel algorithm
ISSN: 2831-3909
Publication details: IEEE 01.11.2022
Published in: IEEE/ACM International Workshop on Performance, Portability and Productivity in HPC (Online) (01.11.2022)
“… parallel algorithm to run across multiple CPUs and GPUs. Relying on the language standard parallelism stdpar and proposed language standard multi-dimensional array…”
Conference Paper -
9
GPU-Accelerated Tree-Search in Chapel Versus CUDA and HIP
Publication details: IEEE 27.05.2024
Published in: 2024 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) (27.05.2024)
“…In the context of exascale programming, the PGAS-based Chapel is among the rare languages targeting the holistic handling of high-performance computing issues including the productivity-aware…”
Conference Paper -
10
TaPaSCo-AIE: An Open-Source Framework for Streaming-Based Heterogeneous Acceleration Using AMD AI Engines
Publication details: IEEE 27.05.2024
Published in: 2024 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) (27.05.2024)
“…AMD AI Engines (AIEs) extend the design space and open up new options for coarse-grained processing in re-configurable accelerators…”
Conference Paper -
11
On the performance of a highly-scalable Computational Fluid Dynamics code on AMD, ARM and Intel processors
ISSN: 2331-8422
Publication details: Ithaca Cornell University Library, arXiv.org 12.10.2020
Published in: arXiv.org (12.10.2020)
“…No area of computing is hungrier for performance than High Performance Computing (HPC…”
Paper -
12
A Performance Model for GPUs with Caches
ISSN: 1045-9219, 1558-2183
Publication details: New York IEEE 01.07.2015
Published in: IEEE transactions on parallel and distributed systems (01.07.2015)
“…To exploit the abundant computational power of the world's fastest supercomputers, an even workload distribution to the typically heterogeneous compute devices…”
Journal Article -
13
BCSR on GPU: A Way Forward Extreme-scale Graph Processing on Accelerator-enabled Frontier Supercomputer
Publication details: IEEE 17.11.2024
Published in: SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“…Handling large graphs in a distributed environment requires effective partitioning across processors and efficient management of local partitions…”
Conference Paper -
14
Dissecting the Software-Based Measurement of CPU Energy Consumption: A Comparative Analysis
ISSN: 1045-9219, 1558-2183
Publication details: IEEE 01.01.2025
Published in: IEEE transactions on parallel and distributed systems (01.01.2025)
“… (and more) without the need for additional hardware. Since 2017, it is available on most x86 processors, including AMD processors…”
Journal Article -
15
Cloud Colonography: Distributed Medical Testbed over Cloud
ISSN: 2168-7161, 2372-0018
Publication details: Piscataway IEEE Computer Society 01.04.2020
Published in: IEEE transactions on cloud computing (01.04.2020)
“… The proposed AMD has the potential to play a role of the core classifier in the cloud computing framework…”
Journal Article -
16
Integer Sum Reduction with OpenMP on an AMD MI100 GPU
ISBN: 9781665497480
Publication details: IEEE 01.05.2022
Published in: 2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) (01.05.2022)
“…Sum reduction is a primitive operation in parallel computing. Device offload support allows a user to use OpenMP directives to take advantage of a highly capable GPU…”
Conference Paper -
17
A Fine-grained Prefetching Scheme for DGEMM Kernels on GPU with Auto-tuning Compatibility
ISSN: 1530-2075
Publication details: IEEE 01.05.2022
Published in: Proceedings - IEEE International Parallel and Distributed Processing Symposium (01.05.2022)
“…General Matrix Multiplication (GEMM) is one of the fundamental kernels for scientific and high-performance computing…”
Conference Paper -
18
Exploiting Memory Access Patterns to Improve Memory Performance in Data-Parallel Architectures
ISSN: 1045-9219, 1558-2183
Publication details: New York IEEE 01.01.2011
Published in: IEEE transactions on parallel and distributed systems (01.01.2011)
“…The introduction of General-Purpose computation on GPUs (GPGPUs) has changed the landscape for the future of parallel computing…”
Journal Article -
19
Parallel breadth-first search on distributed memory systems
ISBN: 145030771X, 9781450307710
ISSN: 2167-4329
Publication details: New York, NY, USA ACM 12.11.2011
Published in: 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC) (12.11.2011)
“…Data-intensive, graph-based computations are pervasive in several scientific applications, and are known to be quite challenging to implement on distributed memory systems…”
Conference Paper -
20
Multi-BSP vs. BSP: A Case of Study for Dell AMD Multicores
ISSN: 2377-5750
Publication details: IEEE 01.03.2018
Published in: Proceedings - Euromicro Workshop on Parallel and Distributed Processing (01.03.2018)
“… The Bulk-Synchronous Parallel (BSP) is a well-known computing model originally devised for distributed algorithms running on clusters of single-core processors…”
Conference Paper

