Search Results - Shared-memory parallel computing
1
Computational performance of a smoothed particle hydrodynamics simulation for shared-memory parallel computing
ISSN: 0010-4655, 1879-2944
Published: Elsevier B.V 01.09.2015
Published in Computer physics communications (01.09.2015)
“…The computational performance of a smoothed particle hydrodynamics (SPH) simulation is investigated for three types of current shared-memory parallel computer devices…”
Journal Article
2
On Pervasive Shared Memory Parallel Computing
ISSN: 0026-2005, 2167-8634
Published: Alma Michigan Academy of Science, Arts & Letters 22.06.2021
Published in Michigan academician (22.06.2021)
“…Nowadays one can find parallel computing (PC) capability in laptops, desktops, smart phones and embedded devices in use today…”
Journal Article
3
Adaptive parallel applications: from shared memory architectures to fog computing (2002–2022)
ISSN: 1386-7857, 1573-7543
Published: New York Springer US 01.12.2022
Published in Cluster computing (01.12.2022)
“…The evolution of parallel architectures points to dynamic environments where the number of available resources or configurations may vary during the execution of applications…”
Journal Article
4
Max-PIM: Fast and Efficient Max/Min Searching in DRAM
Published: IEEE 05.12.2021
Published in 2021 58th ACM/IEEE Design Automation Conference (DAC) (05.12.2021)
“…Recently, in-DRAM computing is becoming one promising technique to address the notorious 'memory-wall' issue for big data processing…”
Conference Proceeding
5
Chasing Away RAts: Semantics and evaluation for relaxed atomics on heterogeneous systems
Published: ACM 01.06.2017
Published in 2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA) (01.06.2017)
“…An unambiguous and easy-to-understand memory consistency model is crucial for ensuring correct synchronization and guiding future design of heterogeneous…”
Conference Proceeding
6
Reverse-Mode Automatic Differentiation and Optimization of GPU Kernels via Enzyme
ISSN: 2167-4337
Published: ACM 14.11.2021
Published in SC21: International Conference for High Performance Computing, Networking, Storage and Analysis (14.11.2021)
“…Computing derivatives is key to many algorithms in scientific computing and machine learning such as optimization, uncertainty quantification, and stability analysis…”
Conference Proceeding
7
SLATE: Design of a Modern Distributed and Accelerated Linear Algebra Library
ISSN: 2167-4337
Published: ACM 17.11.2019
Published in SC19: International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2019)
“… SLATE will provide coverage of existing ScaLAPACK functionality, including the parallel BLAS…”
Conference Proceeding
8
A Parallel Computing Method for the Computation of the Moore-Penrose Generalized Inverse for Shared-Memory Architectures
ISSN: 2169-3536
Published: Piscataway IEEE 01.01.2023
Published in IEEE access (01.01.2023)
“… In this paper, we propose a parallel computing method for the computation of the Moore-Penrose generalized inverse of large-size full-rank rectangular matrices…”
Journal Article
9
Domain decomposition with discrete element simulations using shared-memory parallel computing for railways applications
ISSN: 1779-7179, 1958-5829
Published: Routledge 01.01.2012
Published in European journal of computational mechanics (01.01.2012)
“… To reduce computational costs, we study the use of two strategies: domain decomposition methods and shared-memory parallelisation with OpenMP…”
Journal Article
10
DASP: Specific Dense Matrix Multiply-Accumulate Units Accelerated General Sparse Matrix-Vector Multiplication
ISSN: 2167-4337
Published: ACM 11.11.2023
Published in International Conference for High Performance Computing, Networking, Storage and Analysis (Online) (11.11.2023)
“…Sparse matrix-vector multiplication (SpMV) plays a key role in computational science and engineering, graph processing, and machine learning applications. Much…”
Conference Proceeding
11
Bringing Order to Sparsity: A Sparse Matrix Reordering Study on Multicore CPUs
ISSN: 2167-4337
Published: ACM 11.11.2023
Published in International Conference for High Performance Computing, Networking, Storage and Analysis (Online) (11.11.2023)
“…Many real-world computations involve sparse data structures in the form of sparse matrices. A common strategy for optimizing sparse matrix operations is to…”
Conference Proceeding
12
Understanding Priority-Based Scheduling of Graph Algorithms on a Shared-Memory Platform
ISSN: 2167-4337
Published: ACM 17.11.2019
Published in SC19: International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2019)
“… It performs a detailed empirical performance analysis of several advanced CPS designs in a state-of-the-art graph analytics framework running on a large shared-memory server…”
Conference Proceeding
13
Almost Deterministic Work Stealing
ISSN: 2167-4337
Published: ACM 17.11.2019
Published in SC19: International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2019)
“…With task parallel models, programmers can easily parallelize divide-and-conquer algorithms by using nested fork-join structures…”
Conference Proceeding
14
NestedMP: Enabling cache-aware thread mapping for nested parallel shared memory applications
ISSN: 0167-8191, 1872-7336
Published: Elsevier B.V 01.01.2016
Published in Parallel computing (01.01.2016)
“…•Task-core mapping schemas for nested-parallel applications may affect performance…”
Journal Article
15
Novel parallel method for association rule mining on multi-core shared memory systems
ISSN: 0167-8191, 1872-7336
Published: Elsevier B.V 01.12.2014
Published in Parallel computing (01.12.2014)
“…•ShaFEM: a novel association rule mining method for multi-core shared memory systems…”
Journal Article
16
STM-Multifrontal QR: Streaming Task Mapping Multifrontal QR Factorization Empowered by GCN
ISSN: 2167-4337
Published: ACM 14.11.2021
Published in SC21: International Conference for High Performance Computing, Networking, Storage and Analysis (14.11.2021)
“…% higher than that of prior work. Moreover, for numerical factorization, an optimized tasks stream parallel processing strategy is proposed and a more efficient computing task mapping framework for NUMA architecture is adopted…”
Conference Proceeding
17
A flexible sparse matrix data format and parallel algorithms for the assembly of finite element matrices on shared memory systems
ISSN: 0167-8191
Published: 01.09.2023
Published in Parallel computing (01.09.2023)
Journal Article
18
TSHMEM: Shared-Memory Parallel Computing on Tilera Many-Core Processors
Published: IEEE 01.05.2013
Published in 2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (01.05.2013)
“… Fortunately, a partitioned global address space (PGAS) programming model has demonstrated realizable performance and productivity potential for large parallel computing systems with distributed-memory architectures…”
Conference Proceeding
19
Implementation of QR and LQ decompositions on shared memory parallel computing systems
Published: IEEE 2016
Published in 2016 2nd International Conference on Industrial Engineering, Applications and Manufacturing (ICIEAM) (2016)
“…The paper presents some results of research of applicability of various computing schemes of implementing QR and LQ matrix decompositions for shared memory parallel systems…”
Conference Proceeding
20
Concurrent computation of topological watershed on shared memory parallel machines
ISSN: 0167-8191, 1872-7336
Published: Elsevier B.V 01.11.2017
Published in Parallel computing (01.11.2017)
“…•We set up an adequate parallelization approach that guides a parallel watershed computing…”
Journal Article