Výsledky vyhledávání - Shared-memory parallel computing

1

Načítá se…

Computational performance of a smoothed particle hydrodynamics simulation for shared-memory parallel computing Autor Nishiura, Daisuke, Furuichi, Mikito, Sakaguchi, Hide

ISSN: 0010-4655, 1879-2944

Vydáno: Elsevier B.V 01.09.2015

Vydáno v Computer physics communications (01.09.2015)
“…The computational performance of a smoothed particle hydrodynamics (SPH) simulation is investigated for three types of current shared-memory parallel computer devices…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
2

Načítá se…

On Pervasive Shared Memory Parallel Computing Autor Rattan, Ishwar

ISSN: 0026-2005, 2167-8634

Vydáno: Alma Michigan Academy of Science, Arts & Letters 22.06.2021

Vydáno v Michigan academician (22.06.2021)
“…Nowadays one can find parallel computing (PC) capability in laptops, desktops, smart phones and embedded devices in, use today…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
3

Načítá se…

Adaptive parallel applications: from shared memory architectures to fog computing (2002–2022) Autor Galante, Guilherme, da Rosa Righi, Rodrigo

ISSN: 1386-7857, 1573-7543

Vydáno: New York Springer US 01.12.2022

Vydáno v Cluster computing (01.12.2022)
“…The evolution of parallel architectures points to dynamic environments where the number of available resources or configurations may vary during the execution of applications…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
4

Načítá se…

Max-PIM: Fast and Efficient Max/Min Searching in DRAM Autor Zhang, Fan, Angizi, Shaahin, Fan, Deliang

Vydáno: IEEE 05.12.2021

Vydáno v 2021 58th ACM/IEEE Design Automation Conference (DAC) (05.12.2021)
“…Recently, in-DRAM computing is becoming one promising technique to address the notorious 'memory-wall' issue for big data processing…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
5

Načítá se…

Chasing Away RAts: Semantics and evaluation for relaxed atomics on heterogeneous systems Autor Sinclair, Matthew D., Alsop, Johnathan, Adve, Sarita V.

Vydáno: ACM 01.06.2017

Vydáno v 2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA) (01.06.2017)
“…An unambiguous and easy-to-understand memory consistency model is crucial for ensuring correct synchronization and guiding future design of heterogeneous…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
6

Načítá se…

Reverse-Mode Automatic Differentiation and Optimization of GPU Kernels via Enzyme Autor Moses, William S., Churavy, Valentin, Paehler, Ludger, Huckelheim, Jan, Narayanan, Sri Hari Krishna, Schanen, Michel, Doerfert, Johannes

ISSN: 2167-4337

Vydáno: ACM 14.11.2021

Vydáno v SC21: International Conference for High Performance Computing, Networking, Storage and Analysis (14.11.2021)
“…Computing derivatives is key to many algorithms in scientific computing and machine learning such as optimization, uncertainty quantification, and stability analysis…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
7

Načítá se…

SLATE: Design of a Modern Distributed and Accelerated Linear Algebra Library Autor Gates, Mark, Kurzak, Jakub, Charara, Ali, YarKhan, Asim, Dongarra, Jack

ISSN: 2167-4337

Vydáno: ACM 17.11.2019

Vydáno v SC19: International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2019)
“… SLATE will provide coverage of existing ScaLAPACK functionality, including the parallel BLAS…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
8

Načítá se…

A Parallel Computing Method for the Computation of the Moore-Penrose Generalized Inverse for Shared-Memory Architectures Autor Gelvez-Almeida, Elkin, Barrientos, Ricardo J., Vilches-Ponce, Karina, Mora, Marco

ISSN: 2169-3536, 2169-3536

Vydáno: Piscataway IEEE 01.01.2023

Vydáno v IEEE access (01.01.2023)
“… In this paper, we propose a parallel computing method for the computation of the Moore-Penrose generalized inverse of large-size full-rank rectangular matrices…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
9

Načítá se…

Domain decomposition with discrete element simulations using shared-memory parallel computing for railways applications Autor Hoang, T.M.P., Saussine, G., Dureisseix, D., Alart, P.

ISSN: 1779-7179, 1958-5829

Vydáno: Routledge 01.01.2012

Vydáno v European journal of computational mechanics (01.01.2012)
“… To reduce computational costs, we study the use of two strategies: domain decomposition methods and shared-memory parallelisation with OpenMP…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
10

Načítá se…

DASP: Specific Dense Matrix Multiply-Accumulate Units Accelerated General Sparse Matrix-Vector Multiplication Autor Lu, Yuechen, Liu, Weifeng

ISSN: 2167-4337

Vydáno: ACM 11.11.2023

Vydáno v International Conference for High Performance Computing, Networking, Storage and Analysis (Online) (11.11.2023)
“…Sparse matrix-vector multiplication (SpMV) plays a key role in computational science and engineering, graph processing, and machine learning applications. Much…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
11

Načítá se…

Bringing Order to Sparsity: A Sparse Matrix Reordering Study on Multicore CPUs Autor Trotter, James D., Ekmekcibasi, Sinan, Langguth, Johannes, Torun, Tugba, Duzakin, Emre, Ilic, Aleksandar, Unat, Didem

ISSN: 2167-4337

Vydáno: ACM 11.11.2023

Vydáno v International Conference for High Performance Computing, Networking, Storage and Analysis (Online) (11.11.2023)
“…Many real-world computations involve sparse data structures in the form of sparse matrices. A common strategy for optimizing sparse matrix operations is to…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
12

Načítá se…

Understanding Priority-Based Scheduling of Graph Algorithms on a Shared-Memory Platform Autor Yesil, Serif, Heidarshenas, Azin, Morrison, Adam, Torrellas, Josep

ISSN: 2167-4337

Vydáno: ACM 17.11.2019

Vydáno v SC19: International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2019)
“… It performs a detailed empirical performance analysis of several advanced CPS designs in a state-of-the-art graph analytics framework running on a large shared-memory server…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
13

Načítá se…

A flexible sparse matrix data format and parallel algorithms for the assembly of finite element matrices on shared memory systems Autor Sky, Adam, Polindara, César, Muench, Ingo, Birk, Carolin

ISSN: 0167-8191

Vydáno: 01.09.2023

Vydáno v Parallel computing (01.09.2023)

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
14

Načítá se…

Almost Deterministic Work Stealing Autor Shiina, Shumpei, Taura, Kenjiro

ISSN: 2167-4337

Vydáno: ACM 17.11.2019

Vydáno v SC19: International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2019)
“…With task parallel models, programmers can easily parallelize divide-and-conquer algorithms by using nested fork-join structures…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
15

Načítá se…

NestedMP: Enabling cache-aware thread mapping for nested parallel shared memory applications Autor He, Jiangzhou, Chen, Wenguang, Tang, Zhizhong

ISSN: 0167-8191, 1872-7336

Vydáno: Elsevier B.V 01.01.2016

Vydáno v Parallel computing (01.01.2016)
“…•Task-core mapping schemas for nested-parallel applications may affect performance…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
16

Načítá se…

Novel parallel method for association rule mining on multi-core shared memory systems Autor Vu, Lan, Alaghband, Gita

ISSN: 0167-8191, 1872-7336

Vydáno: Elsevier B.V 01.12.2014

Vydáno v Parallel computing (01.12.2014)
“…•ShaFEM: a novel association rule mining method for multi-core shared memory systems…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
17

Načítá se…

STM-Multifrontal QR: Streaming Task Mapping Multifrontal QR Factorization Empowered by GCN Autor Lin, Shengle, Yang, Wangdong, Wang, Haotian, Tsai, Qinyun, Li, Kenli

ISSN: 2167-4337

Vydáno: ACM 14.11.2021

Vydáno v SC21: International Conference for High Performance Computing, Networking, Storage and Analysis (14.11.2021)
“…% higher than that of prior work. Moreover, for numerical factorization, an optimized tasks stream parallel processing strategy is proposed and a more efficient computing task mapping framework for NUMA architecture is adopted…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
18

Načítá se…

Using OpenMP: Portable Shared Memory Parallel Programming Autor Chapman, Barbara, Jost, Gabriele, van der Pas, Ruud

ISBN: 0262533022, 9780262533027

Vydáno: Cambridge, Mass MIT Press 2007

“…OpenMP, a portable programming interface for shared memory parallel computers, was adopted as an informal standard in 1997 by computer scientists who wanted a unified model on which to base programs…”

Získat plný text

E-kniha Kniha

Přidat do oblíbených

Uloženo v:
19

Načítá se…

TSHMEM: Shared-Memory Parallel Computing on Tilera Many-Core Processors Autor Lam, Bryant C., George, Alan D., Lam, Herman

Vydáno: IEEE 01.05.2013

Vydáno v 2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (01.05.2013)
“… Fortunately, a partitioned global address space (PGAS) programming model has demonstrated realizable performance and productivity potential for large parallel computing systems with distributed-memory architectures…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
20

Načítá se…

Implementation of QR and LQ decompositions on shared memory parallel computing systems Autor Egunov, V., Andreev, A.

Vydáno: IEEE 2016

Vydáno v 2016 2nd International Conference on Industrial Engineering, Applications and Manufacturing (ICIEAM) (2016)
“…The paper presents some results of research of applicability of various computing schemes of implementing QR and LQ matrix decompositions for shared memory parallel systems…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:

Výsledky vyhledávání - Shared-memory parallel computing

Computational performance of a smoothed particle hydrodynamics simulation for shared-memory parallel computing Autor Nishiura, Daisuke, Furuichi, Mikito, Sakaguchi, Hide

On Pervasive Shared Memory Parallel Computing Autor Rattan, Ishwar

Adaptive parallel applications: from shared memory architectures to fog computing (2002–2022) Autor Galante, Guilherme, da Rosa Righi, Rodrigo

Max-PIM: Fast and Efficient Max/Min Searching in DRAM Autor Zhang, Fan, Angizi, Shaahin, Fan, Deliang

Chasing Away RAts: Semantics and evaluation for relaxed atomics on heterogeneous systems Autor Sinclair, Matthew D., Alsop, Johnathan, Adve, Sarita V.

Reverse-Mode Automatic Differentiation and Optimization of GPU Kernels via Enzyme Autor Moses, William S., Churavy, Valentin, Paehler, Ludger, Huckelheim, Jan, Narayanan, Sri Hari Krishna, Schanen, Michel, Doerfert, Johannes

SLATE: Design of a Modern Distributed and Accelerated Linear Algebra Library Autor Gates, Mark, Kurzak, Jakub, Charara, Ali, YarKhan, Asim, Dongarra, Jack

A Parallel Computing Method for the Computation of the Moore-Penrose Generalized Inverse for Shared-Memory Architectures Autor Gelvez-Almeida, Elkin, Barrientos, Ricardo J., Vilches-Ponce, Karina, Mora, Marco

Domain decomposition with discrete element simulations using shared-memory parallel computing for railways applications Autor Hoang, T.M.P., Saussine, G., Dureisseix, D., Alart, P.

DASP: Specific Dense Matrix Multiply-Accumulate Units Accelerated General Sparse Matrix-Vector Multiplication Autor Lu, Yuechen, Liu, Weifeng

Bringing Order to Sparsity: A Sparse Matrix Reordering Study on Multicore CPUs Autor Trotter, James D., Ekmekcibasi, Sinan, Langguth, Johannes, Torun, Tugba, Duzakin, Emre, Ilic, Aleksandar, Unat, Didem

Understanding Priority-Based Scheduling of Graph Algorithms on a Shared-Memory Platform Autor Yesil, Serif, Heidarshenas, Azin, Morrison, Adam, Torrellas, Josep

A flexible sparse matrix data format and parallel algorithms for the assembly of finite element matrices on shared memory systems Autor Sky, Adam, Polindara, César, Muench, Ingo, Birk, Carolin

Almost Deterministic Work Stealing Autor Shiina, Shumpei, Taura, Kenjiro

NestedMP: Enabling cache-aware thread mapping for nested parallel shared memory applications Autor He, Jiangzhou, Chen, Wenguang, Tang, Zhizhong

Novel parallel method for association rule mining on multi-core shared memory systems Autor Vu, Lan, Alaghband, Gita

STM-Multifrontal QR: Streaming Task Mapping Multifrontal QR Factorization Empowered by GCN Autor Lin, Shengle, Yang, Wangdong, Wang, Haotian, Tsai, Qinyun, Li, Kenli

Using OpenMP: Portable Shared Memory Parallel Programming Autor Chapman, Barbara, Jost, Gabriele, van der Pas, Ruud

TSHMEM: Shared-Memory Parallel Computing on Tilera Many-Core Processors Autor Lam, Bryant C., George, Alan D., Lam, Herman

Implementation of QR and LQ decompositions on shared memory parallel computing systems Autor Egunov, V., Andreev, A.

Vyhledávací nástroje:

Upřesnit hledání

Médium

Předmětová oblast

Téma

Jazyk

Rok vydání