Suchergebnisse - Computing methodologies → Linear algebra algorithms

1

Wird geladen …

Pipirima: Predicting Patterns in Sparsity to Accelerate Matrix Algebra von Bakhtiar, Ubaid, Joo, Donghyeon, Asgari, Bahar

Veröffentlicht: IEEE 22.06.2025

Veröffentlicht in 2025 62nd ACM/IEEE Design Automation Conference (DAC) (22.06.2025)
“… While sparsity, a feature of data in many applications, provides optimization opportunities such as reducing unnecessary computations, data transfers, and …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
2

Wird geladen …

ReSMiPS: A ReRAM-based Sparse Mixed-precision Solver with Fast Matrix Reordering Algorithm von Fu, Yuyang, Li, Jiancong, Chen, Jia, Zhou, Zhiwei, Zhou, Houji, Peng, Wenlong, Li, Yi, Miao, Xiangshui

Veröffentlicht: IEEE 22.06.2025

Veröffentlicht in 2025 62nd ACM/IEEE Design Automation Conference (DAC) (22.06.2025)
“… The solution of sparse matrix equations is essential in scientific computing. However, traditional solvers on digital computing platforms are limited by memory bottlenecks in largescale sparse matrix storage and computation …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
3

Wird geladen …

SpV8: Pursuing Optimal Vectorization and Regular Computation Pattern in SpMV von Li, Chenyang, Xia, Tian, Zhao, Wenzhe, Zheng, Nanning, Ren, Pengju

Veröffentlicht: IEEE 05.12.2021

Veröffentlicht in 2021 58th ACM/IEEE Design Automation Conference (DAC) (05.12.2021)
“… We evaluate SpV8 on Intel Xeon CPU and compare with multiple state-of-art SpMV algorithms using 71 sparse matrices …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
4

Wird geladen …

Harnessing GPU Tensor Cores for Fast FP16 Arithmetic to Speed up Mixed-Precision Iterative Refinement Solvers von Haidar, Azzam, Tomov, Stanimire, Dongarra, Jack, Higham, Nicholas J.

Veröffentlicht: IEEE 01.11.2018

Veröffentlicht in SC18: International Conference for High Performance Computing, Networking, Storage and Analysis (01.11.2018)
“… Low-precision floating-point arithmetic is a powerful tool for accelerating scientific computing applications, especially those in artificial intelligence …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
5

Wird geladen …

FSPA: An FeFET-based Sparse Matrix-Dense Vector Multiplication Accelerator von Zhang, Xiaoyu, Li, Zerun, Liu, Rui, Chen, Xiaoming, Han, Yinhe

Veröffentlicht: IEEE 09.07.2023

Veröffentlicht in 2023 60th ACM/IEEE Design Automation Conference (DAC) (09.07.2023)
“… The performance of traditional SpMV accelerators is bounded by memory. In-memory computing (IMC) is a promising technique to alleviate the memory bottleneck …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
6

Wird geladen …

On the Parallel I/O Optimality of Linear Algebra Kernels: Near-Optimal Matrix Factorizations von Kwasniewski, Grzegorz, Kabic, Marko, Ben-Nun, Tal, Ziogas, Alexandros Nikolaos, Saethre, Jens Eirik, Gaillard, Andre, Schneider, Timo, Besta, Maciej, Kozhevnikov, Anton, VandeVondele, Joost, Hoefler, Torsten

ISSN: 2167-4337

Veröffentlicht: ACM 14.11.2021

Veröffentlicht in SC21: International Conference for High Performance Computing, Networking, Storage and Analysis (14.11.2021)
“… We first establish a theoretical framework for deriving parallel I/O lower bounds for linear algebra kernels, and then utilize its insights to derive Cholesky and LU schedules, both communicating N^{3}/(P\sqrt{M …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
7

Wird geladen …

Solvability of Matrix-Exponential Equations von Ouaknine, Joel, Pouly, Amaury, Sousa-Pinto, Joao, Worrell, James

ISBN: 9781450343916, 1450343910

Veröffentlicht: New York, NY, USA ACM 05.07.2016

Veröffentlicht in Proceedings of the 31st Annual ACM/IEEE Symposium on Logic in Computer Science (05.07.2016)
“… Our results have applications to reachability problems for linear hybrid automata. Our decidability proof relies on a number of theorems from algebraic and transcendental …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
8

Wird geladen …

Many-Body Electronic Correlation Energy using Krylov Subspace Linear Solvers von Shah, Shikhar, Zhang, Boqin, Huang, Hua, Pask, John E., Suryanarayana, Phanish, Chow, Edmond

Veröffentlicht: IEEE 17.11.2024

Veröffentlicht in SC24: International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“… This paper presents the formulation and implementation of a high performance algorithm to compute the many-body electronic correlation energy via the random-phase approximation within density functional theory …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
9

Wird geladen …

An approximate computing technique for reducing the complexity of a direct-solver for sparse linear systems in real-time video processing von Schaffner, Michael, Gurkaynak, Frank K., Smolic, Aljosa, Kaeslin, Hubert, Benini, Luca

ISSN: 0738-100X

Veröffentlicht: IEEE 01.06.2014

Veröffentlicht in Proceedings - ACM IEEE Design Automation Conference (01.06.2014)
“… Many video processing algorithms are formulated as least-squares problems that result in large, sparse linear systems …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
10

Wird geladen …

Steady-state dynamic temperature analysis and reliability optimization for embedded multiprocessor systems von Ukhov, Ivan, Bao, Min, Eles, Petru, Peng, Zebo

ISBN: 1450311997, 9781450311991

ISSN: 0738-100X

Veröffentlicht: New York, NY, USA ACM 03.06.2012

Veröffentlicht in DAC Design Automation Conference 2012 (03.06.2012)
“… In this paper we propose an analytical technique for the steady-state dynamic temperature analysis (SSDTA) of multiprocessor systems with periodic …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
11

Wird geladen …

Hello SME! Generating Fast Matrix Multiplication Kernels Using the Scalable Matrix Extension von Remke, Stefan, Breuer, Alexander

Veröffentlicht: IEEE 17.11.2024

Veröffentlicht in SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“… Modern central processing units (CPUs) feature single-instruction, multiple-data pipelines to accelerate compute-intensive floating-point and fixed-point …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
12

Wird geladen …

Boosting Graph Spectral Sparsification via Parallel Sparse Approximate Inverse of Cholesky Factor von Chen, Baiyu, Liu, Zhiqiang, Zhang, Yibin, Yu, Wenjian

ISSN: 2153-697X

Veröffentlicht: IEEE 22.01.2024

Veröffentlicht in Proceedings of the ASP-DAC ... Asia and South Pacific Design Automation Conference (22.01.2024)
“… With the advance of very-large-scale-integrated (VLSI) systems, fast and efficient algorithms for solving equations of Laplacian matrices are increasingly significant …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
13

Wird geladen …

GPU Accelerated Sparse Cholesky Factorization von Karsavuran, M. Ozan, Ng, Esmond G., Peyton, Barry W.

Veröffentlicht: IEEE 17.11.2024

Veröffentlicht in SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“… The solution of sparse symmetric positive definite linear systems is an important computational kernel in large-scale scientific and engineering modeling and simulation …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
14

Wird geladen …

High-Performance Eigensolver Combining EigenExa and Iterative Refinement von Uchino, Yuki, Imamura, Toshiyuki

Veröffentlicht: IEEE 17.11.2024

Veröffentlicht in SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“… Eigenvalue decomposition is ubiquitous in simulations. Various eigensolvers for computing approximations have been developed thus far …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
15

Wird geladen …

Implementing sparse matrix-vector multiplication on throughput-oriented processors von Bell, Nathan, Garland, Michael

ISBN: 1605587443, 9781605587448

ISSN: 2167-4329

Veröffentlicht: New York, NY, USA ACM 14.11.2009

Veröffentlicht in Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis (14.11.2009)
“… Sparse matrix-vector multiplication (SpMV) is of singular importance in sparse linear algebra …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
16

Wird geladen …

COPA: Constrained PARAFAC2 for Sparse & Large Datasets von Afshar, Ardavan, Perros, Ioakeim, Papalexakis, Evangelos E, Searles, Elizabeth, Ho, Joyce, Sun, Jimeng

ISSN: 2155-0751

Veröffentlicht: United States 01.10.2018

Veröffentlicht in Proceedings of the ... ACM International Conference on Information & Knowledge Management (01.10.2018)
“… PARAFAC2 has demonstrated success in modeling irregular tensors, where the tensor dimensions vary across one of the modes. An example scenario is modeling …”

Weitere Angaben

Journal Article

Zu den Favoriten

Gespeichert in:
17

Wird geladen …

Scaling lattice QCD beyond 100 GPUs von Babich, R., Clark, M. A., Joó, B., Shi, G., Brower, R. C., Gottlieb, S.

ISBN: 145030771X, 9781450307710

ISSN: 2167-4329

Veröffentlicht: New York, NY, USA ACM 12.11.2011

Veröffentlicht in 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC) (12.11.2011)
“… Over the past five years, graphics processing units (GPUs) have had a transformational effect on numerical lattice quantum chromodynamics (LQCD) calculations …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
18

Wird geladen …

Geometry-oblivious FMM for compressing dense SPD matrices von Yu, Chenhan D., Levitt, James, Reiz, Severin, Biros, George

ISBN: 9781450351140, 145035114X

ISSN: 2167-4337

Veröffentlicht: New York, NY, USA ACM 12.11.2017

Veröffentlicht in International Conference for High Performance Computing, Networking, Storage and Analysis (Online) (12.11.2017)
“… We present GOFMM (geometry-oblivious FMM), a novel method that creates a hierarchical low-rank approximation, or "compression," of an arbitrary dense symmetric …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
19

Wird geladen …

On monte carlo hybrid methods for linear algebra von Dávila, Diego, Alexandrov, Vassil, Esquivel-Flores, Oscar A.

ISBN: 1509052224, 9781509052226

Veröffentlicht: Piscataway, NJ, USA IEEE Press 13.11.2016

Veröffentlicht in Proceedings of the 7th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems (13.11.2016)
“… This paper presents an enhanced hybrid (e.g. stochastic/ deterministic) method for Linear Algebra based on bulding an efficient stochastic preconditioner and then solving the corresponding System of Linear Algebraic Equations (SLAE …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
20

Wird geladen …

Combining Models and Guided Empirical Search to Optimize for Multiple Levels of the Memory Hierarchy von Chen, Chun, Chame, Jacqueline, Hall, Mary

ISBN: 9780769522982, 076952298X

Veröffentlicht: Washington, DC, USA IEEE Computer Society 20.03.2005

Veröffentlicht in International Symposium on Code Generation and Optimization : CGO 2005 : 20-23 March, 2005 : San Jose, California (20.03.2005)
“… This paper describes an algorithm for simultaneously optimizing across multiple levels of the memory hierarchy for dense-matrix computations …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:

Suchergebnisse - Computing methodologies → Linear algebra algorithms

Pipirima: Predicting Patterns in Sparsity to Accelerate Matrix Algebra von Bakhtiar, Ubaid, Joo, Donghyeon, Asgari, Bahar

ReSMiPS: A ReRAM-based Sparse Mixed-precision Solver with Fast Matrix Reordering Algorithm von Fu, Yuyang, Li, Jiancong, Chen, Jia, Zhou, Zhiwei, Zhou, Houji, Peng, Wenlong, Li, Yi, Miao, Xiangshui

SpV8: Pursuing Optimal Vectorization and Regular Computation Pattern in SpMV von Li, Chenyang, Xia, Tian, Zhao, Wenzhe, Zheng, Nanning, Ren, Pengju

Harnessing GPU Tensor Cores for Fast FP16 Arithmetic to Speed up Mixed-Precision Iterative Refinement Solvers von Haidar, Azzam, Tomov, Stanimire, Dongarra, Jack, Higham, Nicholas J.

FSPA: An FeFET-based Sparse Matrix-Dense Vector Multiplication Accelerator von Zhang, Xiaoyu, Li, Zerun, Liu, Rui, Chen, Xiaoming, Han, Yinhe

Solvability of Matrix-Exponential Equations von Ouaknine, Joel, Pouly, Amaury, Sousa-Pinto, Joao, Worrell, James

Many-Body Electronic Correlation Energy using Krylov Subspace Linear Solvers von Shah, Shikhar, Zhang, Boqin, Huang, Hua, Pask, John E., Suryanarayana, Phanish, Chow, Edmond

An approximate computing technique for reducing the complexity of a direct-solver for sparse linear systems in real-time video processing von Schaffner, Michael, Gurkaynak, Frank K., Smolic, Aljosa, Kaeslin, Hubert, Benini, Luca

Steady-state dynamic temperature analysis and reliability optimization for embedded multiprocessor systems von Ukhov, Ivan, Bao, Min, Eles, Petru, Peng, Zebo

Hello SME! Generating Fast Matrix Multiplication Kernels Using the Scalable Matrix Extension von Remke, Stefan, Breuer, Alexander

Boosting Graph Spectral Sparsification via Parallel Sparse Approximate Inverse of Cholesky Factor von Chen, Baiyu, Liu, Zhiqiang, Zhang, Yibin, Yu, Wenjian

GPU Accelerated Sparse Cholesky Factorization von Karsavuran, M. Ozan, Ng, Esmond G., Peyton, Barry W.

High-Performance Eigensolver Combining EigenExa and Iterative Refinement von Uchino, Yuki, Imamura, Toshiyuki

Implementing sparse matrix-vector multiplication on throughput-oriented processors von Bell, Nathan, Garland, Michael

COPA: Constrained PARAFAC2 for Sparse & Large Datasets von Afshar, Ardavan, Perros, Ioakeim, Papalexakis, Evangelos E, Searles, Elizabeth, Ho, Joyce, Sun, Jimeng

Scaling lattice QCD beyond 100 GPUs von Babich, R., Clark, M. A., Joó, B., Shi, G., Brower, R. C., Gottlieb, S.

Geometry-oblivious FMM for compressing dense SPD matrices von Yu, Chenhan D., Levitt, James, Reiz, Severin, Biros, George

On monte carlo hybrid methods for linear algebra von Dávila, Diego, Alexandrov, Vassil, Esquivel-Flores, Oscar A.

Combining Models and Guided Empirical Search to Optimize for Multiple Levels of the Memory Hierarchy von Chen, Chun, Chame, Jacqueline, Hall, Mary

Suchwerkzeuge:

Treffer weiter einschränken

Format

Schlagwortumfeld

Thema

Sprache

Erscheinungsjahr