Search Results - Computing methodologies → Linear algebra algorithms
-
1
Pipirima: Predicting Patterns in Sparsity to Accelerate Matrix Algebra
Published: IEEE 22.06.2025Published in 2025 62nd ACM/IEEE Design Automation Conference (DAC) (22.06.2025)“…While sparsity, a feature of data in many applications, provides optimization opportunities such as reducing unnecessary computations, data transfers, and…”
Get full text
Conference Proceeding -
2
ReSMiPS: A ReRAM-based Sparse Mixed-precision Solver with Fast Matrix Reordering Algorithm
Published: IEEE 22.06.2025Published in 2025 62nd ACM/IEEE Design Automation Conference (DAC) (22.06.2025)“…The solution of sparse matrix equations is essential in scientific computing. However, traditional solvers on digital computing platforms are limited by memory bottlenecks in largescale sparse matrix storage and computation…”
Get full text
Conference Proceeding -
3
SpV8: Pursuing Optimal Vectorization and Regular Computation Pattern in SpMV
Published: IEEE 05.12.2021Published in 2021 58th ACM/IEEE Design Automation Conference (DAC) (05.12.2021)“… We evaluate SpV8 on Intel Xeon CPU and compare with multiple state-of-art SpMV algorithms using 71 sparse matrices…”
Get full text
Conference Proceeding -
4
FSPA: An FeFET-based Sparse Matrix-Dense Vector Multiplication Accelerator
Published: IEEE 09.07.2023Published in 2023 60th ACM/IEEE Design Automation Conference (DAC) (09.07.2023)“… The performance of traditional SpMV accelerators is bounded by memory. In-memory computing (IMC) is a promising technique to alleviate the memory bottleneck…”
Get full text
Conference Proceeding -
5
Harnessing GPU Tensor Cores for Fast FP16 Arithmetic to Speed up Mixed-Precision Iterative Refinement Solvers
Published: IEEE 01.11.2018Published in SC18: International Conference for High Performance Computing, Networking, Storage and Analysis (01.11.2018)“…Low-precision floating-point arithmetic is a powerful tool for accelerating scientific computing applications, especially those in artificial intelligence…”
Get full text
Conference Proceeding -
6
On the Parallel I/O Optimality of Linear Algebra Kernels: Near-Optimal Matrix Factorizations
ISSN: 2167-4337Published: ACM 14.11.2021Published in SC21: International Conference for High Performance Computing, Networking, Storage and Analysis (14.11.2021)“… We first establish a theoretical framework for deriving parallel I/O lower bounds for linear algebra kernels, and then utilize its insights to derive Cholesky and LU schedules, both communicating N^{3}/(P\sqrt{M…”
Get full text
Conference Proceeding -
7
Solvability of Matrix-Exponential Equations
ISBN: 9781450343916, 1450343910Published: New York, NY, USA ACM 05.07.2016Published in Proceedings of the 31st Annual ACM/IEEE Symposium on Logic in Computer Science (05.07.2016)“… Our results have applications to reachability problems for linear hybrid automata. Our decidability proof relies on a number of theorems from algebraic and transcendental…”
Get full text
Conference Proceeding -
8
Many-Body Electronic Correlation Energy using Krylov Subspace Linear Solvers
Published: IEEE 17.11.2024Published in SC24: International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)“…This paper presents the formulation and implementation of a high performance algorithm to compute the many-body electronic correlation energy via the random-phase approximation within density functional theory…”
Get full text
Conference Proceeding -
9
An approximate computing technique for reducing the complexity of a direct-solver for sparse linear systems in real-time video processing
ISSN: 0738-100XPublished: IEEE 01.06.2014Published in Proceedings - ACM IEEE Design Automation Conference (01.06.2014)“…Many video processing algorithms are formulated as least-squares problems that result in large, sparse linear systems…”
Get full text
Conference Proceeding -
10
Steady-state dynamic temperature analysis and reliability optimization for embedded multiprocessor systems
ISBN: 1450311997, 9781450311991ISSN: 0738-100XPublished: New York, NY, USA ACM 03.06.2012Published in DAC Design Automation Conference 2012 (03.06.2012)“…In this paper we propose an analytical technique for the steady-state dynamic temperature analysis (SSDTA) of multiprocessor systems with periodic…”
Get full text
Conference Proceeding -
11
Hello SME! Generating Fast Matrix Multiplication Kernels Using the Scalable Matrix Extension
Published: IEEE 17.11.2024Published in SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)“…Modern central processing units (CPUs) feature single-instruction, multiple-data pipelines to accelerate compute-intensive floating-point and fixed-point…”
Get full text
Conference Proceeding -
12
Boosting Graph Spectral Sparsification via Parallel Sparse Approximate Inverse of Cholesky Factor
ISSN: 2153-697XPublished: IEEE 22.01.2024Published in Proceedings of the ASP-DAC ... Asia and South Pacific Design Automation Conference (22.01.2024)“…With the advance of very-large-scale-integrated (VLSI) systems, fast and efficient algorithms for solving equations of Laplacian matrices are increasingly significant…”
Get full text
Conference Proceeding -
13
GPU Accelerated Sparse Cholesky Factorization
Published: IEEE 17.11.2024Published in SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)“…The solution of sparse symmetric positive definite linear systems is an important computational kernel in large-scale scientific and engineering modeling and simulation…”
Get full text
Conference Proceeding -
14
High-Performance Eigensolver Combining EigenExa and Iterative Refinement
Published: IEEE 17.11.2024Published in SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)“… Eigenvalue decomposition is ubiquitous in simulations. Various eigensolvers for computing approximations have been developed thus far…”
Get full text
Conference Proceeding -
15
Implementing sparse matrix-vector multiplication on throughput-oriented processors
ISBN: 1605587443, 9781605587448ISSN: 2167-4329Published: New York, NY, USA ACM 14.11.2009Published in Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis (14.11.2009)“…Sparse matrix-vector multiplication (SpMV) is of singular importance in sparse linear algebra…”
Get full text
Conference Proceeding -
16
COPA: Constrained PARAFAC2 for Sparse & Large Datasets
ISSN: 2155-0751Published: United States 01.10.2018Published in Proceedings of the ... ACM International Conference on Information & Knowledge Management (01.10.2018)“…PARAFAC2 has demonstrated success in modeling irregular tensors, where the tensor dimensions vary across one of the modes. An example scenario is modeling…”
Get more information
Journal Article -
17
Scaling lattice QCD beyond 100 GPUs
ISBN: 145030771X, 9781450307710ISSN: 2167-4329Published: New York, NY, USA ACM 12.11.2011Published in 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC) (12.11.2011)“…Over the past five years, graphics processing units (GPUs) have had a transformational effect on numerical lattice quantum chromodynamics (LQCD) calculations…”
Get full text
Conference Proceeding -
18
Geometry-oblivious FMM for compressing dense SPD matrices
ISBN: 9781450351140, 145035114XISSN: 2167-4337Published: New York, NY, USA ACM 12.11.2017Published in International Conference for High Performance Computing, Networking, Storage and Analysis (Online) (12.11.2017)“…We present GOFMM (geometry-oblivious FMM), a novel method that creates a hierarchical low-rank approximation, or "compression," of an arbitrary dense symmetric…”
Get full text
Conference Proceeding -
19
On monte carlo hybrid methods for linear algebra
ISBN: 1509052224, 9781509052226Published: Piscataway, NJ, USA IEEE Press 13.11.2016Published in Proceedings of the 7th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems (13.11.2016)“…This paper presents an enhanced hybrid (e.g. stochastic/ deterministic) method for Linear Algebra based on bulding an efficient stochastic preconditioner and then solving the corresponding System of Linear Algebraic Equations (SLAE…”
Get full text
Conference Proceeding -
20
Combining Models and Guided Empirical Search to Optimize for Multiple Levels of the Memory Hierarchy
ISBN: 9780769522982, 076952298XPublished: Washington, DC, USA IEEE Computer Society 20.03.2005Published in International Symposium on Code Generation and Optimization : CGO 2005 : 20-23 March, 2005 : San Jose, California (20.03.2005)“…This paper describes an algorithm for simultaneously optimizing across multiple levels of the memory hierarchy for dense-matrix computations…”
Get full text
Conference Proceeding

