Search results - Computing methodologies → Linear algebra algorithms
1
Pipirima: Predicting Patterns in Sparsity to Accelerate Matrix Algebra
Published: IEEE, 22.06.2025
Published in: 2025 62nd ACM/IEEE Design Automation Conference (DAC) (22.06.2025)
“… While sparsity, a feature of data in many applications, provides optimization opportunities such as reducing unnecessary computations, data transfers, and …”
Conference proceeding
2
ReSMiPS: A ReRAM-based Sparse Mixed-precision Solver with Fast Matrix Reordering Algorithm
Published: IEEE, 22.06.2025
Published in: 2025 62nd ACM/IEEE Design Automation Conference (DAC) (22.06.2025)
“… The solution of sparse matrix equations is essential in scientific computing. However, traditional solvers on digital computing platforms are limited by memory bottlenecks in large-scale sparse matrix storage and computation …”
Conference proceeding
3
SpV8: Pursuing Optimal Vectorization and Regular Computation Pattern in SpMV
Published: IEEE, 05.12.2021
Published in: 2021 58th ACM/IEEE Design Automation Conference (DAC) (05.12.2021)
“… We evaluate SpV8 on Intel Xeon CPU and compare with multiple state-of-art SpMV algorithms using 71 sparse matrices …”
Conference proceeding
4
Harnessing GPU Tensor Cores for Fast FP16 Arithmetic to Speed up Mixed-Precision Iterative Refinement Solvers
Published: IEEE, 01.11.2018
Published in: SC18: International Conference for High Performance Computing, Networking, Storage and Analysis (01.11.2018)
“… Low-precision floating-point arithmetic is a powerful tool for accelerating scientific computing applications, especially those in artificial intelligence …”
Conference proceeding
5
FSPA: An FeFET-based Sparse Matrix-Dense Vector Multiplication Accelerator
Published: IEEE, 09.07.2023
Published in: 2023 60th ACM/IEEE Design Automation Conference (DAC) (09.07.2023)
“… The performance of traditional SpMV accelerators is bounded by memory. In-memory computing (IMC) is a promising technique to alleviate the memory bottleneck …”
Conference proceeding
6
On the Parallel I/O Optimality of Linear Algebra Kernels: Near-Optimal Matrix Factorizations
ISSN: 2167-4337
Published: ACM, 14.11.2021
Published in: SC21: International Conference for High Performance Computing, Networking, Storage and Analysis (14.11.2021)
“… We first establish a theoretical framework for deriving parallel I/O lower bounds for linear algebra kernels, and then utilize its insights to derive Cholesky and LU schedules, both communicating N^{3}/(P\sqrt{M …”
Conference proceeding
7
Solvability of Matrix-Exponential Equations
ISBN: 9781450343916, 1450343910
Published: New York, NY, USA, ACM, 05.07.2016
Published in: Proceedings of the 31st Annual ACM/IEEE Symposium on Logic in Computer Science (05.07.2016)
“… Our results have applications to reachability problems for linear hybrid automata. Our decidability proof relies on a number of theorems from algebraic and transcendental …”
Conference proceeding
8
Many-Body Electronic Correlation Energy using Krylov Subspace Linear Solvers
Published: IEEE, 17.11.2024
Published in: SC24: International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“… This paper presents the formulation and implementation of a high performance algorithm to compute the many-body electronic correlation energy via the random-phase approximation within density functional theory …”
Conference proceeding
9
An approximate computing technique for reducing the complexity of a direct-solver for sparse linear systems in real-time video processing
ISSN: 0738-100X
Published: IEEE, 01.06.2014
Published in: Proceedings - ACM IEEE Design Automation Conference (01.06.2014)
“… Many video processing algorithms are formulated as least-squares problems that result in large, sparse linear systems …”
Conference proceeding
10
Steady-state dynamic temperature analysis and reliability optimization for embedded multiprocessor systems
ISBN: 1450311997, 9781450311991
ISSN: 0738-100X
Published: New York, NY, USA, ACM, 03.06.2012
Published in: DAC Design Automation Conference 2012 (03.06.2012)
“… In this paper we propose an analytical technique for the steady-state dynamic temperature analysis (SSDTA) of multiprocessor systems with periodic …”
Conference proceeding
11
Hello SME! Generating Fast Matrix Multiplication Kernels Using the Scalable Matrix Extension
Published: IEEE, 17.11.2024
Published in: SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“… Modern central processing units (CPUs) feature single-instruction, multiple-data pipelines to accelerate compute-intensive floating-point and fixed-point …”
Conference proceeding
12
Boosting Graph Spectral Sparsification via Parallel Sparse Approximate Inverse of Cholesky Factor
ISSN: 2153-697X
Published: IEEE, 22.01.2024
Published in: Proceedings of the ASP-DAC ... Asia and South Pacific Design Automation Conference (22.01.2024)
“… With the advance of very-large-scale-integrated (VLSI) systems, fast and efficient algorithms for solving equations of Laplacian matrices are increasingly significant …”
Conference proceeding
13
GPU Accelerated Sparse Cholesky Factorization
Published: IEEE, 17.11.2024
Published in: SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“… The solution of sparse symmetric positive definite linear systems is an important computational kernel in large-scale scientific and engineering modeling and simulation …”
Conference proceeding
14
High-Performance Eigensolver Combining EigenExa and Iterative Refinement
Published: IEEE, 17.11.2024
Published in: SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“… Eigenvalue decomposition is ubiquitous in simulations. Various eigensolvers for computing approximations have been developed thus far …”
Conference proceeding
15
Implementing sparse matrix-vector multiplication on throughput-oriented processors
ISBN: 1605587443, 9781605587448
ISSN: 2167-4329
Published: New York, NY, USA, ACM, 14.11.2009
Published in: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis (14.11.2009)
“… Sparse matrix-vector multiplication (SpMV) is of singular importance in sparse linear algebra …”
Conference proceeding
16
COPA: Constrained PARAFAC2 for Sparse & Large Datasets
ISSN: 2155-0751
Published: United States, 01.10.2018
Published in: Proceedings of the ... ACM International Conference on Information & Knowledge Management (01.10.2018)
“… PARAFAC2 has demonstrated success in modeling irregular tensors, where the tensor dimensions vary across one of the modes. An example scenario is modeling …”
Journal article
17
Scaling lattice QCD beyond 100 GPUs
ISBN: 145030771X, 9781450307710
ISSN: 2167-4329
Published: New York, NY, USA, ACM, 12.11.2011
Published in: 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC) (12.11.2011)
“… Over the past five years, graphics processing units (GPUs) have had a transformational effect on numerical lattice quantum chromodynamics (LQCD) calculations …”
Conference proceeding
18
Geometry-oblivious FMM for compressing dense SPD matrices
ISBN: 9781450351140, 145035114X
ISSN: 2167-4337
Published: New York, NY, USA, ACM, 12.11.2017
Published in: International Conference for High Performance Computing, Networking, Storage and Analysis (Online) (12.11.2017)
“… We present GOFMM (geometry-oblivious FMM), a novel method that creates a hierarchical low-rank approximation, or "compression," of an arbitrary dense symmetric …”
Conference proceeding
19
On Monte Carlo hybrid methods for linear algebra
ISBN: 1509052224, 9781509052226
Published: Piscataway, NJ, USA, IEEE Press, 13.11.2016
Published in: Proceedings of the 7th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems (13.11.2016)
“… This paper presents an enhanced hybrid (e.g. stochastic/deterministic) method for Linear Algebra based on building an efficient stochastic preconditioner and then solving the corresponding System of Linear Algebraic Equations (SLAE …”
Conference proceeding
20
Combining Models and Guided Empirical Search to Optimize for Multiple Levels of the Memory Hierarchy
ISBN: 9780769522982, 076952298X
Published: Washington, DC, USA, IEEE Computer Society, 20.03.2005
Published in: International Symposium on Code Generation and Optimization : CGO 2005 : 20-23 March, 2005 : San Jose, California (20.03.2005)
“… This paper describes an algorithm for simultaneously optimizing across multiple levels of the memory hierarchy for dense-matrix computations …”
Conference proceeding

