Search results - Computing methodologies → Linear algebra algorithms
-
1
Pipirima: Predicting Patterns in Sparsity to Accelerate Matrix Algebra
Published: IEEE 22.06.2025
Published in: 2025 62nd ACM/IEEE Design Automation Conference (DAC) (22.06.2025)
“…While sparsity, a feature of data in many applications, provides optimization opportunities such as reducing unnecessary computations, data transfers, and…”
Conference paper -
2
ReSMiPS: A ReRAM-based Sparse Mixed-precision Solver with Fast Matrix Reordering Algorithm
Published: IEEE 22.06.2025
Published in: 2025 62nd ACM/IEEE Design Automation Conference (DAC) (22.06.2025)
“…The solution of sparse matrix equations is essential in scientific computing. However, traditional solvers on digital computing platforms are limited by memory bottlenecks in large-scale sparse matrix storage and computation…”
Conference paper -
3
SpV8: Pursuing Optimal Vectorization and Regular Computation Pattern in SpMV
Published: IEEE 05.12.2021
Published in: 2021 58th ACM/IEEE Design Automation Conference (DAC) (05.12.2021)
“…We evaluate SpV8 on Intel Xeon CPU and compare with multiple state-of-art SpMV algorithms using 71 sparse matrices…”
Conference paper -
4
Harnessing GPU Tensor Cores for Fast FP16 Arithmetic to Speed up Mixed-Precision Iterative Refinement Solvers
Published: IEEE 01.11.2018
Published in: SC18: International Conference for High Performance Computing, Networking, Storage and Analysis (01.11.2018)
“…Low-precision floating-point arithmetic is a powerful tool for accelerating scientific computing applications, especially those in artificial intelligence…”
Conference paper -
5
FSPA: An FeFET-based Sparse Matrix-Dense Vector Multiplication Accelerator
Published: IEEE 09.07.2023
Published in: 2023 60th ACM/IEEE Design Automation Conference (DAC) (09.07.2023)
“…The performance of traditional SpMV accelerators is bounded by memory. In-memory computing (IMC) is a promising technique to alleviate the memory bottleneck…”
Conference paper -
6
On the Parallel I/O Optimality of Linear Algebra Kernels: Near-Optimal Matrix Factorizations
ISSN: 2167-4337
Published: ACM 14.11.2021
Published in: SC21: International Conference for High Performance Computing, Networking, Storage and Analysis (14.11.2021)
“…We first establish a theoretical framework for deriving parallel I/O lower bounds for linear algebra kernels, and then utilize its insights to derive Cholesky and LU schedules, both communicating N^{3}/(P\sqrt{M…”
Conference paper -
7
Solvability of Matrix-Exponential Equations
ISBN: 9781450343916, 1450343910
Published: New York, NY, USA ACM 05.07.2016
Published in: Proceedings of the 31st Annual ACM/IEEE Symposium on Logic in Computer Science (05.07.2016)
“…Our results have applications to reachability problems for linear hybrid automata. Our decidability proof relies on a number of theorems from algebraic and transcendental…”
Conference paper -
8
Many-Body Electronic Correlation Energy using Krylov Subspace Linear Solvers
Published: IEEE 17.11.2024
Published in: SC24: International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“…This paper presents the formulation and implementation of a high performance algorithm to compute the many-body electronic correlation energy via the random-phase approximation within density functional theory…”
Conference paper -
9
An approximate computing technique for reducing the complexity of a direct-solver for sparse linear systems in real-time video processing
ISSN: 0738-100X
Published: IEEE 01.06.2014
Published in: Proceedings - ACM IEEE Design Automation Conference (01.06.2014)
“…Many video processing algorithms are formulated as least-squares problems that result in large, sparse linear systems…”
Conference paper -
10
Steady-state dynamic temperature analysis and reliability optimization for embedded multiprocessor systems
ISBN: 1450311997, 9781450311991
ISSN: 0738-100X
Published: New York, NY, USA ACM 03.06.2012
Published in: DAC Design Automation Conference 2012 (03.06.2012)
“…In this paper we propose an analytical technique for the steady-state dynamic temperature analysis (SSDTA) of multiprocessor systems with periodic…”
Conference paper -
11
Hello SME! Generating Fast Matrix Multiplication Kernels Using the Scalable Matrix Extension
Published: IEEE 17.11.2024
Published in: SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“…Modern central processing units (CPUs) feature single-instruction, multiple-data pipelines to accelerate compute-intensive floating-point and fixed-point…”
Conference paper -
12
Boosting Graph Spectral Sparsification via Parallel Sparse Approximate Inverse of Cholesky Factor
ISSN: 2153-697X
Published: IEEE 22.01.2024
Published in: Proceedings of the ASP-DAC ... Asia and South Pacific Design Automation Conference (22.01.2024)
“…With the advance of very-large-scale-integrated (VLSI) systems, fast and efficient algorithms for solving equations of Laplacian matrices are increasingly significant…”
Conference paper -
13
GPU Accelerated Sparse Cholesky Factorization
Published: IEEE 17.11.2024
Published in: SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“…The solution of sparse symmetric positive definite linear systems is an important computational kernel in large-scale scientific and engineering modeling and simulation…”
Conference paper -
14
High-Performance Eigensolver Combining EigenExa and Iterative Refinement
Published: IEEE 17.11.2024
Published in: SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“…Eigenvalue decomposition is ubiquitous in simulations. Various eigensolvers for computing approximations have been developed thus far…”
Conference paper -
15
Implementing sparse matrix-vector multiplication on throughput-oriented processors
ISBN: 1605587443, 9781605587448
ISSN: 2167-4329
Published: New York, NY, USA ACM 14.11.2009
Published in: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis (14.11.2009)
“…Sparse matrix-vector multiplication (SpMV) is of singular importance in sparse linear algebra…”
Conference paper -
16
COPA: Constrained PARAFAC2 for Sparse & Large Datasets
ISSN: 2155-0751
Published: United States 01.10.2018
Published in: Proceedings of the ... ACM International Conference on Information & Knowledge Management (01.10.2018)
“…PARAFAC2 has demonstrated success in modeling irregular tensors, where the tensor dimensions vary across one of the modes. An example scenario is modeling…”
Journal Article -
17
Scaling lattice QCD beyond 100 GPUs
ISBN: 145030771X, 9781450307710
ISSN: 2167-4329
Published: New York, NY, USA ACM 12.11.2011
Published in: 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC) (12.11.2011)
“…Over the past five years, graphics processing units (GPUs) have had a transformational effect on numerical lattice quantum chromodynamics (LQCD) calculations…”
Conference paper -
18
Geometry-oblivious FMM for compressing dense SPD matrices
ISBN: 9781450351140, 145035114X
ISSN: 2167-4337
Published: New York, NY, USA ACM 12.11.2017
Published in: International Conference for High Performance Computing, Networking, Storage and Analysis (Online) (12.11.2017)
“…We present GOFMM (geometry-oblivious FMM), a novel method that creates a hierarchical low-rank approximation, or "compression," of an arbitrary dense symmetric…”
Conference paper -
19
On monte carlo hybrid methods for linear algebra
ISBN: 1509052224, 9781509052226
Published: Piscataway, NJ, USA IEEE Press 13.11.2016
Published in: Proceedings of the 7th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems (13.11.2016)
“…This paper presents an enhanced hybrid (e.g. stochastic/deterministic) method for Linear Algebra based on building an efficient stochastic preconditioner and then solving the corresponding System of Linear Algebraic Equations (SLAE…”
Conference paper -
20
Combining Models and Guided Empirical Search to Optimize for Multiple Levels of the Memory Hierarchy
ISBN: 9780769522982, 076952298X
Published: Washington, DC, USA IEEE Computer Society 20.03.2005
Published in: International Symposium on Code Generation and Optimization : CGO 2005 : 20-23 March, 2005 : San Jose, California (20.03.2005)
“…This paper describes an algorithm for simultaneously optimizing across multiple levels of the memory hierarchy for dense-matrix computations…”
Conference paper

