Search results - Computing methodologies → Linear algebra algorithms

  1.

    Pipirima: Predicting Patterns in Sparsity to Accelerate Matrix Algebra by Bakhtiar, Ubaid; Joo, Donghyeon; Asgari, Bahar

    Published: IEEE, 22 June 2025
    “… While sparsity, a feature of data in many applications, provides optimization opportunities such as reducing unnecessary computations, data transfers, and …”
    Full text
    Conference proceeding
  2.

    ReSMiPS: A ReRAM-based Sparse Mixed-precision Solver with Fast Matrix Reordering Algorithm by Fu, Yuyang; Li, Jiancong; Chen, Jia; Zhou, Zhiwei; Zhou, Houji; Peng, Wenlong; Li, Yi; Miao, Xiangshui

    Published: IEEE, 22 June 2025
    “… The solution of sparse matrix equations is essential in scientific computing. However, traditional solvers on digital computing platforms are limited by memory bottlenecks in large-scale sparse matrix storage and computation …”
    Full text
    Conference proceeding
  3.

    SpV8: Pursuing Optimal Vectorization and Regular Computation Pattern in SpMV by Li, Chenyang; Xia, Tian; Zhao, Wenzhe; Zheng, Nanning; Ren, Pengju

    Published: IEEE, 5 December 2021
    “… We evaluate SpV8 on Intel Xeon CPU and compare with multiple state-of-art SpMV algorithms using 71 sparse matrices …”
    Full text
    Conference proceeding
  4.

    Harnessing GPU Tensor Cores for Fast FP16 Arithmetic to Speed up Mixed-Precision Iterative Refinement Solvers by Haidar, Azzam; Tomov, Stanimire; Dongarra, Jack; Higham, Nicholas J.

    Published: IEEE, 1 November 2018
    “… Low-precision floating-point arithmetic is a powerful tool for accelerating scientific computing applications, especially those in artificial intelligence …”
    Full text
    Conference proceeding
  5.

    FSPA: An FeFET-based Sparse Matrix-Dense Vector Multiplication Accelerator by Zhang, Xiaoyu; Li, Zerun; Liu, Rui; Chen, Xiaoming; Han, Yinhe

    Published: IEEE, 9 July 2023
    “… The performance of traditional SpMV accelerators is bounded by memory. In-memory computing (IMC) is a promising technique to alleviate the memory bottleneck …”
    Full text
    Conference proceeding
  6.

    On the Parallel I/O Optimality of Linear Algebra Kernels: Near-Optimal Matrix Factorizations by Kwasniewski, Grzegorz; Kabic, Marko; Ben-Nun, Tal; Ziogas, Alexandros Nikolaos; Saethre, Jens Eirik; Gaillard, Andre; Schneider, Timo; Besta, Maciej; Kozhevnikov, Anton; VandeVondele, Joost; Hoefler, Torsten

    ISSN: 2167-4337
    Published: ACM, 14 November 2021
    “… We first establish a theoretical framework for deriving parallel I/O lower bounds for linear algebra kernels, and then utilize its insights to derive Cholesky and LU schedules, both communicating N^{3}/(P\sqrt{M …”
    Full text
    Conference proceeding
  7.

    Solvability of Matrix-Exponential Equations by Ouaknine, Joel; Pouly, Amaury; Sousa-Pinto, Joao; Worrell, James

    ISBN: 9781450343916, 1450343910
    Published: New York, NY, USA: ACM, 5 July 2016
    “… Our results have applications to reachability problems for linear hybrid automata. Our decidability proof relies on a number of theorems from algebraic and transcendental …”
    Full text
    Conference proceeding
  8.

    Many-Body Electronic Correlation Energy using Krylov Subspace Linear Solvers by Shah, Shikhar; Zhang, Boqin; Huang, Hua; Pask, John E.; Suryanarayana, Phanish; Chow, Edmond

    Published: IEEE, 17 November 2024
    “… This paper presents the formulation and implementation of a high performance algorithm to compute the many-body electronic correlation energy via the random-phase approximation within density functional theory …”
    Full text
    Conference proceeding
  9.

    An approximate computing technique for reducing the complexity of a direct-solver for sparse linear systems in real-time video processing by Schaffner, Michael; Gurkaynak, Frank K.; Smolic, Aljosa; Kaeslin, Hubert; Benini, Luca

    ISSN: 0738-100X
    Published: IEEE, 1 June 2014
    Published in: Proceedings - ACM IEEE Design Automation Conference (1 June 2014)
    “… Many video processing algorithms are formulated as least-squares problems that result in large, sparse linear systems …”
    Full text
    Conference proceeding
  10.

    Steady-state dynamic temperature analysis and reliability optimization for embedded multiprocessor systems by Ukhov, Ivan; Bao, Min; Eles, Petru; Peng, Zebo

    ISBN: 1450311997, 9781450311991
    ISSN: 0738-100X
    Published: New York, NY, USA: ACM, 3 June 2012
    Published in: DAC Design Automation Conference 2012 (3 June 2012)
    “… In this paper we propose an analytical technique for the steady-state dynamic temperature analysis (SSDTA) of multiprocessor systems with periodic …”
    Full text
    Conference proceeding
  11.

    Hello SME! Generating Fast Matrix Multiplication Kernels Using the Scalable Matrix Extension by Remke, Stefan; Breuer, Alexander

    Published: IEEE, 17 November 2024
    “… Modern central processing units (CPUs) feature single-instruction, multiple-data pipelines to accelerate compute-intensive floating-point and fixed-point …”
    Full text
    Conference proceeding
  12.

    Boosting Graph Spectral Sparsification via Parallel Sparse Approximate Inverse of Cholesky Factor by Chen, Baiyu; Liu, Zhiqiang; Zhang, Yibin; Yu, Wenjian

    ISSN: 2153-697X
    Published: IEEE, 22 January 2024
    “… With the advance of very-large-scale-integrated (VLSI) systems, fast and efficient algorithms for solving equations of Laplacian matrices are increasingly significant …”
    Full text
    Conference proceeding
  13.

    GPU Accelerated Sparse Cholesky Factorization by Karsavuran, M. Ozan; Ng, Esmond G.; Peyton, Barry W.

    Published: IEEE, 17 November 2024
    “… The solution of sparse symmetric positive definite linear systems is an important computational kernel in large-scale scientific and engineering modeling and simulation …”
    Full text
    Conference proceeding
  14.

    High-Performance Eigensolver Combining EigenExa and Iterative Refinement by Uchino, Yuki; Imamura, Toshiyuki

    Published: IEEE, 17 November 2024
    “… Eigenvalue decomposition is ubiquitous in simulations. Various eigensolvers for computing approximations have been developed thus far …”
    Full text
    Conference proceeding
  15.

    Implementing sparse matrix-vector multiplication on throughput-oriented processors by Bell, Nathan; Garland, Michael

    ISBN: 1605587443, 9781605587448
    ISSN: 2167-4329
    Published: New York, NY, USA: ACM, 14 November 2009
    “… Sparse matrix-vector multiplication (SpMV) is of singular importance in sparse linear algebra …”
    Full text
    Conference proceeding
  16.

    COPA: Constrained PARAFAC2 for Sparse & Large Datasets by Afshar, Ardavan; Perros, Ioakeim; Papalexakis, Evangelos E; Searles, Elizabeth; Ho, Joyce; Sun, Jimeng

    ISSN: 2155-0751
    Published: United States, 1 October 2018
    “… PARAFAC2 has demonstrated success in modeling irregular tensors, where the tensor dimensions vary across one of the modes. An example scenario is modeling …”
    More details
    Journal Article
  17.

    Scaling lattice QCD beyond 100 GPUs by Babich, R.; Clark, M. A.; Joó, B.; Shi, G.; Brower, R. C.; Gottlieb, S.

    ISBN: 145030771X, 9781450307710
    ISSN: 2167-4329
    Published: New York, NY, USA: ACM, 12 November 2011
    “… Over the past five years, graphics processing units (GPUs) have had a transformational effect on numerical lattice quantum chromodynamics (LQCD) calculations …”
    Full text
    Conference proceeding
  18.

    Geometry-oblivious FMM for compressing dense SPD matrices by Yu, Chenhan D.; Levitt, James; Reiz, Severin; Biros, George

    ISBN: 9781450351140, 145035114X
    ISSN: 2167-4337
    Published: New York, NY, USA: ACM, 12 November 2017
    “… We present GOFMM (geometry-oblivious FMM), a novel method that creates a hierarchical low-rank approximation, or "compression," of an arbitrary dense symmetric …”
    Full text
    Conference proceeding
  19.

    On Monte Carlo hybrid methods for linear algebra by Dávila, Diego; Alexandrov, Vassil; Esquivel-Flores, Oscar A.

    ISBN: 1509052224, 9781509052226
    Published: Piscataway, NJ, USA: IEEE Press, 13 November 2016
    “… This paper presents an enhanced hybrid (e.g. stochastic/deterministic) method for Linear Algebra based on building an efficient stochastic preconditioner and then solving the corresponding System of Linear Algebraic Equations (SLAE …”
    Full text
    Conference proceeding
  20.

    Combining Models and Guided Empirical Search to Optimize for Multiple Levels of the Memory Hierarchy by Chen, Chun; Chame, Jacqueline; Hall, Mary

    ISBN: 9780769522982, 076952298X
    Published: Washington, DC, USA: IEEE Computer Society, 20 March 2005
    “… This paper describes an algorithm for simultaneously optimizing across multiple levels of the memory hierarchy for dense-matrix computations …”
    Full text
    Conference proceeding