Suchergebnisse - (("Matrix transpose algorithm") OR ("Matrix transport algorithm"))

Andere Suchmöglichkeiten:

  • Treffer 1 - 17 von 17
Treffer weiter einschränken
  1. 1

    Padding Free Bank Conflict Resolution for CUDA-Based Matrix Transpose Algorithm von Khan, Ayaz ul Hassan, Al-Mouhamed, Mayez, Fatayer, Allam, Almousa, Anas, Baqais, Abdulrahman, Assayony, Mohammed

    ISSN: 2211-7938, 2211-7946, 2211-7946
    Veröffentlicht: Dordrecht Springer Netherlands 01.01.2014
    “… In this paper, two matrix transpose algorithms are proposed to alleviate the aforementioned issues of ensuring coalesced access and conflict free bank access …”
    Volltext
    Journal Article
  2. 2

    Communication efficient adaptive matrix transpose algorithm for FFT on symmetric multiprocessors von Al Na'mneh, R., Pan, W.D., Adhami, R.

    ISBN: 0780388089, 9780780388086
    ISSN: 0094-2898
    Veröffentlicht: IEEE 2005
    “… In this paper, we propose an efficient algorithm (the adaptive matrix-transpose algorithm) for transposing matrices, which is based on all-to …”
    Volltext
    Tagungsbericht
  3. 3

    Restructuring and implementations of 2D matrix transpose algorithm using SSE4 vector instructions von Zekri, Ahmed S.

    Veröffentlicht: IEEE 01.10.2015
    “… Current general-purpose processors are augmented with vector instructions that can process many elements of matrices and vectors in parallel. Transposing a …”
    Volltext
    Tagungsbericht
  4. 4

    Padding free bank conflict resolution for CUDA-based matrix transpose algorithm von Khan, A., Al-Mouhamed, M., Fatayar, A., Almousa, A., Baqais, A., Assayony, M.

    Veröffentlicht: IEEE 01.06.2014
    “… In this paper, two matrix transpose algorithms are proposed to alleviate the aforementioned issues of ensuring coalesced access and conflict free bank access …”
    Volltext
    Tagungsbericht
  5. 5

    Parallel matrix transpose algorithms on distributed memory concurrent computers von Choi, Jaeyoung, Dongarra, Jack J., Walker, David W.

    ISSN: 0167-8191, 1872-7336
    Veröffentlicht: Amsterdam Elsevier B.V 01.09.1995
    Veröffentlicht in Parallel computing (01.09.1995)
    “… This paper describes parallel matrix transpose algorithms on distributed memory concurrent processors …”
    Volltext
    Journal Article
  6. 6

    Matrix transpose on meshes with buses von Békési, József, Galambos, Gábor

    ISSN: 0743-7315, 1096-0848
    Veröffentlicht: Elsevier Inc 01.10.2016
    Veröffentlicht in Journal of parallel and distributed computing (01.10.2016)
    “… 0.45n for the number of steps required by any matrix transpose algorithm on an n×n mesh with buses. Next we present an algorithm which solves this problem in less …”
    Volltext
    Journal Article
  7. 7

    A 280 mV-to-1.1 V 256b Reconfigurable SIMD Vector Permutation Engine With 2-Dimensional Shuffle in 22 nm Tri-Gate CMOS von Hsu, S. K., Agarwal, A., Anders, M. A., Mathew, S. K., Kaul, H., Sheikh, F., Krishnamurthy, R. K.

    ISSN: 0018-9200, 1558-173X
    Veröffentlicht: New York, NY IEEE 01.01.2013
    Veröffentlicht in IEEE journal of solid-state circuits (01.01.2013)
    “… An ultra-low voltage reconfigurable 4-way to 32-way SIMD vector permutation engine is fabricated in 22 nm tri-gate bulk CMOS, consisting of a 32-entry × 256b …”
    Volltext
    Journal Article Tagungsbericht
  8. 8

    Linear-time matrix transpose algorithms using vector register file with diagonal registers von Hanounik, B., Hu, X.

    ISBN: 0769509908, 9780769509907
    ISSN: 1530-2075
    Veröffentlicht: IEEE 2001
    “… Matrix transpose operation (MT) is used frequently in many multimedia and high performance applications. Therefore, using a faster MT operation results in a …”
    Volltext
    Tagungsbericht
  9. 9

    Parallel matrix transpose algorithms on distributed memory concurrent computers von Jaeyoung Choi, Dongarra, J.J., Walker, D.W.

    ISBN: 0818649801, 9780818649806
    Veröffentlicht: IEEE Comput. Soc. Press 1993
    “… This paper describes parallel matrix transpose algorithms on distributed memory concurrent processors …”
    Volltext
    Tagungsbericht
  10. 10

    A 280mV-to-1.1V 256b reconfigurable SIMD vector permutation engine with 2-dimensional shuffle in 22nm CMOS von Hsu, S., Agarwal, A., Anders, M., Mathew, S., Kaul, H., Sheikh, F., Krishnamurthy, R.

    ISBN: 1467303763, 9781467303767
    ISSN: 0193-6530
    Veröffentlicht: IEEE 01.02.2012
    “… Energy-efficient SIMD permutation operations are key for maximizing high-performance microprocessor vector datapath utilization in multimedia, graphics, and …”
    Volltext
    Tagungsbericht
  11. 11

    A Sparse Matrix Fast Transpose Algorithm Based on Pseudo-Address von Da, Wenjiao, Ren, Zhiguo, Lu, Jiao, Shi, Xuxia

    Veröffentlicht: IEEE 01.12.2019
    “… sparse matrix, mainly study the fast transpose algorithm of sparse matrix, and propose a new matrix transpose algorithm on it for the first time-the sparse matrix fast transpose algorithm of pseudo …”
    Volltext
    Tagungsbericht
  12. 12

    A simplified design strategy for mapping image processing algorithms on a SIMD torus von Seetharaman, Guna

    ISSN: 0304-3975, 1879-2294
    Veröffentlicht: Elsevier B.V 03.04.1995
    Veröffentlicht in Theoretical computer science (03.04.1995)
    “… A method is proposed to effectively realize large number of arbitrary, one-to-one, personalized, and concurrent communication between the PEs, by suitably repeating the matrix transpose algorithm …”
    Volltext
    Journal Article
  13. 13

    An O(n) Time-Complexity Matrix Transpose on Torus Array Processor von Ravankar, A. A., Sedukhin, S. G.

    ISBN: 1457717964, 9781457717963
    Veröffentlicht: IEEE 01.11.2011
    “… and an efficient matrix transpose algorithm can speed up many applications. In this paper, we propose a new algorithm for n x n matrix transposition on array processors connected in torus network …”
    Volltext
    Tagungsbericht
  14. 14

    A parallel cosmological hydrodynamics code von Bode, Paul W., Xu, Guohong, Cen, Renyue

    ISBN: 0897918541, 9780897918541
    Veröffentlicht: Washington, DC, USA IEEE Computer Society 17.11.1996
    “… A new, flexible matrix transpose algorithm is used to interchange distributed and local dimensions of the mesh. Timing results from runs on an IBM SP2 supercomputer are given …”
    Volltext
    Tagungsbericht
  15. 15

    A Parallel Cosmological Hydrodynamics Code von Renyue Cen, Guohong Xu, Bode, P.W.

    ISBN: 0897918541, 9780897918541
    Veröffentlicht: IEEE 1996
    “… , combining a mesh based Eulerian hydrodynamics code and a Particle-Mesh N-body code. A new, flexible matrix transpose algorithm is used to interchange distributed and local dimensions of the mesh …”
    Volltext
    Tagungsbericht
  16. 16

    Random Address Permute-Shift Technique for the Shared Memory on GPUs von Nakano, Koji, Matsumae, Susumu, Ito, Yasuaki

    ISSN: 0190-3918
    Veröffentlicht: IEEE 01.09.2014
    “… The Discrete Memory Machine (DMM) is a theoretical parallel computing model that captures the essence of memory access to the shared memory of a streaming …”
    Volltext
    Tagungsbericht
  17. 17

    The Random Address Shift to Reduce the Memory Access Congestion on the Discrete Memory Machine von Nakano, Koji, Matsumae, Susumu, Ito, Yasuaki

    ISSN: 2379-1888
    Veröffentlicht: IEEE 01.12.2013
    “… The Discrete Memory Machine (DMM) is a theoretical parallel computing model that captures the essence of memory access of the streaming multiprocessor on …”
    Volltext
    Tagungsbericht