Suchergebnisse - (("Matrix transpose algorithm") OR ("Matrix transport algorithm"))

Andere Suchmöglichkeiten:

(("Matrix transpose algorithm") OR ("Matrix transport algorithm")) »
- (("Matrix transport algorithm") OR ("Matrix transport algorithm"))

1

Wird geladen …

Padding Free Bank Conflict Resolution for CUDA-Based Matrix Transpose Algorithm von Khan, Ayaz ul Hassan, Al-Mouhamed, Mayez, Fatayer, Allam, Almousa, Anas, Baqais, Abdulrahman, Assayony, Mohammed

ISSN: 2211-7938, 2211-7946, 2211-7946

Veröffentlicht: Dordrecht Springer Netherlands 01.01.2014

Veröffentlicht in The International journal of networked and distributed computing (Online) (01.01.2014)
“… In this paper, two matrix transpose algorithms are proposed to alleviate the aforementioned issues of ensuring coalesced access and conflict free bank access …”

Volltext

Journal Article

Zu den Favoriten

Gespeichert in:
2

Wird geladen …

Communication efficient adaptive matrix transpose algorithm for FFT on symmetric multiprocessors von Al Na'mneh, R., Pan, W.D., Adhami, R.

ISBN: 0780388089, 9780780388086

ISSN: 0094-2898

Veröffentlicht: IEEE 2005

Veröffentlicht in Proceedings of the Thirty-Seventh Southeastern Symposium on System Theory, 2005. SSST '05 (2005)
“… In this paper, we propose an efficient algorithm (the adaptive matrix-transpose algorithm) for transposing matrices, which is based on all-to …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
3

Wird geladen …

Restructuring and implementations of 2D matrix transpose algorithm using SSE4 vector instructions von Zekri, Ahmed S.

Veröffentlicht: IEEE 01.10.2015

Veröffentlicht in 2015 International Conference on Applied Research in Computer Science and Engineering (ICAR) (01.10.2015)
“… Current general-purpose processors are augmented with vector instructions that can process many elements of matrices and vectors in parallel. Transposing a …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
4

Wird geladen …

Padding free bank conflict resolution for CUDA-based matrix transpose algorithm von Khan, A., Al-Mouhamed, M., Fatayar, A., Almousa, A., Baqais, A., Assayony, M.

Veröffentlicht: IEEE 01.06.2014

Veröffentlicht in 15th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD) (01.06.2014)
“… In this paper, two matrix transpose algorithms are proposed to alleviate the aforementioned issues of ensuring coalesced access and conflict free bank access …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
5

Wird geladen …

Parallel matrix transpose algorithms on distributed memory concurrent computers von Choi, Jaeyoung, Dongarra, Jack J., Walker, David W.

ISSN: 0167-8191, 1872-7336

Veröffentlicht: Amsterdam Elsevier B.V 01.09.1995

Veröffentlicht in Parallel computing (01.09.1995)
“… This paper describes parallel matrix transpose algorithms on distributed memory concurrent processors …”

Volltext

Journal Article

Zu den Favoriten

Gespeichert in:
6

Wird geladen …

Matrix transpose on meshes with buses von Békési, József, Galambos, Gábor

ISSN: 0743-7315, 1096-0848

Veröffentlicht: Elsevier Inc 01.10.2016

Veröffentlicht in Journal of parallel and distributed computing (01.10.2016)
“… 0.45n for the number of steps required by any matrix transpose algorithm on an n×n mesh with buses. Next we present an algorithm which solves this problem in less …”

Volltext

Journal Article

Zu den Favoriten

Gespeichert in:
7

Wird geladen …

A 280 mV-to-1.1 V 256b Reconfigurable SIMD Vector Permutation Engine With 2-Dimensional Shuffle in 22 nm Tri-Gate CMOS von Hsu, S. K., Agarwal, A., Anders, M. A., Mathew, S. K., Kaul, H., Sheikh, F., Krishnamurthy, R. K.

ISSN: 0018-9200, 1558-173X

Veröffentlicht: New York, NY IEEE 01.01.2013

Veröffentlicht in IEEE journal of solid-state circuits (01.01.2013)
“… An ultra-low voltage reconfigurable 4-way to 32-way SIMD vector permutation engine is fabricated in 22 nm tri-gate bulk CMOS, consisting of a 32-entry × 256b …”

Volltext

Journal Article Tagungsbericht

Zu den Favoriten

Gespeichert in:
8

Wird geladen …

Linear-time matrix transpose algorithms using vector register file with diagonal registers von Hanounik, B., Hu, X.

ISBN: 0769509908, 9780769509907

ISSN: 1530-2075

Veröffentlicht: IEEE 2001

Veröffentlicht in Proceedings 15th International Parallel and Distributed Processing Symposium. IPDPS 2001 (2001)
“… Matrix transpose operation (MT) is used frequently in many multimedia and high performance applications. Therefore, using a faster MT operation results in a …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
9

Wird geladen …

Parallel matrix transpose algorithms on distributed memory concurrent computers von Jaeyoung Choi, Dongarra, J.J., Walker, D.W.

ISBN: 0818649801, 9780818649806

Veröffentlicht: IEEE Comput. Soc. Press 1993

Veröffentlicht in Proceedings of the Scalable Parallel Libraries Conference , October 6-8, 1993, Mississippi State, Mississippi (1993)
“… This paper describes parallel matrix transpose algorithms on distributed memory concurrent processors …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
10

Wird geladen …

A 280mV-to-1.1V 256b reconfigurable SIMD vector permutation engine with 2-dimensional shuffle in 22nm CMOS von Hsu, S., Agarwal, A., Anders, M., Mathew, S., Kaul, H., Sheikh, F., Krishnamurthy, R.

ISBN: 1467303763, 9781467303767

ISSN: 0193-6530

Veröffentlicht: IEEE 01.02.2012

Veröffentlicht in 2012 IEEE International Solid-State Circuits Conference (01.02.2012)
“… Energy-efficient SIMD permutation operations are key for maximizing high-performance microprocessor vector datapath utilization in multimedia, graphics, and …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
11

Wird geladen …

A Sparse Matrix Fast Transpose Algorithm Based on Pseudo-Address von Da, Wenjiao, Ren, Zhiguo, Lu, Jiao, Shi, Xuxia

Veröffentlicht: IEEE 01.12.2019

Veröffentlicht in 2019 International Conference on Intelligent Computing, Automation and Systems (ICICAS) (01.12.2019)
“… sparse matrix, mainly study the fast transpose algorithm of sparse matrix, and propose a new matrix transpose algorithm on it for the first time-the sparse matrix fast transpose algorithm of pseudo …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
12

Wird geladen …

A simplified design strategy for mapping image processing algorithms on a SIMD torus von Seetharaman, Guna

ISSN: 0304-3975, 1879-2294

Veröffentlicht: Elsevier B.V 03.04.1995

Veröffentlicht in Theoretical computer science (03.04.1995)
“… A method is proposed to effectively realize large number of arbitrary, one-to-one, personalized, and concurrent communication between the PEs, by suitably repeating the matrix transpose algorithm …”

Volltext

Journal Article

Zu den Favoriten

Gespeichert in:
13

Wird geladen …

An O(n) Time-Complexity Matrix Transpose on Torus Array Processor von Ravankar, A. A., Sedukhin, S. G.

ISBN: 1457717964, 9781457717963

Veröffentlicht: IEEE 01.11.2011

Veröffentlicht in 2011 Second International Conference on Networking and Computing (01.11.2011)
“… and an efficient matrix transpose algorithm can speed up many applications. In this paper, we propose a new algorithm for n x n matrix transposition on array processors connected in torus network …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
14

Wird geladen …

A parallel cosmological hydrodynamics code von Bode, Paul W., Xu, Guohong, Cen, Renyue

ISBN: 0897918541, 9780897918541

Veröffentlicht: Washington, DC, USA IEEE Computer Society 17.11.1996

Veröffentlicht in Proceedings of the 1996 ACM/IEEE conference on Supercomputing (17.11.1996)
“… A new, flexible matrix transpose algorithm is used to interchange distributed and local dimensions of the mesh. Timing results from runs on an IBM SP2 supercomputer are given …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
15

Wird geladen …

A Parallel Cosmological Hydrodynamics Code von Renyue Cen, Guohong Xu, Bode, P.W.

ISBN: 0897918541, 9780897918541

Veröffentlicht: IEEE 1996

Veröffentlicht in Supercomputing '96 conference proceedings : the International Conference on High Performance Computing and Communications : November 17-22, 1996, Pittsburgh, PA (1996)
“… , combining a mesh based Eulerian hydrodynamics code and a Particle-Mesh N-body code. A new, flexible matrix transpose algorithm is used to interchange distributed and local dimensions of the mesh …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
16

Wird geladen …

Random Address Permute-Shift Technique for the Shared Memory on GPUs von Nakano, Koji, Matsumae, Susumu, Ito, Yasuaki

ISSN: 0190-3918

Veröffentlicht: IEEE 01.09.2014

Veröffentlicht in Proceedings of the International Conference on Parallel Processing (01.09.2014)
“… The Discrete Memory Machine (DMM) is a theoretical parallel computing model that captures the essence of memory access to the shared memory of a streaming …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
17

Wird geladen …

The Random Address Shift to Reduce the Memory Access Congestion on the Discrete Memory Machine von Nakano, Koji, Matsumae, Susumu, Ito, Yasuaki

ISSN: 2379-1888

Veröffentlicht: IEEE 01.12.2013

Veröffentlicht in International Symposium on Computing and Networking (Online) (01.12.2013)
“… The Discrete Memory Machine (DMM) is a theoretical parallel computing model that captures the essence of memory access of the streaming multiprocessor on …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:

Suchergebnisse - (("Matrix transpose algorithm") OR ("Matrix transport algorithm"))

Andere Suchmöglichkeiten:

Padding Free Bank Conflict Resolution for CUDA-Based Matrix Transpose Algorithm von Khan, Ayaz ul Hassan, Al-Mouhamed, Mayez, Fatayer, Allam, Almousa, Anas, Baqais, Abdulrahman, Assayony, Mohammed

Communication efficient adaptive matrix transpose algorithm for FFT on symmetric multiprocessors von Al Na'mneh, R., Pan, W.D., Adhami, R.

Restructuring and implementations of 2D matrix transpose algorithm using SSE4 vector instructions von Zekri, Ahmed S.

Padding free bank conflict resolution for CUDA-based matrix transpose algorithm von Khan, A., Al-Mouhamed, M., Fatayar, A., Almousa, A., Baqais, A., Assayony, M.

Parallel matrix transpose algorithms on distributed memory concurrent computers von Choi, Jaeyoung, Dongarra, Jack J., Walker, David W.

Matrix transpose on meshes with buses von Békési, József, Galambos, Gábor

A 280 mV-to-1.1 V 256b Reconfigurable SIMD Vector Permutation Engine With 2-Dimensional Shuffle in 22 nm Tri-Gate CMOS von Hsu, S. K., Agarwal, A., Anders, M. A., Mathew, S. K., Kaul, H., Sheikh, F., Krishnamurthy, R. K.

Linear-time matrix transpose algorithms using vector register file with diagonal registers von Hanounik, B., Hu, X.

Parallel matrix transpose algorithms on distributed memory concurrent computers von Jaeyoung Choi, Dongarra, J.J., Walker, D.W.

A 280mV-to-1.1V 256b reconfigurable SIMD vector permutation engine with 2-dimensional shuffle in 22nm CMOS von Hsu, S., Agarwal, A., Anders, M., Mathew, S., Kaul, H., Sheikh, F., Krishnamurthy, R.

A Sparse Matrix Fast Transpose Algorithm Based on Pseudo-Address von Da, Wenjiao, Ren, Zhiguo, Lu, Jiao, Shi, Xuxia

A simplified design strategy for mapping image processing algorithms on a SIMD torus von Seetharaman, Guna

An O(n) Time-Complexity Matrix Transpose on Torus Array Processor von Ravankar, A. A., Sedukhin, S. G.

A parallel cosmological hydrodynamics code von Bode, Paul W., Xu, Guohong, Cen, Renyue

A Parallel Cosmological Hydrodynamics Code von Renyue Cen, Guohong Xu, Bode, P.W.

Random Address Permute-Shift Technique for the Shared Memory on GPUs von Nakano, Koji, Matsumae, Susumu, Ito, Yasuaki

The Random Address Shift to Reduce the Memory Access Congestion on the Discrete Memory Machine von Nakano, Koji, Matsumae, Susumu, Ito, Yasuaki

Suchwerkzeuge:

Treffer weiter einschränken

Format

Schlagwortumfeld

Thema

Sprache

Erscheinungsjahr