Restructuring and implementations of 2D matrix transpose algorithm using SSE4 vector instructions
Current general-purpose processors are augmented with vector instructions that can process many elements of matrices and vectors in parallel. Transposing a matrix in-place is a main kernel operation required by many scientific and engineering applications to shuttle data before, during, or after pro...
Saved in:
| Published in: | 2015 International Conference on Applied Research in Computer Science and Engineering (ICAR) pp. 1 - 7 |
|---|---|
| Main Author: | |
| Format: | Conference Proceeding |
| Language: | English |
| Published: |
IEEE
01.10.2015
|
| Subjects: | |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Be the first to leave a comment!