A programming methodology for designing block recursive algorithms on various computer networks

Published in: Proceedings. International Conference on Parallel Processing Workshop, pp. 607-614
Main authors: Min-Hsuan Fan, Chua-Huang Huang, Yeh-Ching Chung
Format: Conference paper
Language: English
Published: IEEE 2002
ISBN:9780769516806, 0769516807
ISSN:1530-2016
Description
Summary: In this paper, we use the tensor product notation as the framework of a programming methodology for designing block recursive algorithms on various computer networks. In our previous works, we proposed a programming methodology for designing block recursive algorithms on shared-memory and distributed-memory multiprocessors without considering the interconnection of processors. We extend that work to consider block recursive algorithms on direct networks and multistage interconnection networks. We use parallel prefix computation as an example to illustrate the methodology. First, we represent the prefix computation problem as a computational matrix, which may not be suitable for deriving algorithms on specific computer networks. In this methodology, we add two steps to derive tensor product formulas of parallel prefix algorithms on computer networks: (1) decompose the computational matrix into two submatrices, and (2) construct an augmented matrix. The augmented matrix can be factorized so that each term is a tensor product formula and can fit into a specified network topology. With the augmented matrix, the input data is also extended: in addition to the input data, an auxiliary vector is used as temporary storage. The content of the temporary storage is determined by the decomposition of the original computational matrix. We present the methodology to derive various parallel prefix algorithms on hypercube, omega, and baseline networks, and verify the correctness of the resulting tensor product formulas using induction.
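For intuition only (this is not the paper's tensor-product derivation): the classic hypercube-style prefix-sum doubling scheme, simulated sequentially below, is the kind of algorithm the paper's formulas describe. Each of the log2(n) rounds combines values across one hypercube dimension, mirroring how individual tensor product factors map to network stages. The function name and structure are illustrative assumptions.

```python
def hypercube_prefix_sum(x):
    """Simulate prefix sums on a hypercube of n = 2^k nodes, one value per node.

    In each round d = 1, 2, 4, ..., every node exchanges a running segment
    sum with its neighbor across one hypercube dimension (index i XOR d).
    """
    n = len(x)
    assert n and (n & (n - 1)) == 0, "length must be a power of two"
    prefix = list(x)   # running prefix held by each "node"
    total = list(x)    # running segment sum exchanged across dimensions
    d = 1
    while d < n:
        new_prefix = prefix[:]
        new_total = total[:]
        for i in range(n):
            partner = i ^ d        # neighbor across the current dimension
            if partner < i:        # partner's segment precedes this node's
                new_prefix[i] = total[partner] + prefix[i]
            new_total[i] = total[partner] + total[i]
        prefix, total = new_prefix, new_total
        d <<= 1
    return prefix
```

For example, `hypercube_prefix_sum([1, 2, 3, 4])` returns `[1, 3, 6, 10]` after log2(4) = 2 communication rounds, whereas a sequential scan would take 3 additions in series.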
DOI:10.1109/ICPPW.2002.1039783