Fast solution of large N × N matrix equations in an MIMD–SIMD Hybrid System

In this paper, we propose a new high-speed computation algorithm for solving a large N × N matrix system using the MIMD–SIMD Hybrid System. The MIMD–SIMD Hybrid System (also denoted as Hybrid System in this paper) is a new parallel architecture consisting of a combination of Cluster of Workstations (...

Bibliographic details
Published in:	Parallel Computing, Volume 29, Issue 11, pp. 1669–1684
Main authors:	Chin Sim, Leo; Leedham, Graham; Chin Jian, Leo; Schroder, Heiko
Format:	Journal Article
Language:	English
Published:	Elsevier B.V., 1 November 2003
Subjects:	Cluster; Gauss–LU; Hybrid System; Parallelization; Speedup
ISSN:	0167-8191, 1872-7336

Abstract	In this paper, we propose a new high-speed algorithm for solving a large N × N matrix system on the MIMD–SIMD Hybrid System. The MIMD–SIMD Hybrid System (referred to as the Hybrid System in this paper) is a new parallel architecture that combines Clusters of Workstations (COWs) and SIMD systems working concurrently to produce an optimal parallel computation. We first introduce our prototype SIMD system and our Hybrid System setup, then show how the Hybrid System can be used to find the unknowns of a large N × N system of linear equations with the Gauss–LU algorithm. The algorithm takes a divide-and-conquer approach, breaking the large N × N matrix system down into manageable 32 × 32 matrices for fast computation.
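The blocking idea mentioned in the abstract can be illustrated with a minimal sketch. This is a generic, sequential, right-looking blocked LU factorization without pivoting, not the authors' Gauss–LU algorithm and not their MIMD–SIMD mapping, neither of which this record describes in detail. The function names `blocked_lu` and `lu_solve` are hypothetical, and the default block size of 32 is taken only from the 32 × 32 figure in the abstract.

```python
# Sketch only: generic blocked LU without pivoting, assuming nonzero pivots.
# Function names and structure are illustrative assumptions, not the paper's method.

def blocked_lu(a, block=32):
    """Factor the square matrix `a` in place; afterwards it stores L (unit lower) and U."""
    n = len(a)
    for k0 in range(0, n, block):
        k1 = min(k0 + block, n)
        # 1. Factor the current panel (diagonal block plus the rows below it).
        for k in range(k0, k1):
            pivot = a[k][k]
            if pivot == 0:
                raise ZeroDivisionError("zero pivot; this sketch does not pivot")
            for i in range(k + 1, n):
                a[i][k] /= pivot
                for j in range(k + 1, k1):
                    a[i][j] -= a[i][k] * a[k][j]
        # 2. Block row of U: forward-substitute with the unit lower diagonal block.
        for k in range(k0, k1):
            for i in range(k + 1, k1):
                for j in range(k1, n):
                    a[i][j] -= a[i][k] * a[k][j]
        # 3. Trailing submatrix update: A22 -= L21 * U12.
        for i in range(k1, n):
            for k in range(k0, k1):
                lik = a[i][k]
                for j in range(k1, n):
                    a[i][j] -= lik * a[k][j]
    return a


def lu_solve(lu, b):
    """Solve A x = b given the packed LU factors produced by blocked_lu."""
    n = len(lu)
    x = list(b)
    # Forward substitution with the unit lower triangular factor L.
    for i in range(n):
        for j in range(i):
            x[i] -= lu[i][j] * x[j]
    # Back substitution with the upper triangular factor U.
    for i in range(n - 1, -1, -1):
        for j in range(i + 1, n):
            x[i] -= lu[i][j] * x[j]
        x[i] /= lu[i][i]
    return x
```

Each pass touches one diagonal block and the panels beside it, which is the property that lets a blocked scheme hand fixed-size tiles to a small, fast processing unit. A production solver would add pivoting for numerical stability.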
Author	Schroder, Heiko; Chin Sim, Leo; Chin Jian, Leo; Leedham, Graham
Author_xml
– sequence: 1; givenname: Leo; surname: Chin Sim; fullname: Chin Sim, Leo; email: ps1567958a@ntu.edu.sg; organization: Centre for High Performance Embedded Systems, School of Computer Engineering, Nanyang Technological University (NTU), Blk N4, 2A-32, Nanyang Avenue, Singapore, Singapore
– sequence: 2; givenname: Graham; surname: Leedham; fullname: Leedham, Graham; organization: Centre for High Performance Embedded Systems, School of Computer Engineering, Nanyang Technological University (NTU), Blk N4, 2A-32, Nanyang Avenue, Singapore, Singapore
– sequence: 3; givenname: Leo; surname: Chin Jian; fullname: Chin Jian, Leo; organization: School of Engineering and Industrial Design, University of Western Sydney, Australia
– sequence: 4; givenname: Heiko; surname: Schroder; fullname: Schroder, Heiko; organization: School of Computer and IT, Royal Melbourne Institute of Technology, Australia
ContentType	Journal Article
Copyright	2003 Elsevier B.V.
DOI	10.1016/j.parco.2003.05.011
Discipline	Computer Science
EISSN	1872-7336
EndPage	1684
ExternalDocumentID	10_1016_j_parco_2003_05_011 S0167819103001510
ISSN	0167-8191
IsPeerReviewed	true
IsScholarly	true
Issue	11
Keywords	Speedup; Gauss–LU; Cluster; Hybrid System; Parallelization
Language	English
License	https://www.elsevier.com/tdm/userlicense/1.0
PageCount	16
PublicationCentury	2000
PublicationDate	2003-11-00
PublicationDateYYYYMMDD	2003-11-01
PublicationDate_xml	– month: 11 year: 2003 text: 2003-11-00
PublicationDecade	2000
PublicationTitle	Parallel computing
PublicationYear	2003
Publisher	Elsevier B.V
Publisher_xml	– name: Elsevier B.V
StartPage	1669
SubjectTerms	Cluster; Gauss–LU; Hybrid System; Parallelization; Speedup
Title	Fast solution of large N × N matrix equations in an MIMD–SIMD Hybrid System
URI	https://dx.doi.org/10.1016/j.parco.2003.05.011
Volume	29