Fast solution of large N× N matrix equations in an MIMD–SIMD Hybrid System
In this paper, we propose a new high-speed computation algorithm for solving a large N× N matrix system using the MIMD–SIMD Hybrid System. The MIMD–SIMD Hybrid System (also denoted as Hybrid System in this paper) is a new parallel architecture consisting of a combination of Cluster of Workstations (...
Uložené v:
| Vydané v: | Parallel computing Ročník 29; číslo 11; s. 1669 - 1684 |
|---|---|
| Hlavní autori: | , , , |
| Médium: | Journal Article |
| Jazyk: | English |
| Vydavateľské údaje: |
Elsevier B.V
01.11.2003
|
| Predmet: | |
| ISSN: | 0167-8191, 1872-7336 |
| On-line prístup: | Získať plný text |
| Tagy: |
Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
|
| Abstract | In this paper, we propose a new high-speed computation algorithm for solving a large
N×
N matrix system using the MIMD–SIMD Hybrid System. The MIMD–SIMD Hybrid System (also denoted as Hybrid System in this paper) is a new parallel architecture consisting of a combination of Cluster of Workstations (COWs) and SIMD systems working concurrently to produce an optimal parallel computation. We first introduce our prototype SIMD system and our Hybrid System setup before presenting how it can be implemented to find the unknowns in a large
N×
N linear matrix equation system using the
Gauss–LU algorithm. This algorithm basically performs the ‘Divide and Conquer’ approach by breaking down the large
N×
N matrix system into a manageable 32
×
32 matrix for fast computation. |
|---|---|
| AbstractList | In this paper, we propose a new high-speed computation algorithm for solving a large
N×
N matrix system using the MIMD–SIMD Hybrid System. The MIMD–SIMD Hybrid System (also denoted as Hybrid System in this paper) is a new parallel architecture consisting of a combination of Cluster of Workstations (COWs) and SIMD systems working concurrently to produce an optimal parallel computation. We first introduce our prototype SIMD system and our Hybrid System setup before presenting how it can be implemented to find the unknowns in a large
N×
N linear matrix equation system using the
Gauss–LU algorithm. This algorithm basically performs the ‘Divide and Conquer’ approach by breaking down the large
N×
N matrix system into a manageable 32
×
32 matrix for fast computation. |
| Author | Schroder, Heiko Chin Sim, Leo Chin Jian, Leo Leedham, Graham |
| Author_xml | – sequence: 1 givenname: Leo surname: Chin Sim fullname: Chin Sim, Leo email: ps1567958a@ntu.edu.sg organization: Centre for High Performance Embedded Systems, School of Computer Engineering, Nanyang Technological University (NTU), Blk N4, 2A-32, Nanyang Avenue, Singapore, Singapore – sequence: 2 givenname: Graham surname: Leedham fullname: Leedham, Graham organization: Centre for High Performance Embedded Systems, School of Computer Engineering, Nanyang Technological University (NTU), Blk N4, 2A-32, Nanyang Avenue, Singapore, Singapore – sequence: 3 givenname: Leo surname: Chin Jian fullname: Chin Jian, Leo organization: School of Engineering and Industrial Design, University of Western Sydney, Australia – sequence: 4 givenname: Heiko surname: Schroder fullname: Schroder, Heiko organization: School of Computer and IT, Royal Melbourne Institute of Technology, Australia |
| BookMark | eNp9kEFOwzAQRS1UJNrCCdj4AgnjOImdBQtUKK3UlkVhbTnOGLlqk2KniO64AwsOxE04CSllzepLM_-NRm9AenVTIyGXDGIGLL9axVvtTRMnADyGLAbGTkifSZFEgvO8R_pdS0SSFeyMDEJYAUCeSuiT-ViHloZmvWtdU9PG0rX2z0gXX590QTe69e6N4stOH9aBuprqms6n89vv949lF3SyL72r6HIfWtyck1Or1wEv_nJInsZ3j6NJNHu4n45uZpFhed5GXJQSUy5SlpgcKwBTcJCoUyu6uShRSgtFVlSJFhkyLU2ZFILZ3BjL08zyIeHHu8Y3IXi0auvdRvu9YqAORtRK_RpRByMKMtUZ6ajrI4Xda68OvQrGYW2wch5Nq6rG_cv_AGp9bSA |
| Cites_doi | 10.1016/S0167-8191(02)00182-5 10.1006/jpdc.2001.1768 10.1137/0909042 10.1016/S0167-8191(01)00134-X 10.1109/71.963416 10.1109/FMPC.1992.234898 10.1016/0167-8191(93)90036-K 10.1016/0743-7315(87)90002-5 10.1137/S1064827595281368 10.2172/5604546 10.1137/0909037 10.1016/S0129-6264(01)00069-5 |
| ContentType | Journal Article |
| Copyright | 2003 Elsevier B.V. |
| Copyright_xml | – notice: 2003 Elsevier B.V. |
| DBID | AAYXX CITATION |
| DOI | 10.1016/j.parco.2003.05.011 |
| DatabaseName | CrossRef |
| DatabaseTitle | CrossRef |
| DatabaseTitleList | |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Computer Science |
| EISSN | 1872-7336 |
| EndPage | 1684 |
| ExternalDocumentID | 10_1016_j_parco_2003_05_011 S0167819103001510 |
| GroupedDBID | --K --M -~X .DC .~1 0R~ 123 1B1 1~. 1~5 29O 4.4 457 4G. 5VS 6OB 7-5 71M 8P~ 9JN AACTN AAEDT AAEDW AAIAV AAIKJ AAKOC AALRI AAOAW AAQFI AAQXK AAXUO AAYFN ABBOA ABEFU ABFNM ABJNI ABMAC ABXDB ABYKQ ACDAQ ACGFS ACNNM ACRLP ACZNC ADBBV ADEZE ADJOM ADMUD ADTZH AEBSH AECPX AEKER AENEX AFKWA AFTJW AGHFR AGUBO AGYEJ AHHHB AHJVU AHZHX AIALX AIEXJ AIKHN AITUG AJBFU AJOXV ALMA_UNASSIGNED_HOLDINGS AMFUW AMRAJ AOUOD ASPBG AVWKF AXJTR AZFZN BJAXD BKOJK BLXMC CS3 DU5 EBS EFJIC EFLBG EJD EO8 EO9 EP2 EP3 F5P FDB FEDTE FGOYB FIRID FNPLU FYGXN G-Q G8K GBLVA GBOLZ HLZ HVGLF HZ~ H~9 IHE J1W JJJVA KOM LG9 M41 MO0 N9A O-L O9- OAUVE OZT P-8 P-9 P2P PC. Q38 R2- RIG ROL RPZ SBC SCC SDF SDG SDP SES SEW SPC SPCBC SST SSV SSZ T5K WH7 WUQ XPP ZMT ~G- 9DU AATTM AAXKI AAYWO AAYXX ABDPE ABWVN ACLOT ACRPL ACVFH ADCNI ADNMO AEIPS AEUPX AFJKZ AFPUW AGQPQ AIGII AIIUN AKBMS AKRWK AKYEP ANKPU APXCP CITATION EFKBS ~HD |
| ID | FETCH-LOGICAL-c166t-37b8e437412c6ed00c9308ea4f78e47be88f0959d2a75e1a8cb2971f6ccf345f3 |
| ISSN | 0167-8191 |
| IngestDate | Sat Nov 29 04:06:53 EST 2025 Fri Feb 23 02:30:43 EST 2024 |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 11 |
| Keywords | Speedup Gauss–LU Cluster Hybrid System Parallelization |
| Language | English |
| License | https://www.elsevier.com/tdm/userlicense/1.0 |
| LinkModel | OpenURL |
| MergedId | FETCHMERGED-LOGICAL-c166t-37b8e437412c6ed00c9308ea4f78e47be88f0959d2a75e1a8cb2971f6ccf345f3 |
| PageCount | 16 |
| ParticipantIDs | crossref_primary_10_1016_j_parco_2003_05_011 elsevier_sciencedirect_doi_10_1016_j_parco_2003_05_011 |
| PublicationCentury | 2000 |
| PublicationDate | 2003-11-00 |
| PublicationDateYYYYMMDD | 2003-11-01 |
| PublicationDate_xml | – month: 11 year: 2003 text: 2003-11-00 |
| PublicationDecade | 2000 |
| PublicationTitle | Parallel computing |
| PublicationYear | 2003 |
| Publisher | Elsevier B.V |
| Publisher_xml | – name: Elsevier B.V |
| References | E. Anderson, J.J. Dongarra, S. Ostrouchov, Installation guide for LAPACK, Computer Science Department, Technical Report CS-92-151, University of Tennessee, Knoxville, March 1992 Beaumont, Legrand, Rastello, Robert (BIB6) 2002; 28 Geist, Romine (BIB11) 1988; 9 Leo, Schroder, Leedham (BIB15) 2003; 29 A.K. Amoura, E. Bampis, J.-C. Konig, Efficient algorithms for parallel Gaussian Elimination on distributed memory machines, Report No. 1026 LRI, Université Paris-Sud, Orsay, France (1995) Li (BIB16) 2001; 61 Schimmler, Lang (BIB17) 1996; 2748 Anderson, Bai, Bischof, Blackford, Demmel, Dongarra, Du Croz, Greenbaum, Hammarling, McKenney, Sorensen (BIB2) 1999 Bai, Demmel, Dongarra, Petitet, Robinson, Stanley (BIB4) 1997; 18 J.J. Dongarra, Performance of various computers using standard linear equations software in a Fortran environment, Computer Science Department, Technical Report CS-89-85, University of Tennessee, Knoxville, TN, March 1990 J. Choi, J. Dongarra, R. Pozo, D.W. Walker, SCALAPACK: a scalable linear algebra for distributed memory concurrent computers, in: Proceedings of the Fourth Symposium on the Frontiers of Massively Parallel Computation, 1992, pp. 120–127 Schmidth, Schimmler, Schroder (BIB18) 1997; 1300 Dongarra, Bunch, Moler, Stewart (BIB10) 1979 Schroder (BIB19) 1988 Chu, George, Quesnel (BIB8) 1993; 19 Tiskin (BIB21) 2001; 11 Beaumont, Legrand, Rastello, Robert (BIB5) 2001; 12 Johnson (BIB14) 1987; 4 P. Stadzin, Transporting distributed BLAS to the Fujitsu AP3000 and the VP-300, in: Proceedings of the 8th Parallel Computing Workshop, Singapore, 1998, pp. 69–76 Heath, Romine (BIB13) 1988; 9 Geng, Oden, Van de Geijn (BIB12) 1990 Geist (10.1016/j.parco.2003.05.011_BIB11) 1988; 9 10.1016/j.parco.2003.05.011_BIB1 Geng (10.1016/j.parco.2003.05.011_BIB12) 1990 10.1016/j.parco.2003.05.011_BIB3 Bai (10.1016/j.parco.2003.05.011_BIB4) 1997; 18 Beaumont (10.1016/j.parco.2003.05.011_BIB6) 2002; 28 10.1016/j.parco.2003.05.011_BIB7 10.1016/j.parco.2003.05.011_BIB20 Beaumont (10.1016/j.parco.2003.05.011_BIB5) 2001; 12 10.1016/j.parco.2003.05.011_BIB9 Li (10.1016/j.parco.2003.05.011_BIB16) 2001; 61 Chu (10.1016/j.parco.2003.05.011_BIB8) 1993; 19 Schmidth (10.1016/j.parco.2003.05.011_BIB18) 1997; 1300 Leo (10.1016/j.parco.2003.05.011_BIB15) 2003; 29 Johnson (10.1016/j.parco.2003.05.011_BIB14) 1987; 4 Dongarra (10.1016/j.parco.2003.05.011_BIB10) 1979 Heath (10.1016/j.parco.2003.05.011_BIB13) 1988; 9 Schroder (10.1016/j.parco.2003.05.011_BIB19) 1988 Schimmler (10.1016/j.parco.2003.05.011_BIB17) 1996; 2748 Tiskin (10.1016/j.parco.2003.05.011_BIB21) 2001; 11 Anderson (10.1016/j.parco.2003.05.011_BIB2) 1999 |
| References_xml | – volume: 9 start-page: 639 year: 1988 end-page: 649 ident: BIB11 publication-title: SIAM Journal of Scientific Statistic Computing – year: 1979 ident: BIB10 article-title: LINPACK Users’ Guide – volume: 29 start-page: 21 year: 2003 end-page: 36 ident: BIB15 article-title: MIMD–SIMD Hybrid System––towards a new low cost parallel system publication-title: Journal of Parallel Computing – start-page: 83 year: 1988 end-page: 91 ident: BIB19 article-title: Instruction Systolic array––tradeoff between flexibility and speed publication-title: Computer Systems Science and Engineering – volume: 19 start-page: 243 year: 1993 end-page: 256 ident: BIB8 article-title: Parallel matrix inversion on a subcube-grid publication-title: Journal of Parallel Computing – reference: P. Stadzin, Transporting distributed BLAS to the Fujitsu AP3000 and the VP-300, in: Proceedings of the 8th Parallel Computing Workshop, Singapore, 1998, pp. 69–76 – year: 1990 ident: BIB12 article-title: Massively parallel computation for acoustical scattering problems using boundary elements methods publication-title: Journal of Sound and Vibration, USA – year: 1999 ident: BIB2 article-title: LAPACK User’s Guide – volume: 4 start-page: 133 year: 1987 end-page: 172 ident: BIB14 article-title: Communication efficient basic linear algebraic computation on hybercube architectures publication-title: Journal of Parallel and Distributed Computing – volume: 12 start-page: 1033 year: 2001 end-page: 1051 ident: BIB5 article-title: Matrix multiplication on heterogenous platforms publication-title: IEEE Transactions on Parallel and Distributed Systems – volume: 2748 start-page: 136 year: 1996 end-page: 144 ident: BIB17 article-title: The instruction Systolic array in image processing applications publication-title: Proceedings Europto 96 – reference: J. Choi, J. Dongarra, R. Pozo, D.W. Walker, SCALAPACK: a scalable linear algebra for distributed memory concurrent computers, in: Proceedings of the Fourth Symposium on the Frontiers of Massively Parallel Computation, 1992, pp. 120–127 – reference: J.J. Dongarra, Performance of various computers using standard linear equations software in a Fortran environment, Computer Science Department, Technical Report CS-89-85, University of Tennessee, Knoxville, TN, March 1990 – volume: 1300 start-page: 798 year: 1997 end-page: 806 ident: BIB18 article-title: Morphological Hough transform on the instruction Systolic array publication-title: Proceedings of EuroPar 97 – reference: E. Anderson, J.J. Dongarra, S. Ostrouchov, Installation guide for LAPACK, Computer Science Department, Technical Report CS-92-151, University of Tennessee, Knoxville, March 1992 – reference: A.K. Amoura, E. Bampis, J.-C. Konig, Efficient algorithms for parallel Gaussian Elimination on distributed memory machines, Report No. 1026 LRI, Université Paris-Sud, Orsay, France (1995) – volume: 9 start-page: 558 year: 1988 end-page: 588 ident: BIB13 article-title: Parallel solution of triangular systems on distributed memory multiprocessors publication-title: SIAM Journal of Scientific Statistic Computing – volume: 28 start-page: 155 year: 2002 end-page: 185 ident: BIB6 article-title: Dense linear algebra kernels on heterogeneous platforms: redistribution issue publication-title: Journal of Parallel Computing – volume: 11 start-page: 409 year: 2001 end-page: 422 ident: BIB21 article-title: A new way to divide and conquer publication-title: Parallel Processing Letters – volume: 61 start-page: 1709 year: 2001 end-page: 1731 ident: BIB16 article-title: Scalable parallel matrix multiplication on distributed memory parallel computers publication-title: Journal of Parallel and Distributed Computing – volume: 18 start-page: 1446 year: 1997 end-page: 1461 ident: BIB4 article-title: The spectral decomposition of non-symmetric matrices on distributed memory parallel computers publication-title: SIAM Journal on Scientific Computing – start-page: 83 issue: April year: 1988 ident: 10.1016/j.parco.2003.05.011_BIB19 article-title: Instruction Systolic array––tradeoff between flexibility and speed publication-title: Computer Systems Science and Engineering – year: 1999 ident: 10.1016/j.parco.2003.05.011_BIB2 – volume: 29 start-page: 21 year: 2003 ident: 10.1016/j.parco.2003.05.011_BIB15 article-title: MIMD–SIMD Hybrid System––towards a new low cost parallel system publication-title: Journal of Parallel Computing doi: 10.1016/S0167-8191(02)00182-5 – volume: 61 start-page: 1709 issue: 12 year: 2001 ident: 10.1016/j.parco.2003.05.011_BIB16 article-title: Scalable parallel matrix multiplication on distributed memory parallel computers publication-title: Journal of Parallel and Distributed Computing doi: 10.1006/jpdc.2001.1768 – ident: 10.1016/j.parco.2003.05.011_BIB9 – volume: 9 start-page: 639 year: 1988 ident: 10.1016/j.parco.2003.05.011_BIB11 article-title: LU factorization algorithms on distributed-memory architectures publication-title: SIAM Journal of Scientific Statistic Computing doi: 10.1137/0909042 – ident: 10.1016/j.parco.2003.05.011_BIB1 – volume: 28 start-page: 155 year: 2002 ident: 10.1016/j.parco.2003.05.011_BIB6 article-title: Dense linear algebra kernels on heterogeneous platforms: redistribution issue publication-title: Journal of Parallel Computing doi: 10.1016/S0167-8191(01)00134-X – volume: 12 start-page: 1033 issue: 10 year: 2001 ident: 10.1016/j.parco.2003.05.011_BIB5 article-title: Matrix multiplication on heterogenous platforms publication-title: IEEE Transactions on Parallel and Distributed Systems doi: 10.1109/71.963416 – ident: 10.1016/j.parco.2003.05.011_BIB7 doi: 10.1109/FMPC.1992.234898 – volume: 19 start-page: 243 year: 1993 ident: 10.1016/j.parco.2003.05.011_BIB8 article-title: Parallel matrix inversion on a subcube-grid publication-title: Journal of Parallel Computing doi: 10.1016/0167-8191(93)90036-K – volume: 4 start-page: 133 year: 1987 ident: 10.1016/j.parco.2003.05.011_BIB14 article-title: Communication efficient basic linear algebraic computation on hybercube architectures publication-title: Journal of Parallel and Distributed Computing doi: 10.1016/0743-7315(87)90002-5 – year: 1979 ident: 10.1016/j.parco.2003.05.011_BIB10 – volume: 1300 start-page: 798 year: 1997 ident: 10.1016/j.parco.2003.05.011_BIB18 article-title: Morphological Hough transform on the instruction Systolic array – volume: 18 start-page: 1446 year: 1997 ident: 10.1016/j.parco.2003.05.011_BIB4 article-title: The spectral decomposition of non-symmetric matrices on distributed memory parallel computers publication-title: SIAM Journal on Scientific Computing doi: 10.1137/S1064827595281368 – ident: 10.1016/j.parco.2003.05.011_BIB3 doi: 10.2172/5604546 – ident: 10.1016/j.parco.2003.05.011_BIB20 – volume: 9 start-page: 558 year: 1988 ident: 10.1016/j.parco.2003.05.011_BIB13 article-title: Parallel solution of triangular systems on distributed memory multiprocessors publication-title: SIAM Journal of Scientific Statistic Computing doi: 10.1137/0909037 – volume: 2748 start-page: 136 year: 1996 ident: 10.1016/j.parco.2003.05.011_BIB17 article-title: The instruction Systolic array in image processing applications – volume: 11 start-page: 409 year: 2001 ident: 10.1016/j.parco.2003.05.011_BIB21 article-title: A new way to divide and conquer publication-title: Parallel Processing Letters doi: 10.1016/S0129-6264(01)00069-5 – issue: September year: 1990 ident: 10.1016/j.parco.2003.05.011_BIB12 article-title: Massively parallel computation for acoustical scattering problems using boundary elements methods publication-title: Journal of Sound and Vibration, USA |
| SSID | ssj0006480 |
| Score | 1.6423031 |
| Snippet | In this paper, we propose a new high-speed computation algorithm for solving a large
N×
N matrix system using the MIMD–SIMD Hybrid System. The MIMD–SIMD Hybrid... |
| SourceID | crossref elsevier |
| SourceType | Index Database Publisher |
| StartPage | 1669 |
| SubjectTerms | Cluster Gauss–LU Hybrid System Parallelization Speedup |
| Title | Fast solution of large N× N matrix equations in an MIMD–SIMD Hybrid System |
| URI | https://dx.doi.org/10.1016/j.parco.2003.05.011 |
| Volume | 29 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVESC databaseName: ScienceDirect customDbUrl: eissn: 1872-7336 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0006480 issn: 0167-8191 databaseCode: AIEXJ dateStart: 19950101 isFulltext: true titleUrlDefault: https://www.sciencedirect.com providerName: Elsevier |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3NbtNAEF5FLQcu_CPKn_bAzRjZXtvrPVbQKq2oVSlFys2y17sibbBDmlThxjsgxAPxJjwJM95dJ1UQAiQuVrLK-mfny8zseL4ZQl5EocqCuNZ-qmTqx0nCfVHryFd1rUOdahFXptkEz_NsPBang8FXx4W5mvKmyVYrMfuvooYxEDZSZ_9C3P1JYQA-g9DhCGKH4x8J_rC8XHjuEugLTjHZ28vxlbjgXu59wKr8K099XK4TyfGVzdHJG5f6wEbwxRt-QjqXLWq-6cWelnNswTLtEtKXC2f9uiwBONnIdGh-q9qNdJ_aErINNfva748nlgCxnjCS7-dtbeA0VJOL9lp0glmaXh8y26LNmCgmaGfcKRojZDRvxsHVZ6YailPNNhhiIRhuKNowNR1erNEOU9NobssgmNjE-asZYKnjejJTqDVc278-K3GE94W3BYoPvCQk7u1GPBGg73f3jw7Gx72JT-OuJV__HK6cVZc4uHWpX7s8G27M2R1yy-4_6L7BzV0yUM09ctv19qBW1d8nJwgj6mBEW007GNH8-zeaUwMh2kOIThpaNhQh9OPzFwQPNeChBjwPyLvDg7PXQ9-23vAlrOwCzE6VqZiBuxnJVNVBIAULMlXGmsM4r1SWaYwg11HJExWWmawiweHPLaVmcaLZQ7LTtI16RKgUSSVh16C0CJBCX0nNWYRuItOMc7ZHXrrVKWamwkrhUg_Pi24xsVcqK4KkgMXcI6lbwcI6icb5K0Dkv5v4-F8nPiE318h-SnYW86V6Rm7Iq8Xkcv7cQuMnmOmIPw |
| linkProvider | Elsevier |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Fast+solution+of+large+N%C3%97+N+matrix+equations+in+an+MIMD%E2%80%93SIMD+Hybrid+System&rft.jtitle=Parallel+computing&rft.au=Chin+Sim%2C+Leo&rft.au=Leedham%2C+Graham&rft.au=Chin+Jian%2C+Leo&rft.au=Schroder%2C+Heiko&rft.date=2003-11-01&rft.pub=Elsevier+B.V&rft.issn=0167-8191&rft.eissn=1872-7336&rft.volume=29&rft.issue=11&rft.spage=1669&rft.epage=1684&rft_id=info:doi/10.1016%2Fj.parco.2003.05.011&rft.externalDocID=S0167819103001510 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0167-8191&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0167-8191&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0167-8191&client=summon |