On designing optimal parallel triangular solvers
This paper explores the problem of solving triangular linear systems on parallel distributed-memory machines. Working within the LogP model, tight asymptotic bounds for solving these systems using forward/backward substitution are presented. Specifically, lower bounds on execution time independent o...
| Published in: | Information and computation, vol. 161, no. 2, pp. 172-210 |
|---|---|
| Main author: | Santos, Eunice E. |
| Format: | Journal Article |
| Language: | English |
| Published: | San Diego, CA: Elsevier Inc, 2000 |
| Subjects: | |
| ISSN: | 0890-5401, 1090-2651 |
| Online access: | Get full text |
| Abstract | This paper explores the problem of solving triangular linear systems on parallel distributed-memory machines. Working within the LogP model, tight asymptotic bounds for solving these systems using forward/backward substitution are presented. Specifically, lower bounds on execution time independent of the data layout, lower bounds for data layouts in which the number of data items per processor is bounded, and lower bounds for specific data layouts commonly used in designing parallel algorithms for this problem are presented in this paper. Furthermore, algorithms are provided which have running times within a constant factor of the lower bounds described. One interesting result is that the popular two-dimensional block matrix layout necessarily results in significantly longer running times than simpler one-dimensional schemes. Finally, a generalization of the lower bounds to banded triangular linear systems is presented. |
|---|---|
| Author | Santos, Eunice E. |
| Author_xml | sequence: 1; givenname: Eunice E.; surname: Santos; fullname: Santos, Eunice E.; email: santos@cs.vt.edu; organization: Department of Computer Science, Virginia Polytechnic Institute & State University, Blacksburg, Virginia, USA |
| BackLink | http://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=1520418$$DView record in Pascal Francis |
| CODEN | INFCEC |
| CitedBy_id | crossref_primary_10_1016_j_procs_2011_04_025 crossref_primary_10_1016_j_cpc_2006_05_002 crossref_primary_10_1023_A_1020586405464 |
| Cites_doi | 10.1006/jpdc.1999.1529 10.1137/0909037 10.1137/0714076 10.1137/0910025 10.1016/0167-8191(88)90009-9 10.1145/240455.240477 |
| ContentType | Journal Article |
| Copyright | 2000 Academic Press 2000 INIST-CNRS |
| Copyright_xml | – notice: 2000 Academic Press – notice: 2000 INIST-CNRS |
| DOI | 10.1006/inco.2000.2866 |
| DatabaseName | ScienceDirect Open Access Titles Elsevier:ScienceDirect:Open Access CrossRef Pascal-Francis |
| DatabaseTitle | CrossRef |
| DatabaseTitleList | |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Engineering Computer Science Mathematics Applied Sciences |
| EISSN | 1090-2651 |
| EndPage | 210 |
| ExternalDocumentID | 1520418 10_1006_inco_2000_2866 S0890540100928669 |
| ISICitedReferencesCount | 3 |
| ISSN | 0890-5401 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 2 |
| Keywords | matrix computation; distributed-memory; LogP model; numerical methods; triangular solvers; parallel algorithms and complexity; Triangular matrix; Parallel algorithm; Triangular linear system; Numerical method; Forward substitution scheme; Complexity; Latency; Equation system; Distributed memory; Distributed memory multiprocessor system; Linear system; Computer program; Matrix calculus; Backward substitution scheme; Time complexity; Message transmission |
| Language | English |
| License | http://www.elsevier.com/open-access/userlicense/1.0 https://www.elsevier.com/tdm/userlicense/1.0 https://www.elsevier.com/open-access/userlicense/1.0 CC BY 4.0 |
| LinkModel | OpenURL |
| OpenAccessLink | https://dx.doi.org/10.1006/inco.2000.2866 |
| PageCount | 39 |
| PublicationCentury | 2000 |
| PublicationDate | 2000-00-00 |
| PublicationDateYYYYMMDD | 2000-01-01 |
| PublicationDate_xml | – year: 2000 text: 2000-00-00 |
| PublicationDecade | 2000 |
| PublicationPlace | San Diego, CA |
| PublicationPlace_xml | – name: San Diego, CA |
| PublicationTitle | Information and computation |
| PublicationYear | 2000 |
| Publisher | Elsevier Inc Elsevier |
| Publisher_xml | – name: Elsevier Inc – name: Elsevier |
| SSID | ssj0011546 |
| Score | 1.5519848 |
| SourceID | pascalfrancis crossref elsevier |
| SourceType | Index Database Enrichment Source Publisher |
| StartPage | 172 |
| SubjectTerms | Algorithmics. Computability. Computer arithmetics; Applied sciences; Computer science; control theory; systems; distributed-memory; Exact sciences and technology; LogP model; Mathematics; matrix computation; Nonlinear algebraic and transcendental equations; Numerical analysis; Numerical analysis. Scientific computation; Numerical linear algebra; numerical methods; parallel algorithms and complexity; Sciences and techniques of general use; Theoretical computing; triangular solvers |
| Title | On designing optimal parallel triangular solvers |
| URI | https://dx.doi.org/10.1006/inco.2000.2866 |
| Volume | 161 |
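
For readers unfamiliar with the substitution scheme named in the abstract, the following is a minimal sequential sketch of forward substitution for a lower-triangular system. It is an illustration only, not the paper's parallel algorithm: the function name `forward_substitution` and the list-of-lists matrix representation are assumptions made here, and no LogP communication costs or data layouts are modeled.

```python
from typing import List


def forward_substitution(L: List[List[float]], b: List[float]) -> List[float]:
    """Solve L x = b, assuming L is lower triangular with a nonzero diagonal.

    Hypothetical helper for illustration; not taken from the paper.
    """
    n = len(b)
    x = [0.0] * n
    for i in range(n):
        # Subtract the contributions of the already-computed unknowns x[0..i-1].
        partial = sum(L[i][j] * x[j] for j in range(i))
        x[i] = (b[i] - partial) / L[i][i]
    return x


if __name__ == "__main__":
    L = [[2.0, 0.0, 0.0],
         [1.0, 3.0, 0.0],
         [4.0, 5.0, 6.0]]
    b = [2.0, 7.0, 32.0]
    print(forward_substitution(L, b))
```

Each `x[i]` depends on every earlier entry of `x`. On a distributed-memory machine those values must be communicated between processors, which suggests why the choice of data layout would affect achievable running time, as the abstract reports.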