On designing optimal parallel triangular solvers

This paper explores the problem of solving triangular linear systems on parallel distributed-memory machines. Working within the LogP model, tight asymptotic bounds for solving these systems using forward/backward substitution are presented. Specifically, lower bounds on execution time independent o...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Information and computation Ročník 161; číslo 2; s. 172 - 210
Hlavní autor: Santos, Eunice E.
Médium: Journal Article
Jazyk:angličtina
Vydáno: San Diego, CA Elsevier Inc 2000
Elsevier
Témata:
ISSN:0890-5401, 1090-2651
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Abstract This paper explores the problem of solving triangular linear systems on parallel distributed-memory machines. Working within the LogP model, tight asymptotic bounds for solving these systems using forward/backward substitution are presented. Specifically, lower bounds on execution time independent of the data layout, lower bounds for data layouts in which the number of data items per processor is bounded, and lower bounds for specific data layouts commonly used in designing parallel algorithms for this problem are presented in this paper. Furthermore, algorithms are provided which have running times within a constant factor of the lower bounds described. One interesting result is that the popular two-dimensional block matrix layout necessarily results in significantly longer running times than simpler one-dimensional schemes. Finally, a generalization of the lower bounds to banded triangular linear systems is presented.
AbstractList This paper explores the problem of solving triangular linear systems on parallel distributed-memory machines. Working within the LogP model, tight asymptotic bounds for solving these systems using forward/backward substitution are presented. Specifically, lower bounds on execution time independent of the data layout, lower bounds for data layouts in which the number of data items per processor is bounded, and lower bounds for specific data layouts commonly used in designing parallel algorithms for this problem are presented in this paper. Furthermore, algorithms are provided which have running times within a constant factor of the lower bounds described. One interesting result is that the popular two-dimensional block matrix layout necessarily results in significantly longer running times than simpler one-dimensional schemes. Finally, a generalization of the lower bounds to banded triangular linear systems is presented.
Author Santos, Eunice E.
Author_xml – sequence: 1
  givenname: Eunice E.
  surname: Santos
  fullname: Santos, Eunice E.
  email: santos@cs.vt.edu
  organization: Department of Computer Science, Virginia Polytechnic Institute & State University, Blacksburg, Virginia, USA
BackLink http://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=1520418$$DView record in Pascal Francis
BookMark eNp1kD1rwzAQhkVJoUnatbOHrnZPtqUqYwn9gkCW7OIsn4KKKhnJDfTf1yaFQqHT3fA-9_Gs2CLEQIzdcqg4gLx3wcSqBoCqVlJesCWHDZS1FHzBlqCmXrTAr9gq53cAzkUrlwz2oegpu2Nw4VjEYXQf6IsBE3pPvhiTw3D89JiKHP2JUr5mlxZ9ppufumaH56fD9rXc7V_eto-70jRcjCWHRnSAYlrbW4tKkq1VrfpaIVoJD7KRRLbdkOpRWUXSdhw7I1sUnexMs2Z357EDZoPeJgzGZT2k6b70pbmooeVqilXnmEkx50T2NwF6tqJnK3q2omcrE9D-AYwbcXQxjAmd_x9TZ4yml0-Oks7GUTDUu0Rm1H10_6HfEk98wA
CODEN INFCEC
CitedBy_id crossref_primary_10_1016_j_procs_2011_04_025
crossref_primary_10_1016_j_cpc_2006_05_002
crossref_primary_10_1023_A_1020586405464
Cites_doi 10.1006/jpdc.1999.1529
10.1137/0909037
10.1137/0714076
10.1137/0910025
10.1016/0167-8191(88)90009-9
10.1145/240455.240477
ContentType Journal Article
Copyright 2000 Academic Press
2000 INIST-CNRS
Copyright_xml – notice: 2000 Academic Press
– notice: 2000 INIST-CNRS
DBID 6I.
AAFTH
AAYXX
CITATION
IQODW
DOI 10.1006/inco.2000.2866
DatabaseName ScienceDirect Open Access Titles
Elsevier:ScienceDirect:Open Access
CrossRef
Pascal-Francis
DatabaseTitle CrossRef
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
Computer Science
Mathematics
Applied Sciences
EISSN 1090-2651
EndPage 210
ExternalDocumentID 1520418
10_1006_inco_2000_2866
S0890540100928669
GroupedDBID --K
--M
--Z
-~X
.~1
0R~
1B1
1~.
1~5
29I
4.4
457
4G.
5GY
5VS
6I.
6TJ
7-5
71M
8P~
9JN
AACTN
AAEDT
AAEDW
AAFTH
AAIAV
AAIKJ
AAKOC
AALRI
AAOAW
AAQFI
AAQXK
AAXUO
AAYFN
ABAOU
ABBOA
ABFNM
ABJNI
ABMAC
ABTAH
ABVKL
ABXDB
ABYKQ
ACAZW
ACDAQ
ACGFS
ACNNM
ACRLP
ACZNC
ADBBV
ADEZE
ADFGL
ADMUD
AEBSH
AEKER
AENEX
AEXQZ
AFKWA
AFTJW
AGHFR
AGUBO
AGYEJ
AHHHB
AHZHX
AIALX
AIEXJ
AIKHN
AITUG
AJBFU
AJOXV
ALMA_UNASSIGNED_HOLDINGS
AMFUW
AMRAJ
AOUOD
ARUGR
ASPBG
AVWKF
AXJTR
AZFZN
BKOJK
BLXMC
CAG
COF
CS3
DM4
DU5
E3Z
EBS
EFBJH
EFLBG
EJD
EO8
EO9
EP2
EP3
FDB
FEDTE
FGOYB
FIRID
FNPLU
FYGXN
G-Q
G8K
GBLVA
GBOLZ
HVGLF
HZ~
H~9
IHE
IXB
J1W
KOM
LG5
LX9
M41
MHUIS
MO0
MVM
N9A
NCXOZ
O-L
O9-
OAUVE
OK1
OZT
P-8
P-9
P2P
PC.
Q38
R2-
RIG
RNS
ROL
RPZ
SDF
SDG
SDP
SES
SEW
SPC
SPCBC
SSV
SSW
SSZ
T5K
TN5
WH7
WUQ
XJT
XPP
ZMT
ZU3
ZY4
~G-
9DU
AATTM
AAXKI
AAYWO
AAYXX
ABDPE
ABWVN
ACLOT
ACRPL
ACVFH
ADCNI
ADNMO
ADVLN
AEIPS
AEUPX
AFJKZ
AFPUW
AGQPQ
AIGII
AIIUN
AKBMS
AKRWK
AKYEP
ANKPU
APXCP
CITATION
EFKBS
~HD
AFXIZ
AGCQF
AGRNS
BNPGV
IQODW
SSH
ID FETCH-LOGICAL-c315t-1035b0a5890dffa86ef2828d28aaf607636eef49e8da8f8e6fb1abc64a5b6bc3
ISICitedReferencesCount 3
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000089567800003&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 0890-5401
IngestDate Mon Jul 21 09:13:58 EDT 2025
Sat Nov 29 01:56:25 EST 2025
Tue Nov 18 22:43:36 EST 2025
Fri Feb 23 02:30:04 EST 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 2
Keywords matrix computation
distributed-memory
LogP model
numerical methods
triangular solvers
parallel algorithmsand complexity
Triangular matrix
Parallel algorithm
Triangular linear system
Numerical method
Forward substitution scheme
Complexity
Latency
Equation system
Distributed memory
Distributed memory multiprocessor system
Linear system
Computer program
Matrix calculus
Backward substitution scheme
Time complexity
Message transmission
Language English
License http://www.elsevier.com/open-access/userlicense/1.0
https://www.elsevier.com/tdm/userlicense/1.0
https://www.elsevier.com/open-access/userlicense/1.0
CC BY 4.0
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c315t-1035b0a5890dffa86ef2828d28aaf607636eef49e8da8f8e6fb1abc64a5b6bc3
OpenAccessLink https://dx.doi.org/10.1006/inco.2000.2866
PageCount 39
ParticipantIDs pascalfrancis_primary_1520418
crossref_primary_10_1006_inco_2000_2866
crossref_citationtrail_10_1006_inco_2000_2866
elsevier_sciencedirect_doi_10_1006_inco_2000_2866
PublicationCentury 2000
PublicationDate 2000-00-00
PublicationDateYYYYMMDD 2000-01-01
PublicationDate_xml – year: 2000
  text: 2000-00-00
PublicationDecade 2000
PublicationPlace San Diego, CA
PublicationPlace_xml – name: San Diego, CA
PublicationTitle Information and computation
PublicationYear 2000
Publisher Elsevier Inc
Elsevier
Publisher_xml – name: Elsevier Inc
– name: Elsevier
References Santos (bib16) 1995
Eisenstat, Heath, Henkel, Romine (bib5) 1998
Gallivan (bib6) 1990
Dongarra, van de Geijn, Walker (bib4) April 1992
Borodin, Munro (bib1) 1975
Karp, Sahay, Santos, Schauser (bib11) 1993
Culler, Karp, Patterson, Sahay, Santos, Schauser, Subramonian, von Eicken (bib2) November 1996
(bib9) 1992
Karp, Ramachandran (bib12) 1990
Heath, Romine (bib7) 1988
Santos (bib17) 1999; 57
Heller (bib8) 1987; 29
Li, Coleman (bib13) 1989
Higham (bib10) 1993
Demmel, Heaths, van der Vorst (bib3) 1993
Romine, Ortega (bib14) 1988; 6
Sameh, Brent (bib15) 1977
Heath (10.1006/inco.2000.2866_bib7) 1988
Demmel (10.1006/inco.2000.2866_bib3) 1993
Dongarra (10.1006/inco.2000.2866_bib4) 1992
Romine (10.1006/inco.2000.2866_bib14) 1988; 6
Borodin (10.1006/inco.2000.2866_bib1) 1975
Higham (10.1006/inco.2000.2866_bib10) 1993
Santos (10.1006/inco.2000.2866_bib16) 1995
Heller (10.1006/inco.2000.2866_bib8) 1987; 29
Karp (10.1006/inco.2000.2866_bib11) 1993
Li (10.1006/inco.2000.2866_bib13) 1989
Santos (10.1006/inco.2000.2866_bib17) 1999; 57
Gallivan (10.1006/inco.2000.2866_bib6) 1990
Sameh (10.1006/inco.2000.2866_bib15) 1977
(10.1006/inco.2000.2866_bib9) 1992
Culler (10.1006/inco.2000.2866_bib2) 1996
Eisenstat (10.1006/inco.2000.2866_bib5) 1998
Karp (10.1006/inco.2000.2866_bib12) 1990
References_xml – year: 1990
  ident: bib6
  article-title: Parallel Algorithms for Matrix Computations
– year: 1993
  ident: bib3
  article-title: Parallel Numerical Linear Algebra
  publication-title: Technical Report, UCB/CSD 93/703
– year: 1993
  ident: bib10
  article-title: Stability of Parallel Triangular System Solvers
  publication-title: Technical Report Numerical Analysis Report, 236
– year: 1977
  ident: bib15
  article-title: Solving triangular systems on a parallel computer
  publication-title: SIAM J. Numer. Anal.
– volume: 29
  year: 1987
  ident: bib8
  article-title: A survey of parallel algorithms in numerical linear algebra
  publication-title: SIAM J. Numer. Anal.
– year: 1992
  ident: bib9
  article-title: High Performance Fortran Forum
  publication-title: High Performance Fortran Language Specification
– start-page: 553
  year: 1995
  end-page: 560
  ident: bib16
  article-title: Solving triangular linear systems in parallel using substitution
  publication-title: Proceedings of the Seventh Annual IEEE Symposium on Parallel and Distributed Processing
– year: 1975
  ident: bib1
  article-title: The Computational Complexity of Algebraic and Numeric Problems
– year: 1988
  ident: bib7
  article-title: Parallel solution of triangular systems on distributed-memory multiprocessors
  publication-title: SIAM J. Sci. Stat. Comput.
– year: 1989
  ident: bib13
  article-title: A new method for solving triangular systems on distributed-memory message-passing multiprocessors
  publication-title: SIAM J. Sci. Stat. Comput.
– volume: 6
  start-page: 109
  year: 1988
  end-page: 114
  ident: bib14
  article-title: Parallel solution of triangular systems of equations
  publication-title: Parallel Comput.
– year: 1998
  ident: bib5
  article-title: Modified cyclic algorithms for solving triangular systems on distributed memory multi-processors
  publication-title: SIAM J. Sci. Stat. Comput.
– year: November 1996
  ident: bib2
  article-title: LogP: A practical model of parallel computation
  publication-title: Communications of the ACM
– start-page: 869
  year: 1990
  end-page: 941
  ident: bib12
  article-title: Parallel algorithms for shared-memory machines
  publication-title: Handbook of Theoretical Computer Science, Volume A: Algorithms and Complexity
– year: 1993
  ident: bib11
  article-title: Optimal broadcast and summation on the
  publication-title: Fifth Annual ACM Symposium on Parallel Algorithms and Architectures
– volume: 57
  start-page: 121
  year: 1999
  end-page: 139
  ident: bib17
  article-title: Optimal and near-optimal algorithms for
  publication-title: J. Parallel Distributed Comput.
– year: April 1992
  ident: bib4
  article-title: A look at scalable dense linear algebra libraries
  publication-title: Scalable High-Performance Computing Conference
– volume: 57
  start-page: 121
  year: 1999
  ident: 10.1006/inco.2000.2866_bib17
  article-title: Optimal and near-optimal algorithms for k-item broadcast
  publication-title: J. Parallel Distributed Comput.
  doi: 10.1006/jpdc.1999.1529
– volume: 29
  year: 1987
  ident: 10.1006/inco.2000.2866_bib8
  article-title: A survey of parallel algorithms in numerical linear algebra
  publication-title: SIAM J. Numer. Anal.
– year: 1992
  ident: 10.1006/inco.2000.2866_bib9
  article-title: High Performance Fortran Forum
  publication-title: High Performance Fortran Language Specification
– year: 1988
  ident: 10.1006/inco.2000.2866_bib7
  article-title: Parallel solution of triangular systems on distributed-memory multiprocessors
  publication-title: SIAM J. Sci. Stat. Comput.
  doi: 10.1137/0909037
– year: 1993
  ident: 10.1006/inco.2000.2866_bib3
  article-title: Parallel Numerical Linear Algebra
– year: 1990
  ident: 10.1006/inco.2000.2866_bib6
– year: 1977
  ident: 10.1006/inco.2000.2866_bib15
  article-title: Solving triangular systems on a parallel computer
  publication-title: SIAM J. Numer. Anal.
  doi: 10.1137/0714076
– start-page: 869
  year: 1990
  ident: 10.1006/inco.2000.2866_bib12
  article-title: Parallel algorithms for shared-memory machines
– year: 1989
  ident: 10.1006/inco.2000.2866_bib13
  article-title: A new method for solving triangular systems on distributed-memory message-passing multiprocessors
  publication-title: SIAM J. Sci. Stat. Comput.
  doi: 10.1137/0910025
– year: 1993
  ident: 10.1006/inco.2000.2866_bib11
  article-title: Optimal broadcast and summation on the LogP model
– year: 1993
  ident: 10.1006/inco.2000.2866_bib10
  article-title: Stability of Parallel Triangular System Solvers
– start-page: 553
  year: 1995
  ident: 10.1006/inco.2000.2866_bib16
  article-title: Solving triangular linear systems in parallel using substitution
– year: 1998
  ident: 10.1006/inco.2000.2866_bib5
  article-title: Modified cyclic algorithms for solving triangular systems on distributed memory multi-processors
  publication-title: SIAM J. Sci. Stat. Comput.
– volume: 6
  start-page: 109
  year: 1988
  ident: 10.1006/inco.2000.2866_bib14
  article-title: Parallel solution of triangular systems of equations
  publication-title: Parallel Comput.
  doi: 10.1016/0167-8191(88)90009-9
– year: 1992
  ident: 10.1006/inco.2000.2866_bib4
  article-title: A look at scalable dense linear algebra libraries
– year: 1975
  ident: 10.1006/inco.2000.2866_bib1
– year: 1996
  ident: 10.1006/inco.2000.2866_bib2
  article-title: LogP: A practical model of parallel computation
  publication-title: Communications of the ACM
  doi: 10.1145/240455.240477
SSID ssj0011546
Score 1.5519848
Snippet This paper explores the problem of solving triangular linear systems on parallel distributed-memory machines. Working within the LogP model, tight asymptotic...
SourceID pascalfrancis
crossref
elsevier
SourceType Index Database
Enrichment Source
Publisher
StartPage 172
SubjectTerms Algorithmics. Computability. Computer arithmetics
Applied sciences
Computer science; control theory; systems
distributed-memory
Exact sciences and technology
LogP model
Mathematics
matrix computation
Nonlinear algebraic and transcendental equations
Numerical analysis
Numerical analysis. Scientific computation
Numerical linear algebra
numerical methods
parallel algorithmsand complexity
Sciences and techniques of general use
Theoretical computing
triangular solvers
Title On designing optimal parallel triangular solvers
URI https://dx.doi.org/10.1006/inco.2000.2866
Volume 161
WOSCitedRecordID wos000089567800003&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVESC
  databaseName: Elsevier SD Freedom Collection Journals 2021
  customDbUrl:
  eissn: 1090-2651
  dateEnd: 20171231
  omitProxy: false
  ssIdentifier: ssj0011546
  issn: 0890-5401
  databaseCode: AIEXJ
  dateStart: 19950101
  isFulltext: true
  titleUrlDefault: https://www.sciencedirect.com
  providerName: Elsevier
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3fa9swED62tA8do1u7lbVrhx8Geyju5F-y9FhGyraHdtA85M3IsgSF1QtJGvrn786SkrglsBb2YoyQZVmfdHeS7-4D-JzlolZ5IeNMcBbnqBLiGmdGbFiD6qXGHUWmOrKJ8vJSjMfyl2cUnHV0AmXbivt7OfmvUGMZgk2hs0-Ae9koFuA9go5XhB2v_wT8Fbm3kltG58-MEuGWgq3UlEhTyLH8hk4oyfcUu7Hw_u_BPPXBSfPgo6w7zofez_prYh12MQwUWGJOh2e9kwO2OsgKwSw9X0smJCM_CQe1cfKQYVnKfU7YIDBd-nQ_M9I18Zc4Gh6vSVPnsPpISONCx0Gk5BNdoNBZKviDbNidfr2mDlF_KC8UVpEvYSstcQM0gK3zH8Pxz-XfosQHZIUPCMk5Gf_af8sm4-P1RM1wSVjHZbJmYIzewq7fGUTnDtE9eGHafXgTWDciL4T34dVaCsl3wK7aaAl35OGOAtzRCu7Iw_0eRhfD0bfvsWfBiHWWFHPUk1lRM1XgxzXWKsGNpW1ykwqlLGeoH7gxNpdGNEpYYbitE1Vrnqui5rXODmDQ_mnNB4h4WeqsMdLi-ssTrSUzaMPYQmKrSmtxCHEYnUr7DPFEVPK7crmteUWjScSlrKLRPIQvy_oTlxtlY80kDHblLTtnsVU4JzY-c9JDZfWKImV5Io6e0eZH2HGpFOgI7RgG8-mdOYFtvZjfzKaf_LT6C8Lve2U
linkProvider Elsevier
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=On+designing+optimal+parallel+triangular+solvers&rft.jtitle=Information+and+computation&rft.au=Santos%2C+Eunice+E.&rft.date=2000&rft.pub=Elsevier+Inc&rft.issn=0890-5401&rft.eissn=1090-2651&rft.volume=161&rft.issue=2&rft.spage=172&rft.epage=210&rft_id=info:doi/10.1006%2Finco.2000.2866&rft.externalDocID=S0890540100928669
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0890-5401&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0890-5401&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0890-5401&client=summon