Parallel and fully recursive multifrontal sparse Cholesky

Bibliographic details
Published in: Future Generation Computer Systems, Vol. 20, No. 3, pp. 425-440
Main authors: Irony, Dror; Shklarski, Gil; Toledo, Sivan
Format: Journal Article
Language: English
Published: Elsevier B.V., 1 April 2004
ISSN: 0167-739X, 1872-7115
Abstract: We describe the design, implementation, and performance of a new parallel sparse Cholesky factorization code. The code uses a multifrontal factorization strategy. Operations on small dense submatrices are performed using new dense matrix subroutines that are part of the code, although the code can also use the BLAS and LAPACK. The new code is recursive at both the sparse and the dense levels, it uses a novel recursive data layout for dense submatrices, and it is parallelized using Cilk, an extension of C specifically designed to parallelize recursive codes. We demonstrate that the new code performs well and scales well on SMPs. In particular, on up to 16 processors, the code outperforms two state-of-the-art message-passing codes. The scalability and high performance that the code achieves imply that recursive schedules, blocked data layouts, and dynamic scheduling are effective in the implementation of sparse factorization codes.
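To make the idea of recursion at the dense level concrete, the following is a minimal sequential sketch of a recursive dense Cholesky factorization in Python. It is not the authors' code: the function names `recursive_cholesky` and `lower_triangle` are hypothetical, the matrix is stored as a plain list of rows rather than in the recursive data layout the abstract mentions, and nothing here is parallelized with Cilk. It only illustrates the split-factor-solve-update-factor pattern that a recursive schedule follows.

```python
import math


def recursive_cholesky(a, lo=0, hi=None):
    """Overwrite the lower triangle of a[lo:hi][lo:hi] with its Cholesky factor.

    `a` is a symmetric positive definite matrix stored as a list of rows.
    Illustrative sketch only: names and storage format are assumptions.
    """
    if hi is None:
        hi = len(a)
    n = hi - lo
    if n <= 0:
        return a
    if n == 1:
        # Base case: a 1-by-1 block is factored by a square root.
        a[lo][lo] = math.sqrt(a[lo][lo])
        return a
    mid = lo + n // 2
    # Step 1: factor the leading block, A11 = L11 * L11^T.
    recursive_cholesky(a, lo, mid)
    # Step 2: triangular solve, L21 = A21 * L11^{-T}, one row at a time.
    for i in range(mid, hi):
        for j in range(lo, mid):
            s = a[i][j] - sum(a[i][k] * a[j][k] for k in range(lo, j))
            a[i][j] = s / a[j][j]
    # Step 3: symmetric update of the trailing block, A22 -= L21 * L21^T
    # (only the lower triangle is touched).
    for i in range(mid, hi):
        for j in range(mid, i + 1):
            a[i][j] -= sum(a[i][k] * a[j][k] for k in range(lo, mid))
    # Step 4: factor the updated trailing block.
    recursive_cholesky(a, mid, hi)
    return a


def lower_triangle(a):
    """Return a copy of `a` with the strictly upper triangle zeroed."""
    return [[a[i][j] if j <= i else 0.0 for j in range(len(a))]
            for i in range(len(a))]
```

For the matrix `[[4, 2, 2], [2, 5, 3], [2, 3, 6]]`, calling `recursive_cholesky` and then `lower_triangle` yields the factor `[[2, 0, 0], [1, 2, 0], [1, 1, 2]]`. In a production code, steps 2 and 3 would themselves be recursive or delegated to BLAS-style kernels; the simple loops here are a simplification.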
Author email (Toledo, Sivan): stoledo@tau.ac.il
Copyright: 2003 Elsevier B.V.
DOI: 10.1016/j.future.2003.07.007
Discipline: Computer Science
Keywords: Parallel Cholesky factorization; Recursive factorizations; Cilk; Block layouts; Sparse Cholesky factorization; Multifrontal factorizations; Recursive layouts
License: https://www.elsevier.com/tdm/userlicense/1.0