A Class of Communication-avoiding Algorithms for Solving General Dense Linear Systems on CPU/GPU Parallel Machines

We study several solvers for the solution of general linear systems where the main objective is to reduce the communication overhead due to pivoting. We first describe two existing algorithms for the LU factorization on hybrid CPU/GPU architectures. The first one is based on partial pivoting and the...

Full description

Saved in:

Bibliographic Details
Published in:	Procedia computer science Vol. 9; pp. 17 - 26
Main Authors:	Baboulin, Marc, Donfack, Simplice, Dongarra, Jack, Grigori, Laura, Rémy, Adrien, Tomov, Stanimire
Format:	Journal Article
Language:	English
Published:	Elsevier B.V 2012
Subjects:	communication-avoiding algorithms dense linear algebra libraries hybrid multicore/GPU computing linear system solvers LU factorization linear system solvers dense linear algebra libraries LU factorization communication-avoiding algorithms hybrid multicore/GPU computing
ISSN:	1877-0509, 1877-0509
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Abstract	We study several solvers for the solution of general linear systems where the main objective is to reduce the communication overhead due to pivoting. We first describe two existing algorithms for the LU factorization on hybrid CPU/GPU architectures. The first one is based on partial pivoting and the second uses a random preconditioning of the original matrix to avoid pivoting. Then we introduce a solver where the panel factorization is performed using a communication-avoiding pivoting heuristic while the update of the trailing submatrix is performed by the GPU. We provide performance comparisons and tests on accuracy for these solvers on current hybrid multicore-GPU parallel machines.
AbstractList	We study several solvers for the solution of general linear systems where the main objective is to reduce the communication overhead due to pivoting. We first describe two existing algorithms for the LU factorization on hybrid CPU/GPU architectures. The first one is based on partial pivoting and the second uses a random preconditioning of the original matrix to avoid pivoting. Then we introduce a solver where the panel factorization is performed using a communication-avoiding pivoting heuristic while the update of the trailing submatrix is performed by the GPU. We provide performance comparisons and tests on accuracy for these solvers on current hybrid multicore-GPU parallel machines.
Author	Baboulin, Marc Rémy, Adrien Dongarra, Jack Grigori, Laura Donfack, Simplice Tomov, Stanimire
Author_xml	– sequence: 1 givenname: Marc surname: Baboulin fullname: Baboulin, Marc email: marc.baboulin@inria.fr organization: Inria and University Paris-Sud, France – sequence: 2 givenname: Simplice surname: Donfack fullname: Donfack, Simplice email: simplice.donfack@lri.fr organization: Inria and University Paris-Sud, France – sequence: 3 givenname: Jack surname: Dongarra fullname: Dongarra, Jack email: dongarra@eecs.utk.edu organization: University of Tennessee, USA – sequence: 4 givenname: Laura surname: Grigori fullname: Grigori, Laura email: laura.grigori@inria.fr organization: Inria and University Paris-Sud, France – sequence: 5 givenname: Adrien surname: Rémy fullname: Rémy, Adrien email: adrien.remy@lri.fr organization: Inria and University Paris-Sud, France – sequence: 6 givenname: Stanimire surname: Tomov fullname: Tomov, Stanimire email: tomov@eecs.utk.edu organization: University of Tennessee, USA
BookMark	eNqFkEFPwyAUx4mZiXPuE3jhC7RCaWl78LBUnSYzLtGdCaWvG0sLC9Ql-_ayzYPxoO_Ce4__j4TfNRoZawChW0piSii_28Y7Z5WPE0KTmKQxIewCjWmR5xHJSDn60V-hqfdbEooVRUnzMXIzXHXSe2xbXNm-_zRayUFbE8m91Y02azzr1tbpYdN73FqH3223P67nYMDJDj-A8YAX2oAMlwc_QAhag6vl6m6-XOGlDKkOOvwq1Sak_A26bGXnYfp9TtDq6fGjeo4Wb_OXaraIFEuLIVK0YBm0TU2TkhdUcUbLMoM0zyRnDXAuw1QWRc2TugUOOVNZXpcSGJMprzM2Qez8rnLWewet2DndS3cQlIijObEVJ3PiaE6QVAQtgSp_UUoPJyODk7r7h70_sxC-tdfghFcajIJGO1CDaKz-k_8CrR2Nzg
CitedBy_id	crossref_primary_10_1007_s11227_020_03340_9 crossref_primary_10_1177_1094342016665471 crossref_primary_10_15803_ijnc_4_1_131 crossref_primary_10_1016_j_jcp_2017_12_028 crossref_primary_10_3390_mca26030052
Cites_doi	10.1177/1094342010385729 10.1016/j.parco.2009.12.005 10.1016/j.parco.2010.06.001 10.1137/1.9780898718027 10.1109/SC.2008.5214287 10.1090/S0025-5718-1980-0572859-4 10.1137/100788926 10.1002/cpe.1301 10.1137/1.9780898719642 10.1007/3-540-60902-4_13 10.1137/1.9781611971811 10.1137/1.9780898719604 10.1147/rd.416.0737 10.1109/IPDPS.2011.15 10.1109/IPDPS.2010.5470348 10.1137/1.9780898719611
ContentType	Journal Article
Copyright	2012
Copyright_xml	– notice: 2012
DBID	6I. AAFTH AAYXX CITATION
DOI	10.1016/j.procs.2012.04.003
DatabaseName	ScienceDirect Open Access Titles Elsevier:ScienceDirect:Open Access CrossRef
DatabaseTitle	CrossRef
DatabaseTitleList
DeliveryMethod	fulltext_linktorsrc
Discipline	Computer Science
EISSN	1877-0509
EndPage	26
ExternalDocumentID	10_1016_j_procs_2012_04_003 S187705091200124X
GroupedDBID	--K 0R~ 0SF 1B1 457 5VS 6I. 71M AACTN AAEDT AAEDW AAFTH AAIKJ AALRI AAQFI AAXUO ABMAC ACGFS ADBBV ADEZE AEXQZ AFTJW AGHFR AITUG ALMA_UNASSIGNED_HOLDINGS AMRAJ E3Z EBS EJD EP3 FDB FNPLU HZ~ IXB KQ8 M41 M~E NCXOZ O-L O9- OK1 P2P RIG ROL SES SSZ 9DU AAYWO AAYXX ABWVN ACRPL ACVFH ADCNI ADNMO ADVLN AEUPX AFPUW AIGII AKBMS AKRWK AKYEP CITATION ~HD
ID	FETCH-LOGICAL-c348t-c1835efdb129681c631995e475a63de66a95e988b62bfe6e73c57b9ae33a46b53
ISICitedReferencesCount	14
ISICitedReferencesURI	http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000306288400002&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN	1877-0509
IngestDate	Sat Nov 29 02:44:16 EST 2025 Tue Nov 18 22:01:52 EST 2025 Wed May 17 00:09:02 EDT 2023
IsDoiOpenAccess	true
IsOpenAccess	true
IsPeerReviewed	true
IsScholarly	true
Keywords	linear system solvers dense linear algebra libraries LU factorization communication-avoiding algorithms hybrid multicore/GPU computing
Language	English
License	http://creativecommons.org/licenses/by-nc-nd/3.0 https://www.elsevier.com/tdm/userlicense/1.0
LinkModel	OpenURL
MergedId	FETCHMERGED-LOGICAL-c348t-c1835efdb129681c631995e475a63de66a95e988b62bfe6e73c57b9ae33a46b53
OpenAccessLink	https://dx.doi.org/10.1016/j.procs.2012.04.003
PageCount	10
ParticipantIDs	crossref_primary_10_1016_j_procs_2012_04_003 crossref_citationtrail_10_1016_j_procs_2012_04_003 elsevier_sciencedirect_doi_10_1016_j_procs_2012_04_003
PublicationCentury	2000
PublicationDate	2012 2012-00-00
PublicationDateYYYYMMDD	2012-01-01
PublicationDate_xml	– year: 2012 text: 2012
PublicationDecade	2010
PublicationTitle	Procedia computer science
PublicationYear	2012
Publisher	Elsevier B.V
Publisher_xml	– name: Elsevier B.V
References	N. J. Higham, Accuracy and Stability of Numerical Algorithms, SIAM, 2002, second edition. Grigori, Demmel, Xiang (bib0050) 2011; 32 S. Donfack, L. Grigori, A.K. Gupta, Adapting communication-avoiding LU and QR factorizations to multicore architectures, in: Parallel & Distributed Processing (IPDPS), 2010 IEEE International Symposium on, IEEE, 2010, pp. 1-10. Nath, Tomov, Dongarra (bib0025) 2010; 24 J. Dongarra, M. Faverge, H. Ltaief, P. Luszcsek, Achieving numerical accuracy and high performance using recursive tile LU factorization, Tech. rep., LAPACK Working Note 259 (2011). L. Blackford, J. Choi, A. Cleary, E. D’Azevedo, J. Demmel, I. Dhillon, J. Dongarra, S. Hammarling, G. Henry, A. Petitet, K. Stanley, D. Walker, R. Whaley, ScaLAPACK Users’ Guide, SIAM, 1997. Tomov, Nath, Dongarra (bib0035) 2010; 36 S. Tomov, J. Dongarra, M. Baboulin, Towards dense linear algebra for hybrid GPU accelerated manycore systems, Parallel Computing 36 (5&6) (2010) 232-240. L. Grigori, J. Demmel, H. Xiang, Communication avoiding Gaussian elimination, in: Proceedings of the 2008 ACM/IEEE conference on Supercomputing, IEEE Press, 2008, p. 29. Gustavson (bib0100) 1997; 41 J. Dongarra, I. Duff, D. Sorensen, H. van der Vorst, Numerical Linear Algebra for High-Performance Computers, SIAM, 1998. Intel, Math Kernel Library (MKL), http://www.intel.com/software/products/mkl/. D. S. Parker, Random butterﬂy transformations with applications in computational linear algebra, Technical Report CSD-950023, Computer Science Department, UCLA (1995). E. Anderson, Z. Bai, C. Bischof, S. Blackford, J. Demmel, J. Dongarra, J.D. Croz, A. Greenbaum, S. Hammarling, A. McKenney, D. Sorensen, LAPACK Users’ Guide, SIAM, 1999, third edition. M. Baboulin, J. Dongarra, J. Herrmann, S. Tomov, Accelerating linear system solutions using randomization techniques, Tech. rep., LAPACK Working Note 246 (2011). G. H. Golub, C.F. van Loan, Matrix Computations, The Johns Hopkins University Press, 1996, third edition. J. J. Dongarra, C.B. Moler, J.R. Bunch, G.W. Stewart, LINPACK Users’ Guide, SIAM, 1979. M. Anderson, G. Ballard, J. Demmel, K. Keutzer, Communication-Avoiding QR decomposition for GPUs, Tech. rep., LAPACK Working Note 240, proceedings of IPDPS’11 (2011). S. Blackford, J. Dongarra, Installation Guide for LAPACK, Tech. rep., LAPACK Working Note 41, revised version 3.0 (1999). Buttari, Langou, Kurzak, Dongarra (bib0090) 2007; 20 Skeel (bib0120) 1980; 35 J. Kurzak, J. Dongarra, Implementing linear algebra routines on multi-core processors with pipelining and a look ahead, Tech. rep., LAPACK Working Note 178 (2006). A. Buttari, J. Dongarra, J. Kurzak, J. Langou, P. Luszczek, S. Tomov, The impact of multicore on math software, in: Proceedings of PARA 2006, Workshop on state-of-the art in scientific computing, 2006. J. Choi, J. Dongarra, L. Ostrouchov, A. Petitet, D. Walker, R. Whaley, A proposal for a set of parallel basic linear algebra subprograms, Tech. rep., LAPACK Working Note 100 (1995). Tomov (10.1016/j.procs.2012.04.003_bib0035) 2010; 36 10.1016/j.procs.2012.04.003_bib0045 Skeel (10.1016/j.procs.2012.04.003_bib0120) 1980; 35 10.1016/j.procs.2012.04.003_bib0015 10.1016/j.procs.2012.04.003_bib0030 10.1016/j.procs.2012.04.003_bib0085 10.1016/j.procs.2012.04.003_bib0020 10.1016/j.procs.2012.04.003_bib0075 10.1016/j.procs.2012.04.003_bib0010 Nath (10.1016/j.procs.2012.04.003_bib0025) 2010; 24 10.1016/j.procs.2012.04.003_bib0065 10.1016/j.procs.2012.04.003_bib0055 10.1016/j.procs.2012.04.003_bib0110 10.1016/j.procs.2012.04.003_bib0005 10.1016/j.procs.2012.04.003_bib0115 10.1016/j.procs.2012.04.003_bib0105 Gustavson (10.1016/j.procs.2012.04.003_bib0100) 1997; 41 10.1016/j.procs.2012.04.003_bib0070 Buttari (10.1016/j.procs.2012.04.003_bib0090) 2007; 20 10.1016/j.procs.2012.04.003_bib0060 10.1016/j.procs.2012.04.003_bib0040 10.1016/j.procs.2012.04.003_bib0095 Grigori (10.1016/j.procs.2012.04.003_bib0050) 2011; 32 10.1016/j.procs.2012.04.003_bib0080
References_xml	– volume: 32 start-page: 1317 year: 2011 end-page: 1350 ident: bib0050 article-title: CALU: a communication optimal LU factorization algorithm, publication-title: SIAM J. Matrix Anal. and Appl. – reference: Intel, Math Kernel Library (MKL), http://www.intel.com/software/products/mkl/. – reference: S. Blackford, J. Dongarra, Installation Guide for LAPACK, Tech. rep., LAPACK Working Note 41, revised version 3.0 (1999). – reference: J. Kurzak, J. Dongarra, Implementing linear algebra routines on multi-core processors with pipelining and a look ahead, Tech. rep., LAPACK Working Note 178 (2006). – reference: L. Grigori, J. Demmel, H. Xiang, Communication avoiding Gaussian elimination, in: Proceedings of the 2008 ACM/IEEE conference on Supercomputing, IEEE Press, 2008, p. 29. – reference: J. Dongarra, I. Duff, D. Sorensen, H. van der Vorst, Numerical Linear Algebra for High-Performance Computers, SIAM, 1998. – volume: 24 start-page: 511 year: 2010 end-page: 515 ident: bib0025 article-title: An improved MAGMA GEMM for Fermi GPUs publication-title: International Journal of High Performance Computing Applications – reference: M. Baboulin, J. Dongarra, J. Herrmann, S. Tomov, Accelerating linear system solutions using randomization techniques, Tech. rep., LAPACK Working Note 246 (2011). – reference: J. J. Dongarra, C.B. Moler, J.R. Bunch, G.W. Stewart, LINPACK Users’ Guide, SIAM, 1979. – volume: 20 start-page: 1573 year: 2007 end-page: 1590 ident: bib0090 article-title: Parallel tiled QR factorization for multicore architectures publication-title: Concurr. Comput.: Pract. Exper. – reference: M. Anderson, G. Ballard, J. Demmel, K. Keutzer, Communication-Avoiding QR decomposition for GPUs, Tech. rep., LAPACK Working Note 240, proceedings of IPDPS’11 (2011). – reference: D. S. Parker, Random butterﬂy transformations with applications in computational linear algebra, Technical Report CSD-950023, Computer Science Department, UCLA (1995). – volume: 35 start-page: 817 year: 1980 end-page: 832 ident: bib0120 article-title: Iterative refinement implies numerical stability for Gaussian elimination publication-title: Math. Comput. – reference: S. Donfack, L. Grigori, A.K. Gupta, Adapting communication-avoiding LU and QR factorizations to multicore architectures, in: Parallel & Distributed Processing (IPDPS), 2010 IEEE International Symposium on, IEEE, 2010, pp. 1-10. – reference: J. Choi, J. Dongarra, L. Ostrouchov, A. Petitet, D. Walker, R. Whaley, A proposal for a set of parallel basic linear algebra subprograms, Tech. rep., LAPACK Working Note 100 (1995). – reference: A. Buttari, J. Dongarra, J. Kurzak, J. Langou, P. Luszczek, S. Tomov, The impact of multicore on math software, in: Proceedings of PARA 2006, Workshop on state-of-the art in scientific computing, 2006. – volume: 36 start-page: 645 year: 2010 end-page: 654 ident: bib0035 article-title: Accelerating the reduction to upper Hessenberg, tridiagonal, and bidiagonal forms through hybrid GPU-based computing publication-title: Parallel Computing – reference: N. J. Higham, Accuracy and Stability of Numerical Algorithms, SIAM, 2002, second edition. – reference: G. H. Golub, C.F. van Loan, Matrix Computations, The Johns Hopkins University Press, 1996, third edition. – reference: S. Tomov, J. Dongarra, M. Baboulin, Towards dense linear algebra for hybrid GPU accelerated manycore systems, Parallel Computing 36 (5&6) (2010) 232-240. – volume: 41 start-page: 737 year: 1997 end-page: 755 ident: bib0100 article-title: Recursion leads to automatic variable blocking for dense linear-algebra algorithms publication-title: IBM Journal of Research and Development – reference: J. Dongarra, M. Faverge, H. Ltaief, P. Luszcsek, Achieving numerical accuracy and high performance using recursive tile LU factorization, Tech. rep., LAPACK Working Note 259 (2011). – reference: L. Blackford, J. Choi, A. Cleary, E. D’Azevedo, J. Demmel, I. Dhillon, J. Dongarra, S. Hammarling, G. Henry, A. Petitet, K. Stanley, D. Walker, R. Whaley, ScaLAPACK Users’ Guide, SIAM, 1997. – reference: E. Anderson, Z. Bai, C. Bischof, S. Blackford, J. Demmel, J. Dongarra, J.D. Croz, A. Greenbaum, S. Hammarling, A. McKenney, D. Sorensen, LAPACK Users’ Guide, SIAM, 1999, third edition. – volume: 24 start-page: 511 issue: 4 year: 2010 ident: 10.1016/j.procs.2012.04.003_bib0025 article-title: An improved MAGMA GEMM for Fermi GPUs publication-title: International Journal of High Performance Computing Applications doi: 10.1177/1094342010385729 – ident: 10.1016/j.procs.2012.04.003_bib0030 doi: 10.1016/j.parco.2009.12.005 – volume: 36 start-page: 645 issue: 12 year: 2010 ident: 10.1016/j.procs.2012.04.003_bib0035 article-title: Accelerating the reduction to upper Hessenberg, tridiagonal, and bidiagonal forms through hybrid GPU-based computing publication-title: Parallel Computing doi: 10.1016/j.parco.2010.06.001 – ident: 10.1016/j.procs.2012.04.003_bib0055 doi: 10.1137/1.9780898718027 – ident: 10.1016/j.procs.2012.04.003_bib0115 – ident: 10.1016/j.procs.2012.04.003_bib0065 – ident: 10.1016/j.procs.2012.04.003_bib0060 doi: 10.1109/SC.2008.5214287 – volume: 35 start-page: 817 year: 1980 ident: 10.1016/j.procs.2012.04.003_bib0120 article-title: Iterative refinement implies numerical stability for Gaussian elimination publication-title: Math. Comput. doi: 10.1090/S0025-5718-1980-0572859-4 – volume: 32 start-page: 1317 year: 2011 ident: 10.1016/j.procs.2012.04.003_bib0050 article-title: CALU: a communication optimal LU factorization algorithm, publication-title: SIAM J. Matrix Anal. and Appl. doi: 10.1137/100788926 – ident: 10.1016/j.procs.2012.04.003_bib0085 – volume: 20 start-page: 1573 year: 2007 ident: 10.1016/j.procs.2012.04.003_bib0090 article-title: Parallel tiled QR factorization for multicore architectures publication-title: Concurr. Comput.: Pract. Exper. doi: 10.1002/cpe.1301 – ident: 10.1016/j.procs.2012.04.003_bib0045 – ident: 10.1016/j.procs.2012.04.003_bib0015 doi: 10.1137/1.9780898719642 – ident: 10.1016/j.procs.2012.04.003_bib0020 doi: 10.1007/3-540-60902-4_13 – ident: 10.1016/j.procs.2012.04.003_bib0095 – ident: 10.1016/j.procs.2012.04.003_bib0105 – ident: 10.1016/j.procs.2012.04.003_bib0005 doi: 10.1137/1.9781611971811 – ident: 10.1016/j.procs.2012.04.003_bib0010 doi: 10.1137/1.9780898719604 – volume: 41 start-page: 737 issue: 6 year: 1997 ident: 10.1016/j.procs.2012.04.003_bib0100 article-title: Recursion leads to automatic variable blocking for dense linear-algebra algorithms publication-title: IBM Journal of Research and Development doi: 10.1147/rd.416.0737 – ident: 10.1016/j.procs.2012.04.003_bib0040 doi: 10.1109/IPDPS.2011.15 – ident: 10.1016/j.procs.2012.04.003_bib0080 doi: 10.1109/IPDPS.2010.5470348 – ident: 10.1016/j.procs.2012.04.003_bib0110 – ident: 10.1016/j.procs.2012.04.003_bib0070 doi: 10.1137/1.9780898719611 – ident: 10.1016/j.procs.2012.04.003_bib0075
SSID	ssj0000388917
Score	1.977275
Snippet	We study several solvers for the solution of general linear systems where the main objective is to reduce the communication overhead due to pivoting. We first...
SourceID	crossref elsevier
SourceType	Enrichment Source Index Database Publisher
StartPage	17
SubjectTerms	communication-avoiding algorithms dense linear algebra libraries hybrid multicore/GPU computing linear system solvers LU factorization
Title	A Class of Communication-avoiding Algorithms for Solving General Dense Linear Systems on CPU/GPU Parallel Machines
URI	https://dx.doi.org/10.1016/j.procs.2012.04.003
Volume	9
WOSCitedRecordID	wos000306288400002&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
journalDatabaseRights	– providerCode: PRVHPJ databaseName: ROAD: Directory of Open Access Scholarly Resources customDbUrl: eissn: 1877-0509 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0000388917 issn: 1877-0509 databaseCode: M~E dateStart: 20100101 isFulltext: true titleUrlDefault: https://road.issn.org providerName: ISSN International Centre
link	http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1Lb9swDBaybodduu6Fdi_osFtnLH7KOgbF1h62IkCaITdDkuXWhWEXbhr0tD-zPzpSkh9AgmIbsIsRK7HliF9IiuFHEvIRPFgN22TlxdyXsEEphCdZzL08FKzguSqmpk3nj2_s_Dxdrfh8MvnVcWE2Favr9P6e3_xXUcMYCBups38h7v6mMACvQehwBLHD8Y8EP7ONLk2exZj94YlNUxoKy6y6bNpyfWVrMRwvmspEFVwFalBB9a3GzTrW-HEVzfE_hZP5Ep7qdL4Et7PFFiwVti26wrz5sYtrqAeAOpOtjg0jjp2VHYKmEtuw144qpHpfuqkLYbXzojSJ7nr01qVoW5fXq3p20Wlb4lfpCN5iHMPwh73uNq3GaOGUMQ8L01gjtWPMqW4-Ur2WAuqMuGXhb5kHG6m4RuOksFY7BoKxeno4WMM-R3GBM-KEPqadBdHqEXkcAIyxP8j3n0MgD8vpcNPZuX_ErrqVySPcmmu3BzTyai4OyL7bjtCZhdFzMtH1C_Ksa_VBneZ_SdoZNaiiTUF3o4oOqKKAKupQRR2qqEEVtaiiDlW0qSmg6jNginaYoh2mXpHl1y8XJ2eea9fhqTBK154C6xDrIpfgQiapr5IQ6f86YrFIwlwniYAznqYyCWShE81CFTPJBegKESUyDl-Tvbqp9SGhQgolmGY5j_yoAK8ySgsVMK2nMlJ-nh-RoFvCTLla9thSpcq6pMXrzKx7huueTSMsgXtEPvUX3dhSLg9_POlkk7nfifUyM0DTQxe--dcL35KneGYDfO_I3rq90-_JE7VZl7ftB4O631MHspk
linkProvider	ISSN International Centre
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=A+Class+of+Communication-avoiding+Algorithms+for+Solving+General+Dense+Linear+Systems+on+CPU%2FGPU+Parallel+Machines&rft.jtitle=Procedia+computer+science&rft.au=Baboulin%2C+Marc&rft.au=Donfack%2C+Simplice&rft.au=Dongarra%2C+Jack&rft.au=Grigori%2C+Laura&rft.date=2012&rft.pub=Elsevier+B.V&rft.issn=1877-0509&rft.eissn=1877-0509&rft.volume=9&rft.spage=17&rft.epage=26&rft_id=info:doi/10.1016%2Fj.procs.2012.04.003&rft.externalDocID=S187705091200124X
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1877-0509&client=summon
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1877-0509&client=summon
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1877-0509&client=summon