Sparse triangular solves for ILU revisited: data layout crucial to better performance.

Detailed bibliography
Title: Sparse triangular solves for ILU revisited: data layout crucial to better performance.
Authors: Smith, Barry; Zhang, Hong
Source: International Journal of High Performance Computing Applications; Nov 2011, Vol. 25, Issue 4, pp. 386-391 (6 pages)
Subjects: MACRO processors; DATA; COMPUTER storage devices; BUFFER storage (Computer science); CYBERNETICS
Abstract: A key to good processor utilization for sparse matrix computations is storing the data in the format that is most conducive to fast access by the memory system. In particular, for sparse matrix triangular solves the traditional compressed sparse matrix format is poor, and minor adjustments to the data structure can increase the processor utilization dramatically. Such adjustments involve storing the L and U factors separately and storing the U rows 'backwards' so that they are accessed in a simple streaming fashion during the triangular solves. Changes to the PETSc libraries to use this modified storage format resulted in over twice the floating-point rate for some matrices. This improvement can be accounted for by a decrease in the cache misses and TLB (translation lookaside buffer) misses in the modified code. [ABSTRACT FROM PUBLISHER]
Copyright of International Journal of High Performance Computing Applications is the property of Sage Publications Inc. and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
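Illustrative note: the layout change summarized in the abstract (U stored apart from L, with its rows packed 'backwards') can be sketched in a few lines. This is a minimal sketch in plain Python, not the PETSc implementation; the function names `pack_u_reversed` and `backward_solve_streaming`, the separate `diag` array, and the list-based storage are all assumptions made for illustration. The point it shows is that the backward substitution then reads the index and value arrays strictly front to back.

```python
def pack_u_reversed(n, rows):
    """Pack the strictly upper part of U with the LAST row first.

    rows[i] is a list of (col, value) pairs with col > i (hypothetical input
    format). Returns (ptr, idx, val); entries of row n-1 sit at the start of
    idx/val, followed by row n-2, and so on.
    """
    ptr, idx, val = [0], [], []
    for i in range(n - 1, -1, -1):
        for col, v in rows[i]:
            idx.append(col)
            val.append(v)
        ptr.append(len(idx))
    return ptr, idx, val


def backward_solve_streaming(n, ptr, idx, val, diag, b):
    """Solve U x = b using the reversed layout produced above."""
    x = list(b)
    k = 0  # single cursor into idx/val; it only ever moves forward
    for step in range(n):
        i = n - 1 - step          # rows are visited from the bottom up
        s = x[i]
        while k < ptr[step + 1]:  # consume this row's off-diagonal entries
            s -= val[k] * x[idx[k]]
            k += 1
        x[i] = s / diag[i]
    return x


# Small example: U = [[2, 1, 0], [0, 4, 2], [0, 0, 5]]
ptr, idx, val = pack_u_reversed(3, [[(1, 1.0)], [(2, 2.0)], []])
x = backward_solve_streaming(3, ptr, idx, val, [2.0, 4.0, 5.0], [4.0, 14.0, 15.0])
```

In a conventional row-ordered layout the same solve would start at the end of the arrays and jump backwards row by row; here the cursor `k` advances monotonically, which is the streaming access pattern the abstract refers to.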
Database: Complementary Index
Header DbId: edb
DbLabel: Complementary Index
An: 67672579
RelevancyScore: 834
AccessLevel: 6
PubType: Academic Journal
PubTypeId: academicJournal
PreciseRelevancyScore: 834.066711425781
PLink https://erproxy.cvtisr.sk/sfx/access?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edb&AN=67672579
RecordInfo BibRecord:
  BibEntity:
    Identifiers:
      – Type: doi
        Value: 10.1177/1094342010389857
    Languages:
      – Code: eng
        Text: English
    PhysicalDescription:
      Pagination:
        PageCount: 6
        StartPage: 386
    Subjects:
      – SubjectFull: MACRO processors
        Type: general
      – SubjectFull: DATA
        Type: general
      – SubjectFull: COMPUTER storage devices
        Type: general
      – SubjectFull: BUFFER storage (Computer science)
        Type: general
      – SubjectFull: CYBERNETICS
        Type: general
    Titles:
      – TitleFull: Sparse triangular solves for ILU revisited: data layout crucial to better performance.
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Smith, Barry
      – PersonEntity:
          Name:
            NameFull: Zhang, Hong
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 01
              M: 11
              Text: Nov2011
              Type: published
              Y: 2011
          Identifiers:
            – Type: issn-print
              Value: 10943420
          Numbering:
            – Type: volume
              Value: 25
            – Type: issue
              Value: 4
          Titles:
            – TitleFull: International Journal of High Performance Computing Applications
              Type: main