HARP: Holistic Analysis for Refactoring Python-Based Analytics Programs.

Gespeichert in:
Bibliographische Detailangaben
Titel: HARP: Holistic Analysis for Refactoring Python-Based Analytics Programs.
Autoren: Weijie Zhou, Yue Zhao, Guoqiang Zhang, Xipeng Shen
Quelle: ICSE: International Conference on Software Engineering; 6/17/2020, p506-517, 12p
Schlagwörter: PYTHON programming language, MACHINE learning, DATA analytics, SOFTWARE engineering, COMPUTER software development
Abstract: Modern machine learning programs are often written in Python, with the main computations specified through calls to some highly optimized libraries (e.g., TensorFlow, PyTorch). How to maximize the computing efficiency of such programs is essential for many application domains, which has drawn lots of recent attention. This work points out a common limitation in existing efforts: they focus their views only on the static computation graphs specified by library APIs, but leave the influence from the hosting Python code largely unconsidered. The limitation often causes them to miss the big picture and hence many important optimization opportunities. This work proposes a new approach named HARP to address the problem. HARP enables holistic analysis that spans across computation graphs and their hosting Python code. HARP achieves it through a set of novel techniques: analytics-conscious speculative analysis to circumvent Python complexities, a unified representation augmented computation graphs to capture all dimensions of knowledge related with the holistic analysis, and conditioned feedback mechanism to allow risk-controlled aggressive analysis. Refactoring based on HARP gives 1.3-3X and 2.07X average speedups on a set of TensorFlow and PyTorch programs. [ABSTRACT FROM AUTHOR]
Copyright of ICSE: International Conference on Software Engineering is the property of Association for Computing Machinery and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Datenbank: Complementary Index
FullText Text:
  Availability: 0
Header DbId: edb
DbLabel: Complementary Index
An: 155540321
RelevancyScore: 915
AccessLevel: 6
PubType: Conference
PubTypeId: conference
PreciseRelevancyScore: 914.608215332031
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: HARP: Holistic Analysis for Refactoring Python-Based Analytics Programs.
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Weijie+Zhou%22">Weijie Zhou</searchLink><br /><searchLink fieldCode="AR" term="%22Yue+Zhao%22">Yue Zhao</searchLink><br /><searchLink fieldCode="AR" term="%22Guoqiang+Zhang%22">Guoqiang Zhang</searchLink><br /><searchLink fieldCode="AR" term="%22Xipeng+Shen%22">Xipeng Shen</searchLink>
– Name: TitleSource
  Label: Source
  Group: Src
  Data: ICSE: International Conference on Software Engineering; 6/17/2020, p506-517, 12p
– Name: Subject
  Label: Subject Terms
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22PYTHON+programming+language%22">PYTHON programming language</searchLink><br /><searchLink fieldCode="DE" term="%22MACHINE+learning%22">MACHINE learning</searchLink><br /><searchLink fieldCode="DE" term="%22DATA+analytics%22">DATA analytics</searchLink><br /><searchLink fieldCode="DE" term="%22SOFTWARE+engineering%22">SOFTWARE engineering</searchLink><br /><searchLink fieldCode="DE" term="%22COMPUTER+software+development%22">COMPUTER software development</searchLink>
– Name: Abstract
  Label: Abstract
  Group: Ab
  Data: Modern machine learning programs are often written in Python, with the main computations specified through calls to some highly optimized libraries (e.g., TensorFlow, PyTorch). How to maximize the computing efficiency of such programs is essential for many application domains, which has drawn lots of recent attention. This work points out a common limitation in existing efforts: they focus their views only on the static computation graphs specified by library APIs, but leave the influence from the hosting Python code largely unconsidered. The limitation often causes them to miss the big picture and hence many important optimization opportunities. This work proposes a new approach named HARP to address the problem. HARP enables holistic analysis that spans across computation graphs and their hosting Python code. HARP achieves it through a set of novel techniques: analytics-conscious speculative analysis to circumvent Python complexities, a unified representation augmented computation graphs to capture all dimensions of knowledge related with the holistic analysis, and conditioned feedback mechanism to allow risk-controlled aggressive analysis. Refactoring based on HARP gives 1.3-3X and 2.07X average speedups on a set of TensorFlow and PyTorch programs. [ABSTRACT FROM AUTHOR]
– Name: Abstract
  Label:
  Group: Ab
  Data: <i>Copyright of ICSE: International Conference on Software Engineering is the property of Association for Computing Machinery and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract.</i> (Copyright applies to all Abstracts.)
PLink https://erproxy.cvtisr.sk/sfx/access?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edb&AN=155540321
RecordInfo BibRecord:
  BibEntity:
    Identifiers:
      – Type: doi
        Value: 10.1145/3377811.3380434
    Languages:
      – Code: eng
        Text: English
    PhysicalDescription:
      Pagination:
        PageCount: 12
        StartPage: 506
    Subjects:
      – SubjectFull: PYTHON programming language
        Type: general
      – SubjectFull: MACHINE learning
        Type: general
      – SubjectFull: DATA analytics
        Type: general
      – SubjectFull: SOFTWARE engineering
        Type: general
      – SubjectFull: COMPUTER software development
        Type: general
    Titles:
      – TitleFull: HARP: Holistic Analysis for Refactoring Python-Based Analytics Programs.
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Weijie Zhou
      – PersonEntity:
          Name:
            NameFull: Yue Zhao
      – PersonEntity:
          Name:
            NameFull: Guoqiang Zhang
      – PersonEntity:
          Name:
            NameFull: Xipeng Shen
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 17
              M: 06
              Text: 6/17/2020
              Type: published
              Y: 2020
          Titles:
            – TitleFull: ICSE: International Conference on Software Engineering
              Type: main
ResultId 1