CodeTranFix: A Neural Machine Translation Approach for Context-Aware Java Program Repair with CodeBERT.

Gespeichert in:
Bibliographische Detailangaben
Titel: CodeTranFix: A Neural Machine Translation Approach for Context-Aware Java Program Repair with CodeBERT.
Autoren: Lu, Yiwei, Ye, Shuxia, Qi, Liang
Quelle: Applied Sciences (2076-3417); Apr2025, Vol. 15 Issue 7, p3632, 14p
Schlagwörter: LANGUAGE models, COMPUTER software quality control, CONTEXTUAL learning, BOTTLENECKS (Manufacturing), GENERALIZATION
Abstract: Automated program repair (APR) plays a vital role in enhancing software quality and reducing developer maintenance efforts. Neural Machine Translation (NMT)-based methods demonstrate notable potential by learning translation patterns from bug-fix code pairs. However, traditional approaches are constrained by limited model capacity and training data scale, leading to performance bottlenecks in generalizing to unseen defect patterns. In this paper, we propose CodeTransFix, a novel APR approach that synergistically combines neural machine translation (NMT) methods with code-specific large language models of code (LLMCs) such as CodeBERT. The CodeTransFix approach innovatively learns contextual embeddings of bug-related code through CodeBERT and integrates these representations as supplementary inputs to the Transformer model, enabling context-aware patch generation. The repair performance is evaluated on the widely used Defects4j v1.2 benchmark. Our experimental results showed that CodeTransFix achieved a 54.1% performance improvement compared to the best NMT-based baseline model and a 23.3% performance improvement compared to the best LLMCs for fixing bugs. In addition, CodeTransFix outperformed existing APR methods in the Defects4j v2.0 generalization test. [ABSTRACT FROM AUTHOR]
Copyright of Applied Sciences (2076-3417) is the property of MDPI and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Datenbank: Complementary Index
FullText Text:
  Availability: 0
CustomLinks:
  – Url: https://resolver.ebscohost.com/openurl?sid=EBSCO:edb&genre=article&issn=20763417&ISBN=&volume=15&issue=7&date=20250401&spage=3632&pages=3632-3645&title=Applied Sciences (2076-3417)&atitle=CodeTranFix%3A%20A%20Neural%20Machine%20Translation%20Approach%20for%20Context-Aware%20Java%20Program%20Repair%20with%20CodeBERT.&aulast=Lu%2C%20Yiwei&id=DOI:10.3390/app15073632
    Name: Full Text Finder
    Category: fullText
    Text: Full Text Finder
    Icon: https://imageserver.ebscohost.com/branding/images/FTF.gif
    MouseOverText: Full Text Finder
  – Url: https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=EBSCO&SrcAuth=EBSCO&DestApp=WOS&ServiceName=TransferToWoS&DestLinkType=GeneralSearchSummary&Func=Links&author=Lu%20Y
    Name: ISI
    Category: fullText
    Text: Nájsť tento článok vo Web of Science
    Icon: https://imagesrvr.epnet.com/ls/20docs.gif
    MouseOverText: Nájsť tento článok vo Web of Science
Header DbId: edb
DbLabel: Complementary Index
An: 184443749
RelevancyScore: 1023
AccessLevel: 6
PubType: Academic Journal
PubTypeId: academicJournal
PreciseRelevancyScore: 1023.07330322266
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: CodeTranFix: A Neural Machine Translation Approach for Context-Aware Java Program Repair with CodeBERT.
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Lu%2C+Yiwei%22">Lu, Yiwei</searchLink><br /><searchLink fieldCode="AR" term="%22Ye%2C+Shuxia%22">Ye, Shuxia</searchLink><br /><searchLink fieldCode="AR" term="%22Qi%2C+Liang%22">Qi, Liang</searchLink>
– Name: TitleSource
  Label: Source
  Group: Src
  Data: Applied Sciences (2076-3417); Apr2025, Vol. 15 Issue 7, p3632, 14p
– Name: Subject
  Label: Subject Terms
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22LANGUAGE+models%22">LANGUAGE models</searchLink><br /><searchLink fieldCode="DE" term="%22COMPUTER+software+quality+control%22">COMPUTER software quality control</searchLink><br /><searchLink fieldCode="DE" term="%22CONTEXTUAL+learning%22">CONTEXTUAL learning</searchLink><br /><searchLink fieldCode="DE" term="%22BOTTLENECKS+%28Manufacturing%29%22">BOTTLENECKS (Manufacturing)</searchLink><br /><searchLink fieldCode="DE" term="%22GENERALIZATION%22">GENERALIZATION</searchLink>
– Name: Abstract
  Label: Abstract
  Group: Ab
  Data: Automated program repair (APR) plays a vital role in enhancing software quality and reducing developer maintenance efforts. Neural Machine Translation (NMT)-based methods demonstrate notable potential by learning translation patterns from bug-fix code pairs. However, traditional approaches are constrained by limited model capacity and training data scale, leading to performance bottlenecks in generalizing to unseen defect patterns. In this paper, we propose CodeTransFix, a novel APR approach that synergistically combines neural machine translation (NMT) methods with code-specific large language models of code (LLMCs) such as CodeBERT. The CodeTransFix approach innovatively learns contextual embeddings of bug-related code through CodeBERT and integrates these representations as supplementary inputs to the Transformer model, enabling context-aware patch generation. The repair performance is evaluated on the widely used Defects4j v1.2 benchmark. Our experimental results showed that CodeTransFix achieved a 54.1% performance improvement compared to the best NMT-based baseline model and a 23.3% performance improvement compared to the best LLMCs for fixing bugs. In addition, CodeTransFix outperformed existing APR methods in the Defects4j v2.0 generalization test. [ABSTRACT FROM AUTHOR]
– Name: Abstract
  Label:
  Group: Ab
  Data: <i>Copyright of Applied Sciences (2076-3417) is the property of MDPI and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract.</i> (Copyright applies to all Abstracts.)
PLink https://erproxy.cvtisr.sk/sfx/access?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edb&AN=184443749
RecordInfo BibRecord:
  BibEntity:
    Identifiers:
      – Type: doi
        Value: 10.3390/app15073632
    Languages:
      – Code: eng
        Text: English
    PhysicalDescription:
      Pagination:
        PageCount: 14
        StartPage: 3632
    Subjects:
      – SubjectFull: LANGUAGE models
        Type: general
      – SubjectFull: COMPUTER software quality control
        Type: general
      – SubjectFull: CONTEXTUAL learning
        Type: general
      – SubjectFull: BOTTLENECKS (Manufacturing)
        Type: general
      – SubjectFull: GENERALIZATION
        Type: general
    Titles:
      – TitleFull: CodeTranFix: A Neural Machine Translation Approach for Context-Aware Java Program Repair with CodeBERT.
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Lu, Yiwei
      – PersonEntity:
          Name:
            NameFull: Ye, Shuxia
      – PersonEntity:
          Name:
            NameFull: Qi, Liang
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 01
              M: 04
              Text: Apr2025
              Type: published
              Y: 2025
          Identifiers:
            – Type: issn-print
              Value: 20763417
          Numbering:
            – Type: volume
              Value: 15
            – Type: issue
              Value: 7
          Titles:
            – TitleFull: Applied Sciences (2076-3417)
              Type: main
ResultId 1