A Probabilistic Delta Debugging Approach for Abstract Syntax Trees
Delta debugging provides an efficient and systematic approach to isolate and identify a minimal subsequence that exhibit a specific property. A notable trend in the development of delta debugging is to address data with domain-specific structures, such as programs. However, the efficiency and effect...
Saved in:
| Published in: | Proceedings - International Symposium on Software Reliability Engineering pp. 763 - 773 |
|---|---|
| Main Authors: | , , , , , |
| Format: | Conference Proceeding |
| Language: | English |
| Published: |
IEEE
09.10.2023
|
| Subjects: | |
| ISSN: | 2332-6549 |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Abstract | Delta debugging provides an efficient and systematic approach to isolate and identify a minimal subsequence that exhibit a specific property. A notable trend in the development of delta debugging is to address data with domain-specific structures, such as programs. However, the efficiency and effectiveness of domain-specific delta debugging algorithms still present challenges. Probabilistic delta debugging (ProbDD) enhances the ddmin algorithm, which forms the foundation of most domain-specific delta debugging approaches, by incorporating a probabilistic model. By replacing the ddmin component with ProbDD, algorithms relying on ddmin can achieve superior performance. Meanwhile, domain-specific delta debugging techniques, such as Perses, have been designed to cater to the abstract syntax tree (AST) and follow predefined sequences of attempts to minimize programs. These techniques benefit from the use of AST-based transformations, enabling them to achieve even smaller results efficiently. However, we observe that ProbDD assumes independence between elements, which may limit their performance in capturing syntactic relationships. Additionally, domain-specific approaches such as Perses rely on a predefined sequence of attempts the removal of the element and fail to utilize the information from existing test results.In this paper, we propose T-PDD, a novel approach that addresses these limitations. T-PDD leverages the AST to construct a probabilistic model, both utilizing historical test results and capturing syntactic relationships to estimate the probabilities of elements being retained in the result. It selects a set of elements that maximizes the gain for the next test based on the model and updates the model using the test results.In our evaluation, we assess our approach on 107 real-world subjects. The results demonstrate an average improvement of 26.95% in processing time and a 3.4x reduction in result size compared to Perses in the best-case scenario. |
|---|---|
| AbstractList | Delta debugging provides an efficient and systematic approach to isolate and identify a minimal subsequence that exhibit a specific property. A notable trend in the development of delta debugging is to address data with domain-specific structures, such as programs. However, the efficiency and effectiveness of domain-specific delta debugging algorithms still present challenges. Probabilistic delta debugging (ProbDD) enhances the ddmin algorithm, which forms the foundation of most domain-specific delta debugging approaches, by incorporating a probabilistic model. By replacing the ddmin component with ProbDD, algorithms relying on ddmin can achieve superior performance. Meanwhile, domain-specific delta debugging techniques, such as Perses, have been designed to cater to the abstract syntax tree (AST) and follow predefined sequences of attempts to minimize programs. These techniques benefit from the use of AST-based transformations, enabling them to achieve even smaller results efficiently. However, we observe that ProbDD assumes independence between elements, which may limit their performance in capturing syntactic relationships. Additionally, domain-specific approaches such as Perses rely on a predefined sequence of attempts the removal of the element and fail to utilize the information from existing test results.In this paper, we propose T-PDD, a novel approach that addresses these limitations. T-PDD leverages the AST to construct a probabilistic model, both utilizing historical test results and capturing syntactic relationships to estimate the probabilities of elements being retained in the result. It selects a set of elements that maximizes the gain for the next test based on the model and updates the model using the test results.In our evaluation, we assess our approach on 107 real-world subjects. The results demonstrate an average improvement of 26.95% in processing time and a 3.4x reduction in result size compared to Perses in the best-case scenario. |
| Author | Wang, Guancheng Zhang, Xin Zhang, Lu Wu, Yiqian Xiong, Yingfei Zhu, Qihao |
| Author_xml | – sequence: 1 givenname: Guancheng surname: Wang fullname: Wang, Guancheng email: guancheng.wang@pku.edu.cn organization: Ministry of Education (Peking University); School of Computer Science, Peking University,Key Laboratory of High Confidence Software Technologies,P. R. China,100871 – sequence: 2 givenname: Yiqian surname: Wu fullname: Wu, Yiqian email: wuyiqian@pku.edu.cn organization: Ministry of Education (Peking University); School of Computer Science, Peking University,Key Laboratory of High Confidence Software Technologies,P. R. China,100871 – sequence: 3 givenname: Qihao surname: Zhu fullname: Zhu, Qihao email: zhuqh@pku.edu.cn organization: Ministry of Education (Peking University); School of Computer Science, Peking University,Key Laboratory of High Confidence Software Technologies,P. R. China,100871 – sequence: 4 givenname: Yingfei surname: Xiong fullname: Xiong, Yingfei email: xiongyf@pku.edu.cn organization: Ministry of Education (Peking University); School of Computer Science, Peking University,Key Laboratory of High Confidence Software Technologies,P. R. China,100871 – sequence: 5 givenname: Xin surname: Zhang fullname: Zhang, Xin email: xin@pku.edu.cn organization: Ministry of Education (Peking University); School of Computer Science, Peking University,Key Laboratory of High Confidence Software Technologies,P. R. China,100871 – sequence: 6 givenname: Lu surname: Zhang fullname: Zhang, Lu email: zhanglucs@pku.edu.cn organization: Ministry of Education (Peking University); School of Computer Science, Peking University,Key Laboratory of High Confidence Software Technologies,P. R. China,100871 |
| BookMark | eNotjdFKwzAUQKMouM39gUJ-oPUmuWmSxzqnDgaKnc8jadMaqW1JK2x_70Rfznk7Z04uur7zhNwySBkDc7cpire1NBp1yoGLFAAyOCNLo4wWEgSTBsU5mXEheJJJNFdkPo6fAByQ8Rm5z-lr7J11oQ3jFEr64NvJnui-myZ0Dc2HIfa2_KB1H2nuxinacqLFsZvsge6i9-M1uaxtO_rlvxfk_XG9Wz0n25enzSrfJuH0mpJKaWMQsMRaedA-k545I6xCz9Eqbp0DVE5qzcq6lkpWhjGJFdSq4hZRLMjNXzd47_dDDF82HvcMuPntih9DM0wy |
| CODEN | IEEPAD |
| ContentType | Conference Proceeding |
| DBID | 6IE 6IL CBEJK RIE RIL |
| DOI | 10.1109/ISSRE59848.2023.00060 |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Computer Science |
| EISBN | 9798350315943 |
| EISSN | 2332-6549 |
| EndPage | 773 |
| ExternalDocumentID | 10299940 |
| Genre | orig-research |
| GrantInformation_xml | – fundername: National Natural Science Foundation of China funderid: 10.13039/501100001809 – fundername: Research and Development funderid: 10.13039/100006190 |
| GroupedDBID | 6IE 6IF 6IH 6IK 6IL 6IN AAJGR AAWTH ABLEC ACGFS ADZIZ ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IEGSK IPLJI M43 OCL RIE RIL RNS |
| ID | FETCH-LOGICAL-i204t-d7899404c4f7e08e65e1b93a74e24a72abb047b5881cff575d91154d0f7d2a443 |
| IEDL.DBID | RIE |
| ISICitedReferencesCount | 2 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001096886300067&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| IngestDate | Wed Aug 27 02:22:57 EDT 2025 |
| IsPeerReviewed | false |
| IsScholarly | true |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-i204t-d7899404c4f7e08e65e1b93a74e24a72abb047b5881cff575d91154d0f7d2a443 |
| PageCount | 11 |
| ParticipantIDs | ieee_primary_10299940 |
| PublicationCentury | 2000 |
| PublicationDate | 2023-Oct.-9 |
| PublicationDateYYYYMMDD | 2023-10-09 |
| PublicationDate_xml | – month: 10 year: 2023 text: 2023-Oct.-9 day: 09 |
| PublicationDecade | 2020 |
| PublicationTitle | Proceedings - International Symposium on Software Reliability Engineering |
| PublicationTitleAbbrev | ISSRE |
| PublicationYear | 2023 |
| Publisher | IEEE |
| Publisher_xml | – name: IEEE |
| SSID | ssj0020412 |
| Score | 2.2489274 |
| Snippet | Delta debugging provides an efficient and systematic approach to isolate and identify a minimal subsequence that exhibit a specific property. A notable trend... |
| SourceID | ieee |
| SourceType | Publisher |
| StartPage | 763 |
| SubjectTerms | Abstract Syntax Tree Benchmark testing Debugging Delta Debugging Estimation Probabilistic logic Probabilistic Model Refining Syntactics Systematics |
| Title | A Probabilistic Delta Debugging Approach for Abstract Syntax Trees |
| URI | https://ieeexplore.ieee.org/document/10299940 |
| WOSCitedRecordID | wos001096886300067&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlZ1LS8NAEMcXLR481UfFN3vwGt0km-7usWqLgpRiq_RW9jGRgqQlTUW_vTvbtHrx4CWEEEgyy2ZmZ-f_G0KuHDNaWwuRMSaJuHcYkcm1H5C2iEF4n88CdvH1SfT7cjxWg1qsHrQwABCKz-AaT8NevpvZJabK_Az3P0_F_Qp9WwixEmttVlcIjqolOjFTN4_D4XM3UzLUbyWIMWWIofzVQiV4kF7zn8_eI60fLR4dbLzMPtmC4oA0180YaD03D8ltB28zAZiL7GV6D--V9kezxKTyG-3U9HDqw1TaMZjisBUdfhWV_qSjEmDRIi-97ujuIaobJERT_7lV5ITEV-KW5wKYhHYGsVGpFhwSrkWijWFcmEzK2Oa5D8ycQvqOY7lwieY8PSKNYlbAMaGpdOBA6yy32scoIFMbdhhBaAtZJk9IC20yma8YGJO1OU7_uH5GdtHsoexNnZNGVS7hguzYj2q6KC_DyH0DNKaaIA |
| linkProvider | IEEE |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlZ3PT8IwFMcbgyZ6wh8Yf9uD12m3tbQ9okIgIiGChhvpjzdDYoaBYfS_ty0DvXjwsizLkm196d7r6_t-HkJXlmiljIFIa51E1DmMSGfKGaTOY-DO55OAXXzp8l5PjEayX4rVgxYGAELxGVz707CXb6dm4VNlboa7n6ekboW-yShN4qVca72-8uioUqQTE3nTGQyemkyKUMGVeJAp8SDKX01Ugg9pVf_59F1U-1Hj4f7az-yhDcj3UXXVjgGXs_MA3Tb8bTogcz19Gd_DW6HcUS98WvkVN0p-OHaBKm5on-QwBR585YX6xMMZwLyGnlvN4V07KlskRBP3uUVkufCvRA3NOBABdQaxlqniFBKqeKK0JpRrJkRsssyFZlZ6_o4lGbeJojQ9RJV8msMRwqmwYEEplhnlohQQqQl7jMCVAcbEMar5MRm_LykY49VwnPxx_RJtt4eP3XG303s4RTveBKEITp6hSjFbwDnaMh_FZD67CFb8Bmo6nWc |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=Proceedings+-+International+Symposium+on+Software+Reliability+Engineering&rft.atitle=A+Probabilistic+Delta+Debugging+Approach+for+Abstract+Syntax+Trees&rft.au=Wang%2C+Guancheng&rft.au=Wu%2C+Yiqian&rft.au=Zhu%2C+Qihao&rft.au=Xiong%2C+Yingfei&rft.date=2023-10-09&rft.pub=IEEE&rft.eissn=2332-6549&rft.spage=763&rft.epage=773&rft_id=info:doi/10.1109%2FISSRE59848.2023.00060&rft.externalDocID=10299940 |