Towards Autonomous Testing Agents via Conversational Large Language Models
Software testing is an important part of the development cycle, yet it requires specialized expertise and substantial developer effort to adequately test software. Recent discoveries of the capabilities of large language models (LLMs) suggest that they can be used as automated testing assistants, an...
Saved in:
| Published in: | IEEE/ACM International Conference on Automated Software Engineering : [proceedings] pp. 1688 - 1693 |
|---|---|
| Main Authors: | , , , |
| Format: | Conference Proceeding |
| Language: | English |
| Published: |
IEEE
11.09.2023
|
| Subjects: | |
| ISSN: | 2643-1572 |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Abstract | Software testing is an important part of the development cycle, yet it requires specialized expertise and substantial developer effort to adequately test software. Recent discoveries of the capabilities of large language models (LLMs) suggest that they can be used as automated testing assistants, and thus provide helpful information and even drive the testing process. To highlight the potential of this technology, we present a taxonomy of LLM-based testing agents based on their level of autonomy, and describe how a greater level of autonomy can benefit developers in practice. An example use of LLMs as a testing assistant is provided to demonstrate how a conversational framework for testing can help developers. This also highlights how the often criticized "hallucination" of LLMs can be beneficial for testing. We identify other tangible benefits that LLM-driven testing agents can bestow, and also discuss potential limitations. |
|---|---|
| AbstractList | Software testing is an important part of the development cycle, yet it requires specialized expertise and substantial developer effort to adequately test software. Recent discoveries of the capabilities of large language models (LLMs) suggest that they can be used as automated testing assistants, and thus provide helpful information and even drive the testing process. To highlight the potential of this technology, we present a taxonomy of LLM-based testing agents based on their level of autonomy, and describe how a greater level of autonomy can benefit developers in practice. An example use of LLMs as a testing assistant is provided to demonstrate how a conversational framework for testing can help developers. This also highlights how the often criticized "hallucination" of LLMs can be beneficial for testing. We identify other tangible benefits that LLM-driven testing agents can bestow, and also discuss potential limitations. |
| Author | Feldt, Robert Yoon, Juyeon Kang, Sungmin Yoo, Shin |
| Author_xml | – sequence: 1 givenname: Robert surname: Feldt fullname: Feldt, Robert email: robert.feldt@chalmers.se organization: Chalmers University of Technology – sequence: 2 givenname: Sungmin surname: Kang fullname: Kang, Sungmin email: sungmin.kang@kaist.ac.kr organization: KAIST – sequence: 3 givenname: Juyeon surname: Yoon fullname: Yoon, Juyeon email: juyeon.yoon@kaist.ac.kr organization: KAIST – sequence: 4 givenname: Shin surname: Yoo fullname: Yoo, Shin email: shin.yoo@kaist.ac.kr organization: KAIST |
| BookMark | eNotj1FLwzAUhaMouM39An3oH-i8SZp0eSxjTqXig_V53DY3pdIl0rQT_70FhcM533k5cJbsygdPjN1x2HAO5qF43ysthNkIEHIDwLPtBVub3GylAimM0dklWwidyZSrXNywZYyfAGou-YK9VOEbBxuTYhqDD6cwxaSiOHa-TYqW_BiTc4fJLvgzDRHHLnjskxKHlmb37YQzvAZLfbxl1w77SOv_XLGPx321e0rLt8PzrihTlBrG1BFZWYNooEFbK3TKOjJNpmuOwupG5k3jnHV8Vo1mS0ZJgFwYTiRQoVyx-7_djoiOX0N3wuHnyEHMjzXIX6w8UYo |
| CODEN | IEEPAD |
| ContentType | Conference Proceeding |
| DBID | 6IE 6IL CBEJK RIE RIL |
| DOI | 10.1109/ASE56229.2023.00148 |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE/IET Electronic Library (IEL) (UW System Shared) IEEE Proceedings Order Plans (POP All) 1998-Present |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE/IET Electronic Library (IEL) (UW System Shared) url: https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Computer Science |
| EISBN | 9798350329964 |
| EISSN | 2643-1572 |
| EndPage | 1693 |
| ExternalDocumentID | 10298360 |
| Genre | orig-research |
| GroupedDBID | 6IE 6IF 6IH 6IK 6IL 6IM 6IN 6J9 AAJGR AAWTH ABLEC ACREN ADYOE ADZIZ AFYQB ALMA_UNASSIGNED_HOLDINGS AMTXH BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IEGSK IPLJI M43 OCL RIE RIL |
| ID | FETCH-LOGICAL-a360t-feed3b02c0cadb5af5dfe9c46b1a2d6c37ccffdf1df1ba98e953007291ee2a5a3 |
| IEDL.DBID | RIE |
| ISICitedReferencesCount | 15 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001103357200135&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| IngestDate | Wed Aug 27 02:32:41 EDT 2025 |
| IsPeerReviewed | false |
| IsScholarly | true |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-a360t-feed3b02c0cadb5af5dfe9c46b1a2d6c37ccffdf1df1ba98e953007291ee2a5a3 |
| PageCount | 6 |
| ParticipantIDs | ieee_primary_10298360 |
| PublicationCentury | 2000 |
| PublicationDate | 2023-Sept.-11 |
| PublicationDateYYYYMMDD | 2023-09-11 |
| PublicationDate_xml | – month: 09 year: 2023 text: 2023-Sept.-11 day: 11 |
| PublicationDecade | 2020 |
| PublicationTitle | IEEE/ACM International Conference on Automated Software Engineering : [proceedings] |
| PublicationTitleAbbrev | ASE |
| PublicationYear | 2023 |
| Publisher | IEEE |
| Publisher_xml | – name: IEEE |
| SSID | ssj0051577 ssib057256115 |
| Score | 2.3876717 |
| Snippet | Software testing is an important part of the development cycle, yet it requires specialized expertise and substantial developer effort to adequately test... |
| SourceID | ieee |
| SourceType | Publisher |
| StartPage | 1688 |
| SubjectTerms | artificial intelligence Automation Drives large language model machine learning Middleware Oral communication Software testing Taxonomy test automation Testing |
| Title | Towards Autonomous Testing Agents via Conversational Large Language Models |
| URI | https://ieeexplore.ieee.org/document/10298360 |
| WOSCitedRecordID | wos001103357200135&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LawIxEA5VeujJPiytfZBDr9ua7CYxRxGlFBGhtniTbB5FKGtxV39_Z-Jqe-mhsITdPQ2ThMlMvu8bQh40ZMk5HB0SbV2GCQpL4NMmnknvpPDcRDGd97GaTHrzuZ7WZPXIhfHeR_CZf8TXeJfvVnaDpTLY4Vwj6aBBGkrJHVlrv3iEguDN2OHsC3FaqVpmiHX1U_91CKGeIzeFo6gpw44_vxqqxHgyav3TklPS_mHm0ekh5pyRI1-ck9a-NQOtd-oFeZlFOGxJ-5sKaQuQ39MZCmoUH7SPbKqSbpeGDhBzvi7rgiAdIywcxl0Jk2KftM-yTd5Gw9ngOanbJiQGLKqSACaA_7ntWuNyYYJwwWubyZwZ7qRNlbUhuMDgyY3ueS3SKCDOPEyNMOklaRarwl8RmjIrTda1kFWZzASep6LnlXUmZyEXMrsmbfTN4munjLHYu6Xzx_8bcoLuR7wFY7ekWa03_o4c2221LNf3cT6_ASecofY |
| linkProvider | IEEE |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LS8NAEB60Cnqqj4pv9-A12k2ySfdYSqVqLAWj9Fb2KQVJpUn7-93ZptWLByEsSU7Dzi6zMzvf9wHccpclS3d0CLjSMSYoNHCfKjA0MTphJhSeTOc9S4fDznjMRzVY3WNhjDG--czc4au_y9cztcBSmdvhIUfQwTbsoHQWW8G11suHpS58U7o5_bpInaY10RBt8_vua98F-xDRKSHSmlLU_PklqeIjykPzn7YcQOsHm0dGm6hzCFumOILmWpyB1Hv1GJ5y3xBbku6iQuCCy_BJjpQaxQfpIp6qJMupID3sOp-XdUmQZNgY7sZVEZOgUtpn2YK3h37eGwS1cEIgnEVVYJ0JzgOhaiuhJROWaWu4ihNJRagTFaVKWastdY8UvGM4izyFODXOOUxEJ9AoZoU5BRJRlYi4rVxeJWJhQxmxjkmVFpJayZL4DFo4N5OvFTfGZD0t53_8v4G9Qf6STbLH4fMF7KMrsPuC0ktoVPOFuYJdtaym5fza-_YbyT-lQQ |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=IEEE%2FACM+International+Conference+on+Automated+Software+Engineering+%3A+%5Bproceedings%5D&rft.atitle=Towards+Autonomous+Testing+Agents+via+Conversational+Large+Language+Models&rft.au=Feldt%2C+Robert&rft.au=Kang%2C+Sungmin&rft.au=Yoon%2C+Juyeon&rft.au=Yoo%2C+Shin&rft.date=2023-09-11&rft.pub=IEEE&rft.eissn=2643-1572&rft.spage=1688&rft.epage=1693&rft_id=info:doi/10.1109%2FASE56229.2023.00148&rft.externalDocID=10298360 |