Automated Testing for Service-Oriented Architecture: Leveraging Large Language Models for Enhanced Service Composition

Bibliographic Details
Published in:	IEEE Access, Vol. 13, pp. 89627-89640
Main Authors:	Altin, Mahsun; Mutlu, Behcet; Kilinc, Deniz; Cakir, Altan
Format:	Journal Article
Language:	English
Published:	Piscataway: The Institute of Electrical and Electronics Engineers, Inc. (IEEE), 2025
Subjects:	Accuracy; AI-driven automation; Application programming interface; Artificial intelligence; Automation; Chatbots; Completion time; Complexity theory; Computer architecture; Flight; Indexing; Large language models; Parameters; Real time; Real-time systems; Scalability; Scalability in API testing; Service oriented architecture; Software development management; Software reliability; Software systems; Software testing; Software validation
ISSN:	2169-3536

Abstract	This article explores the use of Large Language Models (LLMs) to automate validation and verification in Application Programming Interface (API) testing within a Service-Oriented Architecture (SOA). The models evaluated include proprietary models (OpenAI's ChatGPT 4o and ChatGPT 4o-mini, Anthropic's Claude 3.5 Sonnet and Claude 3.7 Sonnet, and Google's Gemini 1.5 Pro, Gemini 2.0 Flash, and Gemini 2.0 Flash-Lite), the open-source Qwen2.5-14B-Instruct-1M, and DeepSeek R1 and DeepSeek V3, which were accessed through commercial APIs despite having open-source variants. Our system compares internal responses from the Enuygun Web Server against third-party API outputs in both JSON and XML formats, validating critical parameters such as flight prices, baggage allowances, and seat availability. We generated 100 diverse test scenarios of varying complexity (1-4 flight results) by randomly altering request and response parameters. Experimental results show that Google Gemini 2.0 Flash achieved high accuracy (up to 99.98%) with the lowest completion time (85.34 seconds), while Qwen2.5-14B-Instruct-1M exhibited limited capability in processing complex formats. OpenAI's ChatGPT and Anthropic's Claude Sonnet models also demonstrated strong performance in single-flight validation scenarios, making them suitable for low-latency, high-precision tasks. Our findings indicate that some open-source models offer promising cost-effective alternatives, though performance varies significantly. Integrating LLMs reduced manual workload, improved test scalability, and enabled real-time validation across large-scale datasets. As LLM technologies mature, we anticipate further advances in automation, accuracy, and efficiency in software validation systems.
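The validation task the abstract describes (checking an internal JSON response against a third-party XML response, and creating test scenarios by randomly altering parameters) can be illustrated with a small deterministic sketch. This is not the article's LLM-based method: it is a plain rule-based baseline of the kind an LLM validator's verdicts could be checked against. The schemas, the field names (`price`, `baggage_kg`, `seats`), the flight ID, and all function names are hypothetical, since the record does not publish the actual Enuygun or provider formats.

```python
import json
import random
import xml.etree.ElementTree as ET

# Hypothetical field names; the real response schemas are not given in the record.
FIELDS = ("price", "baggage_kg", "seats")


def parse_internal(json_text):
    """Parse the internal (JSON) response into {flight_id: {field: value}}."""
    flights = json.loads(json_text)["flights"]
    return {f["id"]: {k: float(f[k]) for k in FIELDS} for f in flights}


def parse_provider(xml_text):
    """Parse the third-party (XML) response into the same shape."""
    root = ET.fromstring(xml_text)
    return {
        node.get("id"): {k: float(node.findtext(k)) for k in FIELDS}
        for node in root.iter("flight")
    }


def find_mismatches(internal, provider):
    """Return sorted (flight_id, field) pairs whose values disagree."""
    mismatches = []
    for flight_id in sorted(set(internal) | set(provider)):
        for field in FIELDS:
            left = internal.get(flight_id, {}).get(field)
            right = provider.get(flight_id, {}).get(field)
            # Exact float comparison is enough for this sketch; a real
            # checker would likely compare prices as decimals.
            if left != right:
                mismatches.append((flight_id, field))
    return mismatches


def mutate(flights, rng):
    """Copy the parsed flights and alter one randomly chosen field."""
    mutated = {fid: dict(vals) for fid, vals in flights.items()}
    flight_id = rng.choice(sorted(mutated))
    field = rng.choice(FIELDS)
    mutated[flight_id][field] += 1.0
    return mutated, (flight_id, field)


# Made-up sample data for a single-flight scenario.
INTERNAL_JSON = (
    '{"flights": [{"id": "TK1", "price": 120.5, "baggage_kg": 20, "seats": 4}]}'
)
PROVIDER_XML = (
    '<flights><flight id="TK1"><price>120.50</price>'
    "<baggage_kg>20</baggage_kg><seats>4</seats></flight></flights>"
)

if __name__ == "__main__":
    internal = parse_internal(INTERNAL_JSON)
    provider = parse_provider(PROVIDER_XML)
    print(find_mismatches(internal, provider))  # the two responses agree
    altered, injected = mutate(provider, random.Random(0))
    print(find_mismatches(internal, altered), injected)
```

Because `mutate` returns the field it changed, each generated scenario carries its own ground truth, which is one plausible way to score a validator's accuracy over many randomized cases.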
Author Affiliations:	Mahsun Altin (ORCID 0000-0002-5285-2593; altinma21@itu.edu.tr), Department of Computer Engineering, Istanbul Technical University, İstanbul, Türkiye; Behcet Mutlu (ORCID 0000-0003-1054-8558), Wingie Enuygun Group, İstanbul, Türkiye; Deniz Kilinc (ORCID 0000-0002-2336-8831), Department of Computer Engineering, University of Bakirçay, İzmir, Türkiye; Altan Cakir, Department of Data Science and Analytics, Istanbul Technical University, İstanbul, Türkiye
Copyright:	The Institute of Electrical and Electronics Engineers, Inc. (IEEE), 2025
DOI:	10.1109/ACCESS.2025.3571994
Funding:	Wingie Enuygun Group; ITU-BAP Doctoral Projects 2025
License:	CC BY 4.0 (https://creativecommons.org/licenses/by/4.0/legalcode)
Open Access Link:	https://ieeexplore.ieee.org/document/11007529