Automated Testing for Service-Oriented Architecture: Leveraging Large Language Models for Enhanced Service Composition

This article explores the application of Large Language Models (LLMs), including proprietary models such as OpenAI's ChatGPT 4o and ChatGPT 4o-mini, Anthropic's Claude 3.5 Sonnet and Claude 3.7 Sonnet, and Google's Gemini 1.5 Pro, Gemini 2.0 Flash, and Gemini 2.0 Flash-Lite, as well a...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:IEEE access Ročník 13; s. 89627 - 89640
Hlavní autoři: Altin, Mahsun, Mutlu, Behcet, Kilinc, Deniz, Cakir, Altan
Médium: Journal Article
Jazyk:angličtina
Vydáno: Piscataway IEEE 2025
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Témata:
ISSN:2169-3536, 2169-3536
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Abstract This article explores the application of Large Language Models (LLMs), including proprietary models such as OpenAI's ChatGPT 4o and ChatGPT 4o-mini, Anthropic's Claude 3.5 Sonnet and Claude 3.7 Sonnet, and Google's Gemini 1.5 Pro, Gemini 2.0 Flash, and Gemini 2.0 Flash-Lite, as well as open-source alternatives including Qwen2.5-14B-Instruct-1M, and commercially accessed models such as DeepSeek R1 and DeepSeek V3, which were tested via APIs despite having open-source variants, to automate validation and verification in Application Programming Interface (API) testing within a Service-Oriented Architecture (SOA). Our system compares internal responses from the Enuygun Web Server against third-party API outputs in both JSON and XML formats, validating critical parameters such as flight prices, baggage allowances, and seat availability. We generated 100 diverse test scenarios across varying complexities (1-4 flight results) by randomly altering request and response parameters. Experimental results show that Google Gemini 2.0 Flash achieved high accuracy (up to 99.98%) with the lowest completion time (85.34 seconds), while Qwen2.5-14B-Instruct-1M exhibited limited capability in processing complex formats. Models such as OpenAI's ChatGPT and Anthropic's Claude Sonnet models also demonstrated strong performance in single-flight validation scenarios, making them suitable for low-latency, high-precision tasks. Our findings indicate that some open-source models can offer promising cost-effective alternatives, though performance significantly varies. This integration of LLMs reduced manual workload, improved test scalability, and enabled real-time validation across large-scale datasets. As LLM technologies mature, we anticipate further advances in automation, accuracy, and efficiency in software validation systems.
AbstractList This article explores the application of Large Language Models (LLMs), including proprietary models such as OpenAI’s ChatGPT 4o and ChatGPT 4o-mini, Anthropic’s Claude 3.5 Sonnet and Claude 3.7 Sonnet, and Google’s Gemini 1.5 Pro, Gemini 2.0 Flash, and Gemini 2.0 Flash-Lite, as well as open-source alternatives including Qwen2.5-14B-Instruct-1M, and commercially accessed models such as DeepSeek R1 and DeepSeek V3, which were tested via APIs despite having open-source variants, to automate validation and verification in Application Programming Interface (API) testing within a Service-Oriented Architecture (SOA). Our system compares internal responses from the Enuygun Web Server against third-party API outputs in both JSON and XML formats, validating critical parameters such as flight prices, baggage allowances, and seat availability. We generated 100 diverse test scenarios across varying complexities (1-4 flight results) by randomly altering request and response parameters. Experimental results show that Google Gemini 2.0 Flash achieved high accuracy (up to 99.98%) with the lowest completion time (85.34 seconds), while Qwen2.5-14B-Instruct-1M exhibited limited capability in processing complex formats. Models such as OpenAI’s ChatGPT and Anthropic’s Claude Sonnet models also demonstrated strong performance in single-flight validation scenarios, making them suitable for low-latency, high-precision tasks. Our findings indicate that some open-source models can offer promising cost-effective alternatives, though performance significantly varies. This integration of LLMs reduced manual workload, improved test scalability, and enabled real-time validation across large-scale datasets. As LLM technologies mature, we anticipate further advances in automation, accuracy, and efficiency in software validation systems.
Author Kilinc, Deniz
Altin, Mahsun
Mutlu, Behcet
Cakir, Altan
Author_xml – sequence: 1
  givenname: Mahsun
  orcidid: 0000-0002-5285-2593
  surname: Altin
  fullname: Altin, Mahsun
  email: altinma21@itu.edu.tr
  organization: Department of Computer Engineering, Istanbul Technical University, İstanbul, Türkiye
– sequence: 2
  givenname: Behcet
  orcidid: 0000-0003-1054-8558
  surname: Mutlu
  fullname: Mutlu, Behcet
  organization: Wingie Enuygun Group, İstanbul, Türkiye
– sequence: 3
  givenname: Deniz
  orcidid: 0000-0002-2336-8831
  surname: Kilinc
  fullname: Kilinc, Deniz
  organization: Department of Computer Engineering, University of Bakirçay, İzmir, Türkiye
– sequence: 4
  givenname: Altan
  surname: Cakir
  fullname: Cakir, Altan
  organization: Department of Data Science and Analytics, Istanbul Technical University, İstanbul, Türkiye
BookMark eNp9UU2P0zAQtdAisSz7C-AQiXOKP-I45lZFBVYq2kOXszXrjLOu2rg4TqX99zibIhAHfBg_jd97mvF7S66GMCAh7xldMUb1p3Xbbna7FadcroRUTOvqFbnmrNalkKK--gu_IbfjuKf5NLkl1TU5r6cUjpCwKx5wTH7oCxdiscN49hbL--hxmB_X0T75hDZNET8XWzxjhH5mbyH2mOvQT5DB99DhYXzx2AxPMNisvZgVbTiewuiTD8M78trBYcTby31DfnzZPLTfyu3917t2vS1txVgqK6q5rhrMW9VcNAxErWjHqs4y5-rGCcdBVso92toCANM2o445Ck5Z5FLckLvFtwuwN6fojxCfTQBvXhoh9gZi8vaAxlaPnYK6oY6ySlQSJHdCNNJ1tnFWqOz1cfE6xfBzyp9l9mGKQx7fCM54I5nSNLP0wrIxjGNEZ6xPMO-cIviDYdTMqZklNTOnZi6pZa34R_t74v-rPiwqj4h_FIxSJbkWvwAQyqaJ
CODEN IAECCG
CitedBy_id crossref_primary_10_3390_buildings15132273
Cites_doi 10.3390/app14052074
10.1109/TSE.2023.3334955
10.1145/3597503.3639150
10.1145/3641289
10.1109/OJCOMS.2024.3456549
10.1108/00251740710718935
10.1016/j.future.2024.05.034
10.3390/pr10091782
10.3390/app12094369
10.1109/ICNSurv.2013.6548579
10.1109/SANER60148.2024.00041
10.3390/electronics13173563
10.1145/3643733
10.1007/s12599-009-0074-z
10.1109/ACCESS.2021.3073041
10.1109/TKDE.2022.3148237
10.1145/3659677.3659749
10.1145/3611643.3613083
10.1109/QRS60937.2023.00029
ContentType Journal Article
Copyright Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2025
Copyright_xml – notice: Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2025
DBID 97E
ESBDL
RIA
RIE
AAYXX
CITATION
7SC
7SP
7SR
8BQ
8FD
JG9
JQ2
L7M
L~C
L~D
DOA
DOI 10.1109/ACCESS.2025.3571994
DatabaseName IEEE Xplore (IEEE)
IEEE Xplore Open Access Journals
IEEE All-Society Periodicals Package (ASPP) 1998–Present
IEEE Xplore
CrossRef
Computer and Information Systems Abstracts
Electronics & Communications Abstracts
Engineered Materials Abstracts
METADEX
Technology Research Database
Materials Research Database
ProQuest Computer Science Collection
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
DOAJ Directory of Open Access Journals
DatabaseTitle CrossRef
Materials Research Database
Engineered Materials Abstracts
Technology Research Database
Computer and Information Systems Abstracts – Academic
Electronics & Communications Abstracts
ProQuest Computer Science Collection
Computer and Information Systems Abstracts
Advanced Technologies Database with Aerospace
METADEX
Computer and Information Systems Abstracts Professional
DatabaseTitleList Materials Research Database


Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Open Access Full Text
  url: https://www.doaj.org/
  sourceTypes: Open Website
– sequence: 2
  dbid: RIE
  name: IEEE Xplore
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
EISSN 2169-3536
EndPage 89640
ExternalDocumentID oai_doaj_org_article_c4bd7a680f014345a52f3385fdc8fc37
10_1109_ACCESS_2025_3571994
11007529
Genre orig-research
GrantInformation_xml – fundername: Wingie Enuygun Group
– fundername: ITU-BAP Doctoral Projects 2025
GroupedDBID 0R~
4.4
5VS
6IK
97E
AAJGR
ABAZT
ABVLG
ACGFS
ADBBV
AGSQL
ALMA_UNASSIGNED_HOLDINGS
BCNDV
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
EBS
EJD
ESBDL
GROUPED_DOAJ
IPLJI
JAVBF
KQ8
M43
M~E
O9-
OCL
OK1
RIA
RIE
RNS
AAYXX
CITATION
7SC
7SP
7SR
8BQ
8FD
JG9
JQ2
L7M
L~C
L~D
ID FETCH-LOGICAL-c411t-4092948e57162381a3670d14dc1ff68f3f2a547fbc6caaa19cbc6d1f0af7ce253
IEDL.DBID DOA
ISICitedReferencesCount 1
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001498048200024&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 2169-3536
IngestDate Fri Oct 03 12:51:36 EDT 2025
Sat Nov 01 15:15:59 EDT 2025
Tue Nov 18 22:32:16 EST 2025
Sat Nov 29 07:54:26 EST 2025
Wed Aug 27 01:51:27 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Language English
License https://creativecommons.org/licenses/by/4.0/legalcode
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c411t-4092948e57162381a3670d14dc1ff68f3f2a547fbc6caaa19cbc6d1f0af7ce253
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ORCID 0000-0003-1054-8558
0000-0002-5285-2593
0000-0002-2336-8831
OpenAccessLink https://doaj.org/article/c4bd7a680f014345a52f3385fdc8fc37
PQID 3212851790
PQPubID 4845423
PageCount 14
ParticipantIDs doaj_primary_oai_doaj_org_article_c4bd7a680f014345a52f3385fdc8fc37
crossref_primary_10_1109_ACCESS_2025_3571994
ieee_primary_11007529
crossref_citationtrail_10_1109_ACCESS_2025_3571994
proquest_journals_3212851790
PublicationCentury 2000
PublicationDate 20250000
2025-00-00
20250101
2025-01-01
PublicationDateYYYYMMDD 2025-01-01
PublicationDate_xml – year: 2025
  text: 20250000
PublicationDecade 2020
PublicationPlace Piscataway
PublicationPlace_xml – name: Piscataway
PublicationTitle IEEE access
PublicationTitleAbbrev Access
PublicationYear 2025
Publisher IEEE
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Publisher_xml – name: IEEE
– name: The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
References ref13
ref12
Liu (ref29) 2024
ref15
ref14
ref31
ref11
ref10
(ref25) 2024
ref2
ref17
ref16
Li (ref18) 2024
Yang (ref30) 2025
Andriyanto (ref22) 2021
ref23
ref21
Hurst (ref24) 2024
Roche (ref1) 2014
Wang (ref19) 2024
Guo (ref28) 2025
ref8
ref7
ref9
ref4
(ref26) 2025
ref3
Jin (ref20) 2024
ref6
ref5
Georgiev (ref27) 2024
References_xml – ident: ref6
  doi: 10.3390/app14052074
– ident: ref10
  doi: 10.1109/TSE.2023.3334955
– year: 2021
  ident: ref22
  article-title: X-driven methodologies for SOA system development—A survey
  publication-title: arXiv:2109.01805
– ident: ref13
  doi: 10.1145/3597503.3639150
– year: 2024
  ident: ref20
  article-title: From LLMs to LLM-based agents for software engineering: A survey of current, challenges and future
  publication-title: arXiv:2408.02479
– ident: ref7
  doi: 10.1145/3641289
– year: 2014
  ident: ref1
  article-title: Integrating service orientated architecture design principles into software as a service applications
– ident: ref21
  doi: 10.1109/OJCOMS.2024.3456549
– volume-title: Claude 3.7 Sonnet System Card
  year: 2025
  ident: ref26
– ident: ref4
  doi: 10.1108/00251740710718935
– ident: ref17
  doi: 10.1016/j.future.2024.05.034
– ident: ref2
  doi: 10.3390/pr10091782
– ident: ref5
  doi: 10.3390/app12094369
– year: 2024
  ident: ref27
  article-title: Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
  publication-title: arXiv:2403.05530
– ident: ref3
  doi: 10.1109/ICNSurv.2013.6548579
– year: 2024
  ident: ref19
  article-title: A roadmap for software testing in open collaborative development environments
  publication-title: arXiv:2406.05438
– ident: ref16
  doi: 10.1109/SANER60148.2024.00041
– ident: ref31
  doi: 10.3390/electronics13173563
– volume-title: arXiv:2501.15383
  year: 2025
  ident: ref30
  article-title: Qwen2.5–1M technical report
– ident: ref12
  doi: 10.1145/3643733
– year: 2024
  ident: ref18
  article-title: Model-enhanced LLM-driven VUI testing of VPA apps
  publication-title: arXiv:2407.02791
– ident: ref23
  doi: 10.1007/s12599-009-0074-z
– ident: ref14
  doi: 10.1109/ACCESS.2021.3073041
– ident: ref15
  doi: 10.1109/TKDE.2022.3148237
– ident: ref8
  doi: 10.1145/3659677.3659749
– ident: ref11
  doi: 10.1145/3611643.3613083
– year: 2024
  ident: ref24
  article-title: GPT-4o system card
  publication-title: arXiv:2410.21276
– volume-title: arXiv:2412.19437
  year: 2024
  ident: ref29
  article-title: DeepSeek-V3 technical report
– volume-title: Claude 3.5 Sonnet Model Card Addendum
  year: 2024
  ident: ref25
– year: 2025
  ident: ref28
  article-title: DeepSeek-r1: Incentivizing reasoning capability in LLMs via reinforcement learning
  publication-title: arXiv:2501.12948
– ident: ref9
  doi: 10.1109/QRS60937.2023.00029
SSID ssj0000816957
Score 2.3412054
Snippet This article explores the application of Large Language Models (LLMs), including proprietary models such as OpenAI's ChatGPT 4o and ChatGPT 4o-mini,...
This article explores the application of Large Language Models (LLMs), including proprietary models such as OpenAI’s ChatGPT 4o and ChatGPT 4o-mini,...
SourceID doaj
proquest
crossref
ieee
SourceType Open Website
Aggregation Database
Enrichment Source
Index Database
Publisher
StartPage 89627
SubjectTerms Accuracy
AI-driven automation
Application programming interface
Artificial intelligence
Automation
Chatbots
Completion time
Complexity theory
Computer architecture
Flight
Indexing
Large language models
Parameters
Real time
Real-time systems
Scalability
scalability in API testing
Service oriented architecture
Software development management
Software reliability
Software systems
Software testing
software validation
SummonAdditionalLinks – databaseName: IEEE Xplore
  dbid: RIE
  link: http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1Lb9QwEB7RigMcoEARS1vkA0fSxkkcO71tV604rAqHIvVmOfYYkKps1d3t72fG8S5FFUhcIiuxHSdjex6e-QbgY4iIxMYNKzlYcMxywZgwhSJJrg8GGxwh8-f68tJcX3dfc7B6ioVBxOR8hsdcTGf5YeHXbCo7YXgzrapuB3a0bsdgra1BhTNIdEpnZCFZdifT2Yw-gnTASh3XSjMK7h_cJ4H056wqj7bixF8uXv7nyPbgRRYkxXSk_Ct4gsNreP4AXvAN3E_XqwVJpBjEFYNpDN8Fiagibw_FF4Y45ofTB4cJp2KONLtT7iIxZzdxuo4mTcF5026WqY_z4UdyHdh0Jnhfyf5f-_Dt4vxq9rnIeRYK30i5IhWSZKTGoGI0KeLgjkHdgmyClzG2JtaxcqrRsfetd87JzlMpyFi6qD1Wqn4Lu8NiwHcg-k6Z0pOMVzrZxFb2fdXHmqQ6g6i6vp5Atfn_1mcQcs6FcWOTMlJ2diSaZaLZTLQJfNo2uh0xOP5d_YwJu63KANrpBlHM5vVofdMH7VpTRgY4bJRTVSRtXcXgTfS1nsA-U_n3-zKBJ3C4mSc2r_alrYn_m4R19v4vzQ7gGQ9xtN0cwu7qbo1H8NTfr34u7z6kifwL8nvxKw
  priority: 102
  providerName: IEEE
Title Automated Testing for Service-Oriented Architecture: Leveraging Large Language Models for Enhanced Service Composition
URI https://ieeexplore.ieee.org/document/11007529
https://www.proquest.com/docview/3212851790
https://doaj.org/article/c4bd7a680f014345a52f3385fdc8fc37
Volume 13
WOSCitedRecordID wos001498048200024&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVAON
  databaseName: DOAJ Open Access Full Text
  customDbUrl:
  eissn: 2169-3536
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0000816957
  issn: 2169-3536
  databaseCode: DOA
  dateStart: 20130101
  isFulltext: true
  titleUrlDefault: https://www.doaj.org/
  providerName: Directory of Open Access Journals
– providerCode: PRVHPJ
  databaseName: ROAD: Directory of Open Access Scholarly Resources
  customDbUrl:
  eissn: 2169-3536
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0000816957
  issn: 2169-3536
  databaseCode: M~E
  dateStart: 20130101
  isFulltext: true
  titleUrlDefault: https://road.issn.org
  providerName: ISSN International Centre
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrZ07T8MwEIAtVDHAgHgUUR6VB0YCcRInNlupihhKYSgSm-U4NiChFNHCyG_nznFLERIsLFGUOA_nLr675PwdIceVsxbMuMAgx0Y4ZzlCJkzEwZMrK2Ez2yDzh8VoJO7v5e1SqS_MCWvwwM2DOzNZWRU6F7FDEl3GNU8chFXcVUY4k_p55HEhl4IpPwYLlkteBMwQi-VZr9-HHkFAmPDTlBeIxP1mijyxP5RY-TEue2NzuUk2gpdIe83dbZEVW2-T9SV24A55773NJuBu2oqOkZRRP1DwP2l496Mb5Bfjzt7Sn4JzOrSgur4wER1iDjgsm--VFIuiPU_9OQb1o88LmJ-M4qARkrva5O5yMO5fRaGIQmQyxmYQH4IDlAnLERUF5lkjsa1iWWWYc7lwqUs0zwpXmtxorZk0sFYxF2tXGJvwdJe06klt9wgtJRexAQcu1ixzOSvLpHQgCias5bJMOySZP09lAmEcC108Kx9pxFI1QlAoBBWE0CEni4NeGsDG780vUFCLpkjH9htAZ1TQGfWXznRIG8X8dT1MFeGJ7JDDudxVeJWnKgXjLjzIbP8_rn1A1rA_zVecQ9Kavb7ZI7Jq3mdP09eu12JYXn8Mun4u4ifp_PXl
linkProvider Directory of Open Access Journals
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1Lb9QwEB5BQaIcyqsV2wf4wJG0cWJvnN62q1ZFhIXDIvVmOfa4IFVZ1N3t78fjeLdFqJV6iazEdpyM7Xl45huAT84jBjauSMnBjGKWM8KEyWSQ5FqnUGAPmd9Uk4m6uKh_pGD1GAuDiNH5DA-pGM_y3cwuyVR2RPBmlSzqp_BMClHkfbjW2qRCOSRqWSVsIZ7XR6PxOHxG0AILeVjKinBw_-E_EaY_5VX5bzOOHObs1SPH9hq2kijJRj3t38AT7N7CyzsAg-_gZrRczIJMio5NCU6ju2RBSGVpg8i-E8gxPRzdOU44Zg2G-R2zF7GGHMXDtTdqMsqcdjWPfZx2v6LzwKozRjtL8gDbhp9np9PxeZYyLWRWcL4ISmSQkoRCSXhSgYcbgnVzXDjLvR8qX_rCSFH51g6tMYbXNpQc97nxlcVCljuw0c06fA-sraXKbZDycsOFH_K2LVpfBrlOIcq6LQdQrP6_tgmGnLJhXOmojuS17ommiWg6EW0An9eN_vQoHA9XPyHCrqsShHa8ESim04rUVrSuMkOVe4I4FNLIwgd9XXpnlbdlNYBtovLt-xKBB7C_mic6rfe5LoMEoCLa2e49zT7Ci_Ppt0Y3XyZf92CThttbcvZhY3G9xAN4bm8Wv-fXH-Kk_gud9_Ry
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Automated+Testing+for+Service-Oriented+Architecture%3A+Leveraging+Large+Language+Models+for+Enhanced+Service+Composition&rft.jtitle=IEEE+access&rft.au=Altin%2C+Mahsun&rft.au=Mutlu%2C+Behcet&rft.au=Kilinc%2C+Deniz&rft.au=Cakir%2C+Altan&rft.date=2025&rft.issn=2169-3536&rft.eissn=2169-3536&rft.volume=13&rft.spage=89627&rft.epage=89640&rft_id=info:doi/10.1109%2FACCESS.2025.3571994&rft.externalDBID=n%2Fa&rft.externalDocID=10_1109_ACCESS_2025_3571994
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2169-3536&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2169-3536&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2169-3536&client=summon