Correctness-guaranteed strategy synthesis and compression for multi-agent autonomous systems

Planning is a critical function of multi-agent autonomous systems, which includes path finding and task scheduling. Exhaustive search-based methods such as model checking and algorithmic game theory can solve simple instances of multi-agent planning. However, these methods suffer from state-space ex...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Science of computer programming Jg. 224; S. 102894
Hauptverfasser:	Gu, Rong, Jensen, Peter G., Seceleanu, Cristina, Enoiu, Eduard, Lundqvist, Kristina
Format:	Journal Article
Sprache:	Englisch
Veröffentlicht:	Elsevier B.V 01.12.2022
Schlagworte:	Multi-agent autonomous systems Planning Reinforcement learning Strategy compression Timed games Multi-agent autonomous systems Strategy compression Planning Reinforcement learning Timed games
ISSN:	0167-6423, 1872-7964, 1872-7964
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Abstract	Planning is a critical function of multi-agent autonomous systems, which includes path finding and task scheduling. Exhaustive search-based methods such as model checking and algorithmic game theory can solve simple instances of multi-agent planning. However, these methods suffer from state-space explosion when the number of agents is large. Learning-based methods can alleviate this problem, but lack a guarantee of correctness of the results. In this paper, we introduce MoCReL, a new version of our previously proposed method that combines model checking with reinforcement learning in solving the planning problem. The approach takes advantage of reinforcement learning to synthesize path plans and task schedules for large numbers of autonomous agents, and of model checking to verify the correctness of the synthesized strategies. Further, MoCReL can compress large strategies into smaller ones that have down to 0.05% of the original sizes, while preserving their correctness, which we show in this paper. MoCReL is integrated into a new version of Uppaal Stratego that supports calling external libraries when running learning and verification of timed games models.
AbstractList	Planning is a critical function of multi-agent autonomous systems, which includes path finding and task scheduling. Exhaustive search-based methods such as model checking and algorithmic game theory can solve simple instances of multi-agent planning. However, these methods suffer from the state-space explosion when the number of agents is large. Learning-based methods can alleviate this problem but lack a guarantee of the correctness of the results. In this paper, we introduce MoCReL, a new version of our previously proposed method that combines model checking with reinforcement learning in solving the planning problem. The approach takes advantage of reinforcement learning to synthesize path plans and task schedules for large numbers of autonomous agents, and of model checking to verify the correctness of the synthesized strategies. Further, MoCReL can compress large strategies into smaller ones that have down to 0.05% of the original sizes, while preserving their correctness, which we show in this paper. MoCReL is integrated into a new version of UPPAAL Stratego that supports calling external libraries when running learning and verification of timed games models. Planning is a critical function of multi-agent autonomous systems, which includes path finding and task scheduling. Exhaustive search-based methods such as model checking and algorithmic game theory can solve simple instances of multi-agent planning. However, these methods suffer from state-space explosion when the number of agents is large. Learning-based methods can alleviate this problem, but lack a guarantee of correctness of the results. In this paper, we introduce MoCReL, a new version of our previously proposed method that combines model checking with reinforcement learning in solving the planning problem. The approach takes advantage of reinforcement learning to synthesize path plans and task schedules for large numbers of autonomous agents, and of model checking to verify the correctness of the synthesized strategies. Further, MoCReL can compress large strategies into smaller ones that have down to 0.05% of the original sizes, while preserving their correctness, which we show in this paper. MoCReL is integrated into a new version of Uppaal Stratego that supports calling external libraries when running learning and verification of timed games models.
ArticleNumber	102894
Author	Gu, Rong Seceleanu, Cristina Enoiu, Eduard Jensen, Peter G. Lundqvist, Kristina
Author_xml	– sequence: 1 givenname: Rong orcidid: 0000-0003-0570-6005 surname: Gu fullname: Gu, Rong email: rong.gu@mdu.se organization: Mälardalen University, Sweden – sequence: 2 givenname: Peter G. surname: Jensen fullname: Jensen, Peter G. email: pgj@cs.aau.dk organization: Aalborg University, Denmark – sequence: 3 givenname: Cristina surname: Seceleanu fullname: Seceleanu, Cristina email: cristina.seceleanu@mdu.se organization: Mälardalen University, Sweden – sequence: 4 givenname: Eduard surname: Enoiu fullname: Enoiu, Eduard email: eduard.paul.enoiu@mdu.se organization: Mälardalen University, Sweden – sequence: 5 givenname: Kristina surname: Lundqvist fullname: Lundqvist, Kristina email: kristina.lundqvist@mdu.se organization: Mälardalen University, Sweden
BackLink	https://urn.kb.se/resolve?urn=urn:nbn:se:mdh:diva-58045$$DView record from Swedish Publication Index (Mälardalens högskola)
BookMark	eNp9kLtOwzAUQC1UJNrCF7DkA0ixncSJB4aqPKVKLMCEZLnOTeqqsSvbAfXvcQgsDEyWrs651j0zNDHWAEKXBC8IJux6t_BKK7ugmNI4oRXPT9CUVCVNS87yCZpGqkxZTrMzNPN-hzFmeUmm6H1lnQMVDHiftr100gSAOvHByQDtMfFHE7bgtU-kqRNlu4OLqLYmaaxLun4fdCpbMCGRfbDGdrb3UfIBOn-OThu593Dx887R6_3dy-oxXT8_PK2W61RlVRHSHDOeFxLXmEnKVSarmmeYkoYpzAtGCKgNjmRV1lzFcZE3krNNUbJKUkaabI6uxr3-Ew79Rhyc7qQ7Ciu1uNVvS2FdK7p6K4oK50XE-YgrZ7130Ailgwzxpni03guCxVBV7MR3VTFUFWPV6GZ_3N_P_rduRgtihQ8NbmDAKKj1EF_UVv_rfwEFjZaR
CitedBy_id	crossref_primary_10_3390_s25113484 crossref_primary_10_1002_spe_3372 crossref_primary_10_1145_3672445
Cites_doi	10.1007/s42979-020-00326-5 10.1287/mnsc.35.11.1367 10.2514/1.G003724 10.1016/S0921-8890(98)00085-2 10.1016/j.scico.2022.102809 10.1016/j.tcs.2005.11.018 10.1007/s100090050010 10.1007/s10009-022-00657-z 10.1016/S1474-6670(17)32043-8 10.1007/s00236-016-0280-3 10.1007/s00165-020-00509-0 10.1016/0304-3975(94)90010-8 10.1016/j.arcontrol.2018.07.002 10.1007/s10994-020-05938-9 10.1016/j.simpat.2014.10.007
ContentType	Journal Article
Copyright	2022 The Author(s)
Copyright_xml	– notice: 2022 The Author(s)
DBID	6I. AAFTH AAYXX CITATION ABGEM ADTPV AOWAS D8T DF7 ZZAVC
DOI	10.1016/j.scico.2022.102894
DatabaseName	ScienceDirect Open Access Titles Elsevier:ScienceDirect:Open Access CrossRef SWEPUB Mälardalens högskola full text SwePub SwePub Articles SWEPUB Freely available online SWEPUB Mälardalens högskola SwePub Articles full text
DatabaseTitle	CrossRef
DatabaseTitleList
DeliveryMethod	fulltext_linktorsrc
Discipline	Computer Science
EISSN	1872-7964
ExternalDocumentID	oai_DiVA_org_mdh_58045 10_1016_j_scico_2022_102894 S0167642322001277
GroupedDBID	--K --M .DC .~1 0R~ 123 1B1 1RT 1~. 1~5 4.4 457 4G. 5VS 6I. 7-5 71M 8P~ 9JN AACTN AAEDW AAFTH AAIAV AAIKJ AAKOC AALRI AAOAW AAQFI AAXUO AAYFN ABBOA ABJNI ABMAC ABYKQ ACDAQ ACGFS ACRLP ACZNC ADBBV ADEZE AEBSH AEKER AENEX AFKWA AFTJW AGUBO AGYEJ AHHHB AHZHX AIALX AIEXJ AIKHN AITUG AJOXV ALMA_UNASSIGNED_HOLDINGS AMFUW AMRAJ AOUOD AXJTR BKOJK BLXMC CS3 DU5 E.L EBS EFJIC EFLBG EO8 EO9 EP2 EP3 FDB FEDTE FIRID FNPLU FYGXN G-Q GBLVA GBOLZ HVGLF IHE IXB J1W KOM LG9 M26 M41 MO0 N9A O-L O9- OAUVE OK1 OZT P-8 P-9 P2P PC. Q38 ROL RPZ SDF SDG SDP SES SPC SPCBC SSV T5K TN5 XPP ZMT ~G- 9DU AAEDT AAQXK AATTM AAXKI AAYWO AAYXX ABFNM ABWVN ABXDB ACLOT ACNNM ACRPL ACVFH ADCNI ADHUB ADMUD ADNMO ADVLN AEIPS AEUPX AEXQZ AFFNX AFJKZ AFPUW AGHFR AGQPQ AIGII AIIUN AKBMS AKRWK AKYEP ANKPU APXCP ASPBG AVWKF AZFZN CITATION EFKBS EJD FGOYB G-2 HZ~ R2- SEW SSZ WUQ ZY4 ~HD ABGEM ADTPV AOWAS D8T DF7 ZZAVC
ID	FETCH-LOGICAL-c385t-406945a0d06a29c3a8d93021f6c095611ecb038587d9c21f54fa96b5768a261f3
ISICitedReferencesCount	11
ISICitedReferencesURI	http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000878770800001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN	0167-6423 1872-7964
IngestDate	Tue Nov 04 16:16:39 EST 2025 Tue Nov 18 22:27:57 EST 2025 Sat Nov 29 07:23:21 EST 2025 Fri Feb 23 02:38:50 EST 2024
IsDoiOpenAccess	true
IsOpenAccess	true
IsPeerReviewed	true
IsScholarly	true
Keywords	Multi-agent autonomous systems Strategy compression Planning Reinforcement learning Timed games
Language	English
License	This is an open access article under the CC BY license.
LinkModel	OpenURL
MergedId	FETCHMERGED-LOGICAL-c385t-406945a0d06a29c3a8d93021f6c095611ecb038587d9c21f54fa96b5768a261f3
ORCID	0000-0003-0570-6005
OpenAccessLink	https://urn.kb.se/resolve?urn=urn:nbn:se:mdh:diva-58045
ParticipantIDs	swepub_primary_oai_DiVA_org_mdh_58045 crossref_citationtrail_10_1016_j_scico_2022_102894 crossref_primary_10_1016_j_scico_2022_102894 elsevier_sciencedirect_doi_10_1016_j_scico_2022_102894
PublicationCentury	2000
PublicationDate	2022-12-01
PublicationDateYYYYMMDD	2022-12-01
PublicationDate_xml	– month: 12 year: 2022 text: 2022-12-01 day: 01
PublicationDecade	2020
PublicationTitle	Science of computer programming
PublicationYear	2022
Publisher	Elsevier B.V
Publisher_xml	– name: Elsevier B.V
References	Tomita, Ueno, Shimakawa, Hagihara, Yonezaki (br0310) 2017; 54 Kempf, Bozga, Maler (br0050) 2013 David, Du, Larsen, Legay, Mikučionis, Poulsen, Sedwards (br0170) 2012 Julian, Kochenderfer, Owen (br0260) 2019; 42 Bersani, Soldo, Menghi, Pelliccione, Rossi (br0140) 2020 Křetínskỳ (br0380) 2021 Watkins (br0240) 1989 Larsen, Le Coënt, Mikučionis, Taankvist (br0060) 2018 Baier, Katoen (br0210) 2008 Cormen, Leiserson, Rivest, Stein (br0270) 2009 Behrmann, David, Fleury, Larsen, Lime, Nantes (br0300) 2007 Gu, Enoiu, Seceleanu, Lundqvist (br0120) 2020 Ashok, Jackermeier, Křetínskỳ, Weinhuber, Weininger, Yadav (br0480) 2021 Alur, Moarref, Topcu (br0370) 2016 Zhang, Dietterich (br0070) 1996; 8 Sutton, Barto (br0230) 2018 Andersen, Jensen, Bak, Quottrup (br0400) 2004; 37 Jegourel, Larsen, Legay, Mikučionis, Poulsen, Sedwards (br0280) 2016 Bengtsson, Yi (br0160) 2004 Larsen, Pettersson, Yi (br0220) 1997; 1 Frank, Mannor, Precup (br0340) 2008 Basile, ter Beek, Legay (br0410) 2020 David, Jensen, Larsen, Legay, Lime, Sørensen, Taankvist (br0200) 2014 Gleirscher, Calinescu, Douthwaite, Lesage, Paterson, Aitken, Alexander, Law (br0390) 2022; 218 Julian, Kochenderfer (br0460) 2019 Morio, Balesdent, Jacquemart, Vergé (br0330) 2014; 49 Bouton, Karlsson, Nakhaei, Fujimura, Kochenderfer, Tumova (br0430) 2019 Abdeddaı, Asarin, Maler (br0110) 2006; 354 Bouton, Cosgun, Kochenderfer (br0100) 2017 Hanna, Niekum, Stone (br0360) 2021; 110 LaValle (br0040) 1998 Behrmann, Larsen, Rasmussen (br0320) 2005 Gu, Jensen, Poulsen, Seceleanu, Enoiu, Lundqvist (br0130) 2022; 24 Oliveira, Fischer, Stepankova (br0010) 1999; 27 Atyabi, MahmoudZadeh, Nefti-Meziani (br0250) 2018; 46 Rabin (br0030) 2000 Behjati, Sirjani, Ahmadabadi (br0420) 2009 Cassez, David, Fleury, Larsen, Lime (br0180) 2005 Jaeger, Jensen, Larsen, Legay, Sedwards, Taankvist (br0290) 2019 Shyalika, Silva, Karunananda (br0080) 2020; 1 Julian, Lopez, Brush, Owen, Kochenderfer (br0450) 2016 Jothimurugan, Bansal, Bastani, Alur (br0440) 2021; 34 Glynn, Iglehart (br0350) 1989; 35 Piterman, Pnueli, Sa'ar (br0490) 2006 David, Jensen, Larsen, Mikučionis, Taankvist (br0150) 2015 Chandler, Pachter (br0020) 1998 Alur, Dill (br0190) 1994; 126 Ashok, Jackermeier, Jagtap, Křetínskỳ, Weininger, Zamani (br0470) 2020 Gu, Enoiu, Seceleanu (br0090) 2019 Behjati (10.1016/j.scico.2022.102894_br0420) 2009 Frank (10.1016/j.scico.2022.102894_br0340) 2008 Julian (10.1016/j.scico.2022.102894_br0450) 2016 Julian (10.1016/j.scico.2022.102894_br0260) 2019; 42 Gu (10.1016/j.scico.2022.102894_br0090) 2019 Bouton (10.1016/j.scico.2022.102894_br0430) Sutton (10.1016/j.scico.2022.102894_br0230) 2018 Jaeger (10.1016/j.scico.2022.102894_br0290) 2019 Alur (10.1016/j.scico.2022.102894_br0370) 2016 Larsen (10.1016/j.scico.2022.102894_br0220) 1997; 1 Basile (10.1016/j.scico.2022.102894_br0410) 2020 Bouton (10.1016/j.scico.2022.102894_br0100) 2017 Gleirscher (10.1016/j.scico.2022.102894_br0390) 2022; 218 Oliveira (10.1016/j.scico.2022.102894_br0010) 1999; 27 Křetínskỳ (10.1016/j.scico.2022.102894_br0380) Abdeddaı (10.1016/j.scico.2022.102894_br0110) 2006; 354 Rabin (10.1016/j.scico.2022.102894_br0030) 2000 Piterman (10.1016/j.scico.2022.102894_br0490) 2006 Ashok (10.1016/j.scico.2022.102894_br0470) 2020 Gu (10.1016/j.scico.2022.102894_br0120) 2020 Hanna (10.1016/j.scico.2022.102894_br0360) 2021; 110 Shyalika (10.1016/j.scico.2022.102894_br0080) 2020; 1 Cormen (10.1016/j.scico.2022.102894_br0270) 2009 Kempf (10.1016/j.scico.2022.102894_br0050) 2013 Alur (10.1016/j.scico.2022.102894_br0190) 1994; 126 Atyabi (10.1016/j.scico.2022.102894_br0250) 2018; 46 Chandler (10.1016/j.scico.2022.102894_br0020) 1998 Larsen (10.1016/j.scico.2022.102894_br0060) 2018 Andersen (10.1016/j.scico.2022.102894_br0400) 2004; 37 David (10.1016/j.scico.2022.102894_br0170) Glynn (10.1016/j.scico.2022.102894_br0350) 1989; 35 Bengtsson (10.1016/j.scico.2022.102894_br0160) 2004 Tomita (10.1016/j.scico.2022.102894_br0310) 2017; 54 Gu (10.1016/j.scico.2022.102894_br0130) 2022; 24 Zhang (10.1016/j.scico.2022.102894_br0070) 1996; 8 Baier (10.1016/j.scico.2022.102894_br0210) 2008 Morio (10.1016/j.scico.2022.102894_br0330) 2014; 49 Ashok (10.1016/j.scico.2022.102894_br0480) 2021 Watkins (10.1016/j.scico.2022.102894_br0240) 1989 Julian (10.1016/j.scico.2022.102894_br0460) 2019 David (10.1016/j.scico.2022.102894_br0150) 2015 LaValle (10.1016/j.scico.2022.102894_br0040) 1998 Jegourel (10.1016/j.scico.2022.102894_br0280) 2016 Cassez (10.1016/j.scico.2022.102894_br0180) 2005 David (10.1016/j.scico.2022.102894_br0200) 2014 Jothimurugan (10.1016/j.scico.2022.102894_br0440) 2021; 34 Bersani (10.1016/j.scico.2022.102894_br0140) 2020 Behrmann (10.1016/j.scico.2022.102894_br0320) 2005 Behrmann (10.1016/j.scico.2022.102894_br0300) 2007
References_xml	– start-page: 292 year: 2009 end-page: 307 ident: br0420 article-title: Bounded rational search for on-the-fly model checking of LTL properties publication-title: FSE – year: 2019 ident: br0430 article-title: Reinforcement learning with probabilistic guarantees for autonomous driving – year: 1998 ident: br0040 article-title: Rapidly-exploring random trees: a new tool for path planning – volume: 126 start-page: 183 year: 1994 end-page: 235 ident: br0190 article-title: A theory of timed automata publication-title: Theor. Comput. Sci. – year: 2017 ident: br0100 article-title: Belief state planning for autonomously navigating urban intersections publication-title: Intelligent Vehicles Symposium – volume: 354 start-page: 272 year: 2006 end-page: 300 ident: br0110 article-title: Scheduling with timed automata publication-title: Theor. Comput. Sci. – year: 1998 ident: br0020 article-title: Research issues in autonomous control of tactical UAVs publication-title: Proceedings of the 1998 American Control Conference. ACC (IEEE Cat. No. 98CH36207) – start-page: 121 year: 2007 end-page: 125 ident: br0300 article-title: UPPAAL-Tiga: time for playing games! (tool paper) publication-title: Proceedings of the 2007 Computer Aided Verification – start-page: 1 year: 2016 end-page: 10 ident: br0450 article-title: Policy compression for aircraft collision avoidance systems publication-title: 2016 IEEE/AIAA 35th Digital Avionics Systems Conference (DASC) – volume: 46 start-page: 196 year: 2018 end-page: 215 ident: br0250 article-title: Current advancements on autonomous mission planning and management systems: an AUV and UAV perspective publication-title: Annu. Rev. Control – start-page: 113 year: 2018 end-page: 133 ident: br0060 article-title: Guaranteed control synthesis for continuous systems in UPPAAL Tiga publication-title: Cyber Physical Systems. Model-Based Design – year: 1989 ident: br0240 article-title: Learning from Delayed Rewards – volume: 8 start-page: 1024 year: 1996 end-page: 1030 ident: br0070 article-title: High-performance job-shop scheduling with a timedelay TD( publication-title: Adv. Neural Inf. Process. Syst. – volume: 49 start-page: 287 year: 2014 end-page: 304 ident: br0330 article-title: A survey of rare event simulation methods for static input–output models publication-title: Simul. Model. Pract. Theory – start-page: 364 year: 2006 end-page: 380 ident: br0490 article-title: Synthesis of reactive (1) designs publication-title: International Workshop on Verification, Model Checking, and Abstract Interpretation – start-page: 87 year: 2004 end-page: 124 ident: br0160 article-title: Timed automata: semantics, algorithms and tools publication-title: Lectures on Concurrency and Petri Nets: Advances in Petri Nets – volume: 37 start-page: 597 year: 2004 end-page: 602 ident: br0400 article-title: Motion planning in multi-robot systems using timed automata publication-title: IFAC Proc. Vol. – start-page: 81 year: 2005 end-page: 94 ident: br0320 article-title: Beyond liveness: efficient parameter synthesis for time bounded liveness publication-title: International Conference on Formal Modeling and Analysis of Timed Systems – start-page: 385 year: 2013 end-page: 400 ident: br0050 article-title: As soon as probable: optimal scheduling under stochastic uncertainty publication-title: International Conference on Tools and Algorithms for the Construction and Analysis of Systems – year: 2021 ident: br0380 article-title: LTL-constrained steady-state policy synthesis – volume: 27 start-page: 91 year: 1999 end-page: 106 ident: br0010 article-title: Multi-agent systems: which research for which applications publication-title: Robot. Auton. Syst. – start-page: 251 year: 2016 end-page: 269 ident: br0370 article-title: Compositional synthesis of reactive controllers for multi-agent systems publication-title: International Conference on Computer Aided Verification – start-page: 73 year: 2020 end-page: 92 ident: br0120 article-title: Verifiable and scalable mission-plan synthesis for multiple autonomous agents publication-title: 25th International Conference on Formal Methods for Industrial Critical Systems – year: 2015 ident: br0150 article-title: UPPAAL stratego publication-title: TACAS 2015: International Conference on Tools and Algorithms for the Construction and Analysis of Systems – year: 2008 ident: br0210 article-title: Principles of Model Checking – volume: 218 year: 2022 ident: br0390 article-title: Verified synthesis of optimal safety controllers for human-robot collaboration publication-title: Sci. Comput. Program. – start-page: 336 year: 2008 end-page: 343 ident: br0340 article-title: Reinforcement learning in the presence of rare events publication-title: Proceedings of the 25th International Conference on Machine Learning – start-page: 1 year: 2020 end-page: 7 ident: br0470 article-title: dtControl: decision tree learning algorithms for controller representation publication-title: Proceedings of the 23rd International Conference on Hybrid Systems: Computation and Control – year: 2009 ident: br0270 article-title: Introduction to Algorithms – volume: 1 start-page: 1 year: 2020 end-page: 17 ident: br0080 article-title: Reinforcement learning in dynamic task scheduling: a review publication-title: SN Comput. Sci. – start-page: 129 year: 2014 end-page: 145 ident: br0200 article-title: On time with minimal expected cost! publication-title: International Symposium on Automated Technology for Verification and Analysis – volume: 110 start-page: 1267 year: 2021 end-page: 1317 ident: br0360 article-title: Importance sampling in reinforcement learning with an estimated behavior policy publication-title: Mach. Learn. – volume: 42 start-page: 598 year: 2019 end-page: 608 ident: br0260 article-title: Deep neural network compression for aircraft collision avoidance systems publication-title: J. Guid. Control Dyn. – start-page: 163 year: 2016 end-page: 178 ident: br0280 article-title: Importance sampling for stochastic timed automata publication-title: International Symposium on Dependable Software Engineering: Theories, Tools, and Applications – start-page: 66 year: 2005 end-page: 80 ident: br0180 article-title: Efficient on-the-fly algorithms for the analysis of timed games publication-title: CONCUR 2005: International Conference on Concurrency Theory – year: 2018 ident: br0230 article-title: Reinforcement Learning: An Introduction – start-page: 1 year: 2019 end-page: 10 ident: br0460 article-title: Guaranteeing safety for neural network-based aircraft collision avoidance systems publication-title: 2019 IEEE/AIAA 38th Digital Avionics Systems Conference (DASC) – volume: 24 year: 2022 ident: br0130 article-title: Verifiable strategy synthesis for multiple autonomous agents: a scalable approach publication-title: Int. J. Softw. Tools Technol. Transf. – start-page: 81 year: 2019 end-page: 97 ident: br0290 article-title: Teaching stratego to play ball: optimal synthesis for continuous space MDPs publication-title: International Symposium on Automated Technology for Verification and Analysis – start-page: 326 year: 2021 end-page: 345 ident: br0480 article-title: dtControl 2.0: explainable strategy representation via decision tree learning steered by experts publication-title: International Conference on Tools and Algorithms for the Construction and Analysis of Systems – volume: 1 start-page: 134 year: 1997 end-page: 152 ident: br0220 article-title: UPPAAL in a nutshell publication-title: Int. J. Softw. Tools Technol. Transf. – year: 2019 ident: br0090 article-title: TAMAA: UPPAAL-based mission planning for autonomous agents publication-title: 35th ACM/SIGAPP Symposium on Applied Computing SAC2020 – volume: 35 start-page: 1367 year: 1989 end-page: 1392 ident: br0350 article-title: Importance sampling for stochastic simulations publication-title: Manag. Sci. – year: 2020 ident: br0410 article-title: Strategy synthesis for autonomous driving in a moving block railway system with uppaal stratego publication-title: International Conference on Formal Techniques for Distributed Objects, Components, and Systems – volume: 54 start-page: 655 year: 2017 end-page: 692 ident: br0310 article-title: Safraless LTL synthesis considering maximal realizability publication-title: Acta Inform. – year: 2012 ident: br0170 article-title: Statistical model checking for stochastic hybrid systems – year: 2020 ident: br0140 article-title: PuRSUE-from specification of robotic environments to synthesis of controllers publication-title: Form. Asp. Comput. – volume: 34 year: 2021 ident: br0440 article-title: Compositional reinforcement learning from logical specifications publication-title: Adv. Neural Inf. Process. Syst. – year: 2000 ident: br0030 article-title: A* aesthetic optimizations publication-title: Game Programming Gems – volume: 1 start-page: 1 issue: 6 year: 2020 ident: 10.1016/j.scico.2022.102894_br0080 article-title: Reinforcement learning in dynamic task scheduling: a review publication-title: SN Comput. Sci. doi: 10.1007/s42979-020-00326-5 – year: 2009 ident: 10.1016/j.scico.2022.102894_br0270 – ident: 10.1016/j.scico.2022.102894_br0170 – start-page: 113 year: 2018 ident: 10.1016/j.scico.2022.102894_br0060 article-title: Guaranteed control synthesis for continuous systems in UPPAAL Tiga – ident: 10.1016/j.scico.2022.102894_br0380 – year: 2000 ident: 10.1016/j.scico.2022.102894_br0030 article-title: A* aesthetic optimizations – start-page: 1 year: 2019 ident: 10.1016/j.scico.2022.102894_br0460 article-title: Guaranteeing safety for neural network-based aircraft collision avoidance systems – year: 1998 ident: 10.1016/j.scico.2022.102894_br0040 – year: 1989 ident: 10.1016/j.scico.2022.102894_br0240 – volume: 35 start-page: 1367 issue: 11 year: 1989 ident: 10.1016/j.scico.2022.102894_br0350 article-title: Importance sampling for stochastic simulations publication-title: Manag. Sci. doi: 10.1287/mnsc.35.11.1367 – start-page: 326 year: 2021 ident: 10.1016/j.scico.2022.102894_br0480 article-title: dtControl 2.0: explainable strategy representation via decision tree learning steered by experts – start-page: 1 year: 2016 ident: 10.1016/j.scico.2022.102894_br0450 article-title: Policy compression for aircraft collision avoidance systems – volume: 42 start-page: 598 issue: 3 year: 2019 ident: 10.1016/j.scico.2022.102894_br0260 article-title: Deep neural network compression for aircraft collision avoidance systems publication-title: J. Guid. Control Dyn. doi: 10.2514/1.G003724 – start-page: 81 year: 2019 ident: 10.1016/j.scico.2022.102894_br0290 article-title: Teaching stratego to play ball: optimal synthesis for continuous space MDPs – start-page: 87 year: 2004 ident: 10.1016/j.scico.2022.102894_br0160 article-title: Timed automata: semantics, algorithms and tools – start-page: 163 year: 2016 ident: 10.1016/j.scico.2022.102894_br0280 article-title: Importance sampling for stochastic timed automata – volume: 27 start-page: 91 issue: 1–2 year: 1999 ident: 10.1016/j.scico.2022.102894_br0010 article-title: Multi-agent systems: which research for which applications publication-title: Robot. Auton. Syst. doi: 10.1016/S0921-8890(98)00085-2 – start-page: 292 year: 2009 ident: 10.1016/j.scico.2022.102894_br0420 article-title: Bounded rational search for on-the-fly model checking of LTL properties – start-page: 81 year: 2005 ident: 10.1016/j.scico.2022.102894_br0320 article-title: Beyond liveness: efficient parameter synthesis for time bounded liveness – volume: 218 year: 2022 ident: 10.1016/j.scico.2022.102894_br0390 article-title: Verified synthesis of optimal safety controllers for human-robot collaboration publication-title: Sci. Comput. Program. doi: 10.1016/j.scico.2022.102809 – year: 2008 ident: 10.1016/j.scico.2022.102894_br0210 – volume: 354 start-page: 272 issue: 2 year: 2006 ident: 10.1016/j.scico.2022.102894_br0110 article-title: Scheduling with timed automata publication-title: Theor. Comput. Sci. doi: 10.1016/j.tcs.2005.11.018 – volume: 8 start-page: 1024 year: 1996 ident: 10.1016/j.scico.2022.102894_br0070 article-title: High-performance job-shop scheduling with a timedelay TD(λ) network publication-title: Adv. Neural Inf. Process. Syst. – volume: 1 start-page: 134 issue: 1–2 year: 1997 ident: 10.1016/j.scico.2022.102894_br0220 article-title: UPPAAL in a nutshell publication-title: Int. J. Softw. Tools Technol. Transf. doi: 10.1007/s100090050010 – year: 2015 ident: 10.1016/j.scico.2022.102894_br0150 article-title: UPPAAL stratego – start-page: 129 year: 2014 ident: 10.1016/j.scico.2022.102894_br0200 article-title: On time with minimal expected cost! – volume: 24 issue: 3 year: 2022 ident: 10.1016/j.scico.2022.102894_br0130 article-title: Verifiable strategy synthesis for multiple autonomous agents: a scalable approach publication-title: Int. J. Softw. Tools Technol. Transf. doi: 10.1007/s10009-022-00657-z – volume: 37 start-page: 597 issue: 8 year: 2004 ident: 10.1016/j.scico.2022.102894_br0400 article-title: Motion planning in multi-robot systems using timed automata publication-title: IFAC Proc. Vol. doi: 10.1016/S1474-6670(17)32043-8 – start-page: 1 year: 2020 ident: 10.1016/j.scico.2022.102894_br0470 article-title: dtControl: decision tree learning algorithms for controller representation – ident: 10.1016/j.scico.2022.102894_br0430 – volume: 54 start-page: 655 issue: 7 year: 2017 ident: 10.1016/j.scico.2022.102894_br0310 article-title: Safraless LTL synthesis considering maximal realizability publication-title: Acta Inform. doi: 10.1007/s00236-016-0280-3 – start-page: 364 year: 2006 ident: 10.1016/j.scico.2022.102894_br0490 article-title: Synthesis of reactive (1) designs – start-page: 121 year: 2007 ident: 10.1016/j.scico.2022.102894_br0300 article-title: UPPAAL-Tiga: time for playing games! (tool paper) – year: 2020 ident: 10.1016/j.scico.2022.102894_br0140 article-title: PuRSUE-from specification of robotic environments to synthesis of controllers publication-title: Form. Asp. Comput. doi: 10.1007/s00165-020-00509-0 – start-page: 251 year: 2016 ident: 10.1016/j.scico.2022.102894_br0370 article-title: Compositional synthesis of reactive controllers for multi-agent systems – volume: 126 start-page: 183 year: 1994 ident: 10.1016/j.scico.2022.102894_br0190 article-title: A theory of timed automata publication-title: Theor. Comput. Sci. doi: 10.1016/0304-3975(94)90010-8 – year: 2018 ident: 10.1016/j.scico.2022.102894_br0230 – volume: 46 start-page: 196 year: 2018 ident: 10.1016/j.scico.2022.102894_br0250 article-title: Current advancements on autonomous mission planning and management systems: an AUV and UAV perspective publication-title: Annu. Rev. Control doi: 10.1016/j.arcontrol.2018.07.002 – year: 1998 ident: 10.1016/j.scico.2022.102894_br0020 article-title: Research issues in autonomous control of tactical UAVs – start-page: 385 year: 2013 ident: 10.1016/j.scico.2022.102894_br0050 article-title: As soon as probable: optimal scheduling under stochastic uncertainty – volume: 110 start-page: 1267 issue: 6 year: 2021 ident: 10.1016/j.scico.2022.102894_br0360 article-title: Importance sampling in reinforcement learning with an estimated behavior policy publication-title: Mach. Learn. doi: 10.1007/s10994-020-05938-9 – start-page: 73 year: 2020 ident: 10.1016/j.scico.2022.102894_br0120 article-title: Verifiable and scalable mission-plan synthesis for multiple autonomous agents – year: 2020 ident: 10.1016/j.scico.2022.102894_br0410 article-title: Strategy synthesis for autonomous driving in a moving block railway system with uppaal stratego – start-page: 66 year: 2005 ident: 10.1016/j.scico.2022.102894_br0180 article-title: Efficient on-the-fly algorithms for the analysis of timed games – year: 2017 ident: 10.1016/j.scico.2022.102894_br0100 article-title: Belief state planning for autonomously navigating urban intersections – year: 2019 ident: 10.1016/j.scico.2022.102894_br0090 article-title: TAMAA: UPPAAL-based mission planning for autonomous agents – volume: 49 start-page: 287 year: 2014 ident: 10.1016/j.scico.2022.102894_br0330 article-title: A survey of rare event simulation methods for static input–output models publication-title: Simul. Model. Pract. Theory doi: 10.1016/j.simpat.2014.10.007 – volume: 34 year: 2021 ident: 10.1016/j.scico.2022.102894_br0440 article-title: Compositional reinforcement learning from logical specifications publication-title: Adv. Neural Inf. Process. Syst. – start-page: 336 year: 2008 ident: 10.1016/j.scico.2022.102894_br0340 article-title: Reinforcement learning in the presence of rare events
SSID	ssj0006471
Score	2.4133368
Snippet	Planning is a critical function of multi-agent autonomous systems, which includes path finding and task scheduling. Exhaustive search-based methods such as...
SourceID	swepub crossref elsevier
SourceType	Open Access Repository Enrichment Source Index Database Publisher
StartPage	102894
SubjectTerms	Multi-agent autonomous systems Planning Reinforcement learning Strategy compression Timed games
Title	Correctness-guaranteed strategy synthesis and compression for multi-agent autonomous systems
URI	https://dx.doi.org/10.1016/j.scico.2022.102894 https://urn.kb.se/resolve?urn=urn:nbn:se:mdh:diva-58045
Volume	224
WOSCitedRecordID	wos000878770800001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
journalDatabaseRights	– providerCode: PRVESC databaseName: Elsevier SD Freedom Collection Journals 2021 customDbUrl: eissn: 1872-7964 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0006471 issn: 0167-6423 databaseCode: AIEXJ dateStart: 20211209 isFulltext: true titleUrlDefault: https://www.sciencedirect.com providerName: Elsevier
link	http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1Lb9QwELag5cCFN6JQkA9wKq428caJj9F2W6hQxaGgPSBZieOUrai32mZR-ffM-JGNxFKVA5do5Y1jaebLeGbyeYaQt4InadImho0aLtg4TSWrtc5ZLkxjmqZtK19n9lN-clLMZvJzaHd05doJ5NYW19fy8r-qGsZA2Xh09h_U3T8UBuA3KB2uoHa43krxE-y3oTs0YQwBgKJzXqUrCgGm4pcFpw_rkGDOHO2Bp8J6xqE7kMtKPHC1V646PPKAJNlhYfPgykarEHjp2Boisr0u4n6IxJ6VJ3CvR46NDWkfRw7eO9rv0zxGwy5YWTdl4syP7beNqV3M3R_TBmG95jSGnEWaDvgfIY0J5hkiH77RiPt8wjmE9_Ay7ON8LDBR-GbIG6pjH8y_lmqxPFMXzXeVFeCk3iXbaZ5JsHLb5cfp7Ljfm4UPwfv1MRwvcogxpBjHmlSO_ffH2n_1W4YFZp1TcvqIPAjRBC09Ch6TO8Y-IQ9jpw4aVPSUfNsMChpBQXtQUAAFHYCCAijoABR0DQoaQPGMfDmcnk4-sNBXg2leZB1zh52zatSMRJVKzauikRx8vVZoLEuZJEbX7oNx3kgNw9m4raSoMTKtIOBu-XOyZRfWvCC0BYlq2DVaUfMxhOpFDfECREKG60rwWu6QNEpN6VB0Hnuf_FCRXXiunKgVilp5Ue-Q9_2kS19z5ebbRVSHCm6jdwcVAOrmie-88vpVNqPp5S3ve0Xur8G-S7a65cq8Jvf0z25-tXwToPgbsBueSQ
linkProvider	Elsevier
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Correctness-Guaranteed+Strategy+Synthesis+and+Compression+for+Multi-Agent+Autonomous+Systems&rft.jtitle=Science+of+computer+programming&rft.au=Gu%2C+Rong&rft.au=Jensen%2C+Peter+G.&rft.au=Seceleanu%2C+Cristina&rft.au=Enoiu%2C+Eduard+Paul&rft.date=2022-12-01&rft.issn=0167-6423&rft_id=info:doi/10.1016%2Fj.scico.2022.102894&rft.externalDocID=oai_DiVA_org_mdh_58045
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0167-6423&client=summon
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0167-6423&client=summon
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0167-6423&client=summon