Correctness-guaranteed strategy synthesis and compression for multi-agent autonomous systems

Planning is a critical function of multi-agent autonomous systems, which includes path finding and task scheduling. Exhaustive search-based methods such as model checking and algorithmic game theory can solve simple instances of multi-agent planning. However, these methods suffer from state-space ex...

Full description

Saved in:
Bibliographic Details
Published in:Science of computer programming Vol. 224; p. 102894
Main Authors: Gu, Rong, Jensen, Peter G., Seceleanu, Cristina, Enoiu, Eduard, Lundqvist, Kristina
Format: Journal Article
Language:English
Published: Elsevier B.V 01.12.2022
Subjects:
ISSN:0167-6423, 1872-7964, 1872-7964
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Abstract Planning is a critical function of multi-agent autonomous systems, which includes path finding and task scheduling. Exhaustive search-based methods such as model checking and algorithmic game theory can solve simple instances of multi-agent planning. However, these methods suffer from state-space explosion when the number of agents is large. Learning-based methods can alleviate this problem, but lack a guarantee of correctness of the results. In this paper, we introduce MoCReL, a new version of our previously proposed method that combines model checking with reinforcement learning in solving the planning problem. The approach takes advantage of reinforcement learning to synthesize path plans and task schedules for large numbers of autonomous agents, and of model checking to verify the correctness of the synthesized strategies. Further, MoCReL can compress large strategies into smaller ones that have down to 0.05% of the original sizes, while preserving their correctness, which we show in this paper. MoCReL is integrated into a new version of Uppaal Stratego that supports calling external libraries when running learning and verification of timed games models.
AbstractList Planning is a critical function of multi-agent autonomous systems, which includes path finding and task scheduling. Exhaustive search-based methods such as model checking and algorithmic game theory can solve simple instances of multi-agent planning. However, these methods suffer from the state-space explosion when the number of agents is large. Learning-based methods can alleviate this problem but lack a guarantee of the correctness of the results. In this paper, we introduce MoCReL, a new version of our previously proposed method that combines model checking with reinforcement learning in solving the planning problem. The approach takes advantage of reinforcement learning to synthesize path plans and task schedules for large numbers of autonomous agents, and of model checking to verify the correctness of the synthesized strategies. Further, MoCReL can compress large strategies into smaller ones that have down to 0.05% of the original sizes, while preserving their correctness, which we show in this paper. MoCReL is integrated into a new version of UPPAAL Stratego that supports calling external libraries when running learning and verification of timed games models. 
Planning is a critical function of multi-agent autonomous systems, which includes path finding and task scheduling. Exhaustive search-based methods such as model checking and algorithmic game theory can solve simple instances of multi-agent planning. However, these methods suffer from state-space explosion when the number of agents is large. Learning-based methods can alleviate this problem, but lack a guarantee of correctness of the results. In this paper, we introduce MoCReL, a new version of our previously proposed method that combines model checking with reinforcement learning in solving the planning problem. The approach takes advantage of reinforcement learning to synthesize path plans and task schedules for large numbers of autonomous agents, and of model checking to verify the correctness of the synthesized strategies. Further, MoCReL can compress large strategies into smaller ones that have down to 0.05% of the original sizes, while preserving their correctness, which we show in this paper. MoCReL is integrated into a new version of Uppaal Stratego that supports calling external libraries when running learning and verification of timed games models.
ArticleNumber 102894
Author Gu, Rong
Seceleanu, Cristina
Enoiu, Eduard
Jensen, Peter G.
Lundqvist, Kristina
Author_xml – sequence: 1
  givenname: Rong
  orcidid: 0000-0003-0570-6005
  surname: Gu
  fullname: Gu, Rong
  email: rong.gu@mdu.se
  organization: Mälardalen University, Sweden
– sequence: 2
  givenname: Peter G.
  surname: Jensen
  fullname: Jensen, Peter G.
  email: pgj@cs.aau.dk
  organization: Aalborg University, Denmark
– sequence: 3
  givenname: Cristina
  surname: Seceleanu
  fullname: Seceleanu, Cristina
  email: cristina.seceleanu@mdu.se
  organization: Mälardalen University, Sweden
– sequence: 4
  givenname: Eduard
  surname: Enoiu
  fullname: Enoiu, Eduard
  email: eduard.paul.enoiu@mdu.se
  organization: Mälardalen University, Sweden
– sequence: 5
  givenname: Kristina
  surname: Lundqvist
  fullname: Lundqvist, Kristina
  email: kristina.lundqvist@mdu.se
  organization: Mälardalen University, Sweden
BackLink https://urn.kb.se/resolve?urn=urn:nbn:se:mdh:diva-58045$$DView record from Swedish Publication Index (Mälardalens högskola)
BookMark eNp9kLtOwzAUQC1UJNrCF7DkA0ixncSJB4aqPKVKLMCEZLnOTeqqsSvbAfXvcQgsDEyWrs651j0zNDHWAEKXBC8IJux6t_BKK7ugmNI4oRXPT9CUVCVNS87yCZpGqkxZTrMzNPN-hzFmeUmm6H1lnQMVDHiftr100gSAOvHByQDtMfFHE7bgtU-kqRNlu4OLqLYmaaxLun4fdCpbMCGRfbDGdrb3UfIBOn-OThu593Dx887R6_3dy-oxXT8_PK2W61RlVRHSHDOeFxLXmEnKVSarmmeYkoYpzAtGCKgNjmRV1lzFcZE3krNNUbJKUkaabI6uxr3-Ew79Rhyc7qQ7Ciu1uNVvS2FdK7p6K4oK50XE-YgrZ7130Ailgwzxpni03guCxVBV7MR3VTFUFWPV6GZ_3N_P_rduRgtihQ8NbmDAKKj1EF_UVv_rfwEFjZaR
CitedBy_id crossref_primary_10_3390_s25113484
crossref_primary_10_1002_spe_3372
crossref_primary_10_1145_3672445
Cites_doi 10.1007/s42979-020-00326-5
10.1287/mnsc.35.11.1367
10.2514/1.G003724
10.1016/S0921-8890(98)00085-2
10.1016/j.scico.2022.102809
10.1016/j.tcs.2005.11.018
10.1007/s100090050010
10.1007/s10009-022-00657-z
10.1016/S1474-6670(17)32043-8
10.1007/s00236-016-0280-3
10.1007/s00165-020-00509-0
10.1016/0304-3975(94)90010-8
10.1016/j.arcontrol.2018.07.002
10.1007/s10994-020-05938-9
10.1016/j.simpat.2014.10.007
ContentType Journal Article
Copyright 2022 The Author(s)
Copyright_xml – notice: 2022 The Author(s)
DBID 6I.
AAFTH
AAYXX
CITATION
ABGEM
ADTPV
AOWAS
D8T
DF7
ZZAVC
DOI 10.1016/j.scico.2022.102894
DatabaseName ScienceDirect Open Access Titles
Elsevier:ScienceDirect:Open Access
CrossRef
SWEPUB Mälardalens högskola full text
SwePub
SwePub Articles
SWEPUB Freely available online
SWEPUB Mälardalens högskola
SwePub Articles full text
DatabaseTitle CrossRef
DatabaseTitleList

DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISSN 1872-7964
ExternalDocumentID oai_DiVA_org_mdh_58045
10_1016_j_scico_2022_102894
S0167642322001277
GroupedDBID --K
--M
.DC
.~1
0R~
123
1B1
1RT
1~.
1~5
4.4
457
4G.
5VS
6I.
7-5
71M
8P~
9JN
AACTN
AAEDW
AAFTH
AAIAV
AAIKJ
AAKOC
AALRI
AAOAW
AAQFI
AAXUO
AAYFN
ABBOA
ABJNI
ABMAC
ABYKQ
ACDAQ
ACGFS
ACRLP
ACZNC
ADBBV
ADEZE
AEBSH
AEKER
AENEX
AFKWA
AFTJW
AGUBO
AGYEJ
AHHHB
AHZHX
AIALX
AIEXJ
AIKHN
AITUG
AJOXV
ALMA_UNASSIGNED_HOLDINGS
AMFUW
AMRAJ
AOUOD
AXJTR
BKOJK
BLXMC
CS3
DU5
E.L
EBS
EFJIC
EFLBG
EO8
EO9
EP2
EP3
FDB
FEDTE
FIRID
FNPLU
FYGXN
G-Q
GBLVA
GBOLZ
HVGLF
IHE
IXB
J1W
KOM
LG9
M26
M41
MO0
N9A
O-L
O9-
OAUVE
OK1
OZT
P-8
P-9
P2P
PC.
Q38
ROL
RPZ
SDF
SDG
SDP
SES
SPC
SPCBC
SSV
T5K
TN5
XPP
ZMT
~G-
9DU
AAEDT
AAQXK
AATTM
AAXKI
AAYWO
AAYXX
ABFNM
ABWVN
ABXDB
ACLOT
ACNNM
ACRPL
ACVFH
ADCNI
ADHUB
ADMUD
ADNMO
ADVLN
AEIPS
AEUPX
AEXQZ
AFFNX
AFJKZ
AFPUW
AGHFR
AGQPQ
AIGII
AIIUN
AKBMS
AKRWK
AKYEP
ANKPU
APXCP
ASPBG
AVWKF
AZFZN
CITATION
EFKBS
EJD
FGOYB
G-2
HZ~
R2-
SEW
SSZ
WUQ
ZY4
~HD
ABGEM
ADTPV
AOWAS
D8T
DF7
ZZAVC
ID FETCH-LOGICAL-c385t-406945a0d06a29c3a8d93021f6c095611ecb038587d9c21f54fa96b5768a261f3
ISICitedReferencesCount 11
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000878770800001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 0167-6423
1872-7964
IngestDate Tue Nov 04 16:16:39 EST 2025
Tue Nov 18 22:27:57 EST 2025
Sat Nov 29 07:23:21 EST 2025
Fri Feb 23 02:38:50 EST 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Keywords Multi-agent autonomous systems
Strategy compression
Planning
Reinforcement learning
Timed games
Language English
License This is an open access article under the CC BY license.
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c385t-406945a0d06a29c3a8d93021f6c095611ecb038587d9c21f54fa96b5768a261f3
ORCID 0000-0003-0570-6005
OpenAccessLink https://urn.kb.se/resolve?urn=urn:nbn:se:mdh:diva-58045
ParticipantIDs swepub_primary_oai_DiVA_org_mdh_58045
crossref_citationtrail_10_1016_j_scico_2022_102894
crossref_primary_10_1016_j_scico_2022_102894
elsevier_sciencedirect_doi_10_1016_j_scico_2022_102894
PublicationCentury 2000
PublicationDate 2022-12-01
PublicationDateYYYYMMDD 2022-12-01
PublicationDate_xml – month: 12
  year: 2022
  text: 2022-12-01
  day: 01
PublicationDecade 2020
PublicationTitle Science of computer programming
PublicationYear 2022
Publisher Elsevier B.V
Publisher_xml – name: Elsevier B.V
References Tomita, Ueno, Shimakawa, Hagihara, Yonezaki (br0310) 2017; 54
Kempf, Bozga, Maler (br0050) 2013
David, Du, Larsen, Legay, Mikučionis, Poulsen, Sedwards (br0170) 2012
Julian, Kochenderfer, Owen (br0260) 2019; 42
Bersani, Soldo, Menghi, Pelliccione, Rossi (br0140) 2020
Křetínskỳ (br0380) 2021
Watkins (br0240) 1989
Larsen, Le Coënt, Mikučionis, Taankvist (br0060) 2018
Baier, Katoen (br0210) 2008
Cormen, Leiserson, Rivest, Stein (br0270) 2009
Behrmann, David, Fleury, Larsen, Lime, Nantes (br0300) 2007
Gu, Enoiu, Seceleanu, Lundqvist (br0120) 2020
Ashok, Jackermeier, Křetínskỳ, Weinhuber, Weininger, Yadav (br0480) 2021
Alur, Moarref, Topcu (br0370) 2016
Zhang, Dietterich (br0070) 1996; 8
Sutton, Barto (br0230) 2018
Andersen, Jensen, Bak, Quottrup (br0400) 2004; 37
Jegourel, Larsen, Legay, Mikučionis, Poulsen, Sedwards (br0280) 2016
Bengtsson, Yi (br0160) 2004
Larsen, Pettersson, Yi (br0220) 1997; 1
Frank, Mannor, Precup (br0340) 2008
Basile, ter Beek, Legay (br0410) 2020
David, Jensen, Larsen, Legay, Lime, Sørensen, Taankvist (br0200) 2014
Gleirscher, Calinescu, Douthwaite, Lesage, Paterson, Aitken, Alexander, Law (br0390) 2022; 218
Julian, Kochenderfer (br0460) 2019
Morio, Balesdent, Jacquemart, Vergé (br0330) 2014; 49
Bouton, Karlsson, Nakhaei, Fujimura, Kochenderfer, Tumova (br0430) 2019
Abdeddaı, Asarin, Maler (br0110) 2006; 354
Bouton, Cosgun, Kochenderfer (br0100) 2017
Hanna, Niekum, Stone (br0360) 2021; 110
LaValle (br0040) 1998
Behrmann, Larsen, Rasmussen (br0320) 2005
Gu, Jensen, Poulsen, Seceleanu, Enoiu, Lundqvist (br0130) 2022; 24
Oliveira, Fischer, Stepankova (br0010) 1999; 27
Atyabi, MahmoudZadeh, Nefti-Meziani (br0250) 2018; 46
Rabin (br0030) 2000
Behjati, Sirjani, Ahmadabadi (br0420) 2009
Cassez, David, Fleury, Larsen, Lime (br0180) 2005
Jaeger, Jensen, Larsen, Legay, Sedwards, Taankvist (br0290) 2019
Shyalika, Silva, Karunananda (br0080) 2020; 1
Julian, Lopez, Brush, Owen, Kochenderfer (br0450) 2016
Jothimurugan, Bansal, Bastani, Alur (br0440) 2021; 34
Glynn, Iglehart (br0350) 1989; 35
Piterman, Pnueli, Sa'ar (br0490) 2006
David, Jensen, Larsen, Mikučionis, Taankvist (br0150) 2015
Chandler, Pachter (br0020) 1998
Alur, Dill (br0190) 1994; 126
Ashok, Jackermeier, Jagtap, Křetínskỳ, Weininger, Zamani (br0470) 2020
Gu, Enoiu, Seceleanu (br0090) 2019
Behjati (10.1016/j.scico.2022.102894_br0420) 2009
Frank (10.1016/j.scico.2022.102894_br0340) 2008
Julian (10.1016/j.scico.2022.102894_br0450) 2016
Julian (10.1016/j.scico.2022.102894_br0260) 2019; 42
Gu (10.1016/j.scico.2022.102894_br0090) 2019
Bouton (10.1016/j.scico.2022.102894_br0430)
Sutton (10.1016/j.scico.2022.102894_br0230) 2018
Jaeger (10.1016/j.scico.2022.102894_br0290) 2019
Alur (10.1016/j.scico.2022.102894_br0370) 2016
Larsen (10.1016/j.scico.2022.102894_br0220) 1997; 1
Basile (10.1016/j.scico.2022.102894_br0410) 2020
Bouton (10.1016/j.scico.2022.102894_br0100) 2017
Gleirscher (10.1016/j.scico.2022.102894_br0390) 2022; 218
Oliveira (10.1016/j.scico.2022.102894_br0010) 1999; 27
Křetínskỳ (10.1016/j.scico.2022.102894_br0380)
Abdeddaı (10.1016/j.scico.2022.102894_br0110) 2006; 354
Rabin (10.1016/j.scico.2022.102894_br0030) 2000
Piterman (10.1016/j.scico.2022.102894_br0490) 2006
Ashok (10.1016/j.scico.2022.102894_br0470) 2020
Gu (10.1016/j.scico.2022.102894_br0120) 2020
Hanna (10.1016/j.scico.2022.102894_br0360) 2021; 110
Shyalika (10.1016/j.scico.2022.102894_br0080) 2020; 1
Cormen (10.1016/j.scico.2022.102894_br0270) 2009
Kempf (10.1016/j.scico.2022.102894_br0050) 2013
Alur (10.1016/j.scico.2022.102894_br0190) 1994; 126
Atyabi (10.1016/j.scico.2022.102894_br0250) 2018; 46
Chandler (10.1016/j.scico.2022.102894_br0020) 1998
Larsen (10.1016/j.scico.2022.102894_br0060) 2018
Andersen (10.1016/j.scico.2022.102894_br0400) 2004; 37
David (10.1016/j.scico.2022.102894_br0170)
Glynn (10.1016/j.scico.2022.102894_br0350) 1989; 35
Bengtsson (10.1016/j.scico.2022.102894_br0160) 2004
Tomita (10.1016/j.scico.2022.102894_br0310) 2017; 54
Gu (10.1016/j.scico.2022.102894_br0130) 2022; 24
Zhang (10.1016/j.scico.2022.102894_br0070) 1996; 8
Baier (10.1016/j.scico.2022.102894_br0210) 2008
Morio (10.1016/j.scico.2022.102894_br0330) 2014; 49
Ashok (10.1016/j.scico.2022.102894_br0480) 2021
Watkins (10.1016/j.scico.2022.102894_br0240) 1989
Julian (10.1016/j.scico.2022.102894_br0460) 2019
David (10.1016/j.scico.2022.102894_br0150) 2015
LaValle (10.1016/j.scico.2022.102894_br0040) 1998
Jegourel (10.1016/j.scico.2022.102894_br0280) 2016
Cassez (10.1016/j.scico.2022.102894_br0180) 2005
David (10.1016/j.scico.2022.102894_br0200) 2014
Jothimurugan (10.1016/j.scico.2022.102894_br0440) 2021; 34
Bersani (10.1016/j.scico.2022.102894_br0140) 2020
Behrmann (10.1016/j.scico.2022.102894_br0320) 2005
Behrmann (10.1016/j.scico.2022.102894_br0300) 2007
References_xml – start-page: 292
  year: 2009
  end-page: 307
  ident: br0420
  article-title: Bounded rational search for on-the-fly model checking of LTL properties
  publication-title: FSE
– year: 2019
  ident: br0430
  article-title: Reinforcement learning with probabilistic guarantees for autonomous driving
– year: 1998
  ident: br0040
  article-title: Rapidly-exploring random trees: a new tool for path planning
– volume: 126
  start-page: 183
  year: 1994
  end-page: 235
  ident: br0190
  article-title: A theory of timed automata
  publication-title: Theor. Comput. Sci.
– year: 2017
  ident: br0100
  article-title: Belief state planning for autonomously navigating urban intersections
  publication-title: Intelligent Vehicles Symposium
– volume: 354
  start-page: 272
  year: 2006
  end-page: 300
  ident: br0110
  article-title: Scheduling with timed automata
  publication-title: Theor. Comput. Sci.
– year: 1998
  ident: br0020
  article-title: Research issues in autonomous control of tactical UAVs
  publication-title: Proceedings of the 1998 American Control Conference. ACC (IEEE Cat. No. 98CH36207)
– start-page: 121
  year: 2007
  end-page: 125
  ident: br0300
  article-title: UPPAAL-Tiga: time for playing games! (tool paper)
  publication-title: Proceedings of the 2007 Computer Aided Verification
– start-page: 1
  year: 2016
  end-page: 10
  ident: br0450
  article-title: Policy compression for aircraft collision avoidance systems
  publication-title: 2016 IEEE/AIAA 35th Digital Avionics Systems Conference (DASC)
– volume: 46
  start-page: 196
  year: 2018
  end-page: 215
  ident: br0250
  article-title: Current advancements on autonomous mission planning and management systems: an AUV and UAV perspective
  publication-title: Annu. Rev. Control
– start-page: 113
  year: 2018
  end-page: 133
  ident: br0060
  article-title: Guaranteed control synthesis for continuous systems in UPPAAL Tiga
  publication-title: Cyber Physical Systems. Model-Based Design
– year: 1989
  ident: br0240
  article-title: Learning from Delayed Rewards
– volume: 8
  start-page: 1024
  year: 1996
  end-page: 1030
  ident: br0070
  article-title: High-performance job-shop scheduling with a timedelay TD(
  publication-title: Adv. Neural Inf. Process. Syst.
– volume: 49
  start-page: 287
  year: 2014
  end-page: 304
  ident: br0330
  article-title: A survey of rare event simulation methods for static input–output models
  publication-title: Simul. Model. Pract. Theory
– start-page: 364
  year: 2006
  end-page: 380
  ident: br0490
  article-title: Synthesis of reactive (1) designs
  publication-title: International Workshop on Verification, Model Checking, and Abstract Interpretation
– start-page: 87
  year: 2004
  end-page: 124
  ident: br0160
  article-title: Timed automata: semantics, algorithms and tools
  publication-title: Lectures on Concurrency and Petri Nets: Advances in Petri Nets
– volume: 37
  start-page: 597
  year: 2004
  end-page: 602
  ident: br0400
  article-title: Motion planning in multi-robot systems using timed automata
  publication-title: IFAC Proc. Vol.
– start-page: 81
  year: 2005
  end-page: 94
  ident: br0320
  article-title: Beyond liveness: efficient parameter synthesis for time bounded liveness
  publication-title: International Conference on Formal Modeling and Analysis of Timed Systems
– start-page: 385
  year: 2013
  end-page: 400
  ident: br0050
  article-title: As soon as probable: optimal scheduling under stochastic uncertainty
  publication-title: International Conference on Tools and Algorithms for the Construction and Analysis of Systems
– year: 2021
  ident: br0380
  article-title: LTL-constrained steady-state policy synthesis
– volume: 27
  start-page: 91
  year: 1999
  end-page: 106
  ident: br0010
  article-title: Multi-agent systems: which research for which applications
  publication-title: Robot. Auton. Syst.
– start-page: 251
  year: 2016
  end-page: 269
  ident: br0370
  article-title: Compositional synthesis of reactive controllers for multi-agent systems
  publication-title: International Conference on Computer Aided Verification
– start-page: 73
  year: 2020
  end-page: 92
  ident: br0120
  article-title: Verifiable and scalable mission-plan synthesis for multiple autonomous agents
  publication-title: 25th International Conference on Formal Methods for Industrial Critical Systems
– year: 2015
  ident: br0150
  article-title: UPPAAL stratego
  publication-title: TACAS 2015: International Conference on Tools and Algorithms for the Construction and Analysis of Systems
– year: 2008
  ident: br0210
  article-title: Principles of Model Checking
– volume: 218
  year: 2022
  ident: br0390
  article-title: Verified synthesis of optimal safety controllers for human-robot collaboration
  publication-title: Sci. Comput. Program.
– start-page: 336
  year: 2008
  end-page: 343
  ident: br0340
  article-title: Reinforcement learning in the presence of rare events
  publication-title: Proceedings of the 25th International Conference on Machine Learning
– start-page: 1
  year: 2020
  end-page: 7
  ident: br0470
  article-title: dtControl: decision tree learning algorithms for controller representation
  publication-title: Proceedings of the 23rd International Conference on Hybrid Systems: Computation and Control
– year: 2009
  ident: br0270
  article-title: Introduction to Algorithms
– volume: 1
  start-page: 1
  year: 2020
  end-page: 17
  ident: br0080
  article-title: Reinforcement learning in dynamic task scheduling: a review
  publication-title: SN Comput. Sci.
– start-page: 129
  year: 2014
  end-page: 145
  ident: br0200
  article-title: On time with minimal expected cost!
  publication-title: International Symposium on Automated Technology for Verification and Analysis
– volume: 110
  start-page: 1267
  year: 2021
  end-page: 1317
  ident: br0360
  article-title: Importance sampling in reinforcement learning with an estimated behavior policy
  publication-title: Mach. Learn.
– volume: 42
  start-page: 598
  year: 2019
  end-page: 608
  ident: br0260
  article-title: Deep neural network compression for aircraft collision avoidance systems
  publication-title: J. Guid. Control Dyn.
– start-page: 163
  year: 2016
  end-page: 178
  ident: br0280
  article-title: Importance sampling for stochastic timed automata
  publication-title: International Symposium on Dependable Software Engineering: Theories, Tools, and Applications
– start-page: 66
  year: 2005
  end-page: 80
  ident: br0180
  article-title: Efficient on-the-fly algorithms for the analysis of timed games
  publication-title: CONCUR 2005: International Conference on Concurrency Theory
– year: 2018
  ident: br0230
  article-title: Reinforcement Learning: An Introduction
– start-page: 1
  year: 2019
  end-page: 10
  ident: br0460
  article-title: Guaranteeing safety for neural network-based aircraft collision avoidance systems
  publication-title: 2019 IEEE/AIAA 38th Digital Avionics Systems Conference (DASC)
– volume: 24
  year: 2022
  ident: br0130
  article-title: Verifiable strategy synthesis for multiple autonomous agents: a scalable approach
  publication-title: Int. J. Softw. Tools Technol. Transf.
– start-page: 81
  year: 2019
  end-page: 97
  ident: br0290
  article-title: Teaching stratego to play ball: optimal synthesis for continuous space MDPs
  publication-title: International Symposium on Automated Technology for Verification and Analysis
– start-page: 326
  year: 2021
  end-page: 345
  ident: br0480
  article-title: dtControl 2.0: explainable strategy representation via decision tree learning steered by experts
  publication-title: International Conference on Tools and Algorithms for the Construction and Analysis of Systems
– volume: 1
  start-page: 134
  year: 1997
  end-page: 152
  ident: br0220
  article-title: UPPAAL in a nutshell
  publication-title: Int. J. Softw. Tools Technol. Transf.
– year: 2019
  ident: br0090
  article-title: TAMAA: UPPAAL-based mission planning for autonomous agents
  publication-title: 35th ACM/SIGAPP Symposium on Applied Computing SAC2020
– volume: 35
  start-page: 1367
  year: 1989
  end-page: 1392
  ident: br0350
  article-title: Importance sampling for stochastic simulations
  publication-title: Manag. Sci.
– year: 2020
  ident: br0410
  article-title: Strategy synthesis for autonomous driving in a moving block railway system with uppaal stratego
  publication-title: International Conference on Formal Techniques for Distributed Objects, Components, and Systems
– volume: 54
  start-page: 655
  year: 2017
  end-page: 692
  ident: br0310
  article-title: Safraless LTL synthesis considering maximal realizability
  publication-title: Acta Inform.
– year: 2012
  ident: br0170
  article-title: Statistical model checking for stochastic hybrid systems
– year: 2020
  ident: br0140
  article-title: PuRSUE-from specification of robotic environments to synthesis of controllers
  publication-title: Form. Asp. Comput.
– volume: 34
  year: 2021
  ident: br0440
  article-title: Compositional reinforcement learning from logical specifications
  publication-title: Adv. Neural Inf. Process. Syst.
– year: 2000
  ident: br0030
  article-title: A* aesthetic optimizations
  publication-title: Game Programming Gems
– volume: 1
  start-page: 1
  issue: 6
  year: 2020
  ident: 10.1016/j.scico.2022.102894_br0080
  article-title: Reinforcement learning in dynamic task scheduling: a review
  publication-title: SN Comput. Sci.
  doi: 10.1007/s42979-020-00326-5
– year: 2009
  ident: 10.1016/j.scico.2022.102894_br0270
– ident: 10.1016/j.scico.2022.102894_br0170
– start-page: 113
  year: 2018
  ident: 10.1016/j.scico.2022.102894_br0060
  article-title: Guaranteed control synthesis for continuous systems in UPPAAL Tiga
– ident: 10.1016/j.scico.2022.102894_br0380
– year: 2000
  ident: 10.1016/j.scico.2022.102894_br0030
  article-title: A* aesthetic optimizations
– start-page: 1
  year: 2019
  ident: 10.1016/j.scico.2022.102894_br0460
  article-title: Guaranteeing safety for neural network-based aircraft collision avoidance systems
– year: 1998
  ident: 10.1016/j.scico.2022.102894_br0040
– year: 1989
  ident: 10.1016/j.scico.2022.102894_br0240
– volume: 35
  start-page: 1367
  issue: 11
  year: 1989
  ident: 10.1016/j.scico.2022.102894_br0350
  article-title: Importance sampling for stochastic simulations
  publication-title: Manag. Sci.
  doi: 10.1287/mnsc.35.11.1367
– start-page: 326
  year: 2021
  ident: 10.1016/j.scico.2022.102894_br0480
  article-title: dtControl 2.0: explainable strategy representation via decision tree learning steered by experts
– start-page: 1
  year: 2016
  ident: 10.1016/j.scico.2022.102894_br0450
  article-title: Policy compression for aircraft collision avoidance systems
– volume: 42
  start-page: 598
  issue: 3
  year: 2019
  ident: 10.1016/j.scico.2022.102894_br0260
  article-title: Deep neural network compression for aircraft collision avoidance systems
  publication-title: J. Guid. Control Dyn.
  doi: 10.2514/1.G003724
– start-page: 81
  year: 2019
  ident: 10.1016/j.scico.2022.102894_br0290
  article-title: Teaching stratego to play ball: optimal synthesis for continuous space MDPs
– start-page: 87
  year: 2004
  ident: 10.1016/j.scico.2022.102894_br0160
  article-title: Timed automata: semantics, algorithms and tools
– start-page: 163
  year: 2016
  ident: 10.1016/j.scico.2022.102894_br0280
  article-title: Importance sampling for stochastic timed automata
– volume: 27
  start-page: 91
  issue: 1–2
  year: 1999
  ident: 10.1016/j.scico.2022.102894_br0010
  article-title: Multi-agent systems: which research for which applications
  publication-title: Robot. Auton. Syst.
  doi: 10.1016/S0921-8890(98)00085-2
– start-page: 292
  year: 2009
  ident: 10.1016/j.scico.2022.102894_br0420
  article-title: Bounded rational search for on-the-fly model checking of LTL properties
– start-page: 81
  year: 2005
  ident: 10.1016/j.scico.2022.102894_br0320
  article-title: Beyond liveness: efficient parameter synthesis for time bounded liveness
– volume: 218
  year: 2022
  ident: 10.1016/j.scico.2022.102894_br0390
  article-title: Verified synthesis of optimal safety controllers for human-robot collaboration
  publication-title: Sci. Comput. Program.
  doi: 10.1016/j.scico.2022.102809
– year: 2008
  ident: 10.1016/j.scico.2022.102894_br0210
– volume: 354
  start-page: 272
  issue: 2
  year: 2006
  ident: 10.1016/j.scico.2022.102894_br0110
  article-title: Scheduling with timed automata
  publication-title: Theor. Comput. Sci.
  doi: 10.1016/j.tcs.2005.11.018
– volume: 8
  start-page: 1024
  year: 1996
  ident: 10.1016/j.scico.2022.102894_br0070
  article-title: High-performance job-shop scheduling with a timedelay TD(λ) network
  publication-title: Adv. Neural Inf. Process. Syst.
– volume: 1
  start-page: 134
  issue: 1–2
  year: 1997
  ident: 10.1016/j.scico.2022.102894_br0220
  article-title: UPPAAL in a nutshell
  publication-title: Int. J. Softw. Tools Technol. Transf.
  doi: 10.1007/s100090050010
– year: 2015
  ident: 10.1016/j.scico.2022.102894_br0150
  article-title: UPPAAL stratego
– start-page: 129
  year: 2014
  ident: 10.1016/j.scico.2022.102894_br0200
  article-title: On time with minimal expected cost!
– volume: 24
  issue: 3
  year: 2022
  ident: 10.1016/j.scico.2022.102894_br0130
  article-title: Verifiable strategy synthesis for multiple autonomous agents: a scalable approach
  publication-title: Int. J. Softw. Tools Technol. Transf.
  doi: 10.1007/s10009-022-00657-z
– volume: 37
  start-page: 597
  issue: 8
  year: 2004
  ident: 10.1016/j.scico.2022.102894_br0400
  article-title: Motion planning in multi-robot systems using timed automata
  publication-title: IFAC Proc. Vol.
  doi: 10.1016/S1474-6670(17)32043-8
– start-page: 1
  year: 2020
  ident: 10.1016/j.scico.2022.102894_br0470
  article-title: dtControl: decision tree learning algorithms for controller representation
– ident: 10.1016/j.scico.2022.102894_br0430
– volume: 54
  start-page: 655
  issue: 7
  year: 2017
  ident: 10.1016/j.scico.2022.102894_br0310
  article-title: Safraless LTL synthesis considering maximal realizability
  publication-title: Acta Inform.
  doi: 10.1007/s00236-016-0280-3
– start-page: 364
  year: 2006
  ident: 10.1016/j.scico.2022.102894_br0490
  article-title: Synthesis of reactive (1) designs
– start-page: 121
  year: 2007
  ident: 10.1016/j.scico.2022.102894_br0300
  article-title: UPPAAL-Tiga: time for playing games! (tool paper)
– year: 2020
  ident: 10.1016/j.scico.2022.102894_br0140
  article-title: PuRSUE-from specification of robotic environments to synthesis of controllers
  publication-title: Form. Asp. Comput.
  doi: 10.1007/s00165-020-00509-0
– start-page: 251
  year: 2016
  ident: 10.1016/j.scico.2022.102894_br0370
  article-title: Compositional synthesis of reactive controllers for multi-agent systems
– volume: 126
  start-page: 183
  year: 1994
  ident: 10.1016/j.scico.2022.102894_br0190
  article-title: A theory of timed automata
  publication-title: Theor. Comput. Sci.
  doi: 10.1016/0304-3975(94)90010-8
– year: 2018
  ident: 10.1016/j.scico.2022.102894_br0230
– volume: 46
  start-page: 196
  year: 2018
  ident: 10.1016/j.scico.2022.102894_br0250
  article-title: Current advancements on autonomous mission planning and management systems: an AUV and UAV perspective
  publication-title: Annu. Rev. Control
  doi: 10.1016/j.arcontrol.2018.07.002
– year: 1998
  ident: 10.1016/j.scico.2022.102894_br0020
  article-title: Research issues in autonomous control of tactical UAVs
– start-page: 385
  year: 2013
  ident: 10.1016/j.scico.2022.102894_br0050
  article-title: As soon as probable: optimal scheduling under stochastic uncertainty
– volume: 110
  start-page: 1267
  issue: 6
  year: 2021
  ident: 10.1016/j.scico.2022.102894_br0360
  article-title: Importance sampling in reinforcement learning with an estimated behavior policy
  publication-title: Mach. Learn.
  doi: 10.1007/s10994-020-05938-9
– start-page: 73
  year: 2020
  ident: 10.1016/j.scico.2022.102894_br0120
  article-title: Verifiable and scalable mission-plan synthesis for multiple autonomous agents
– year: 2020
  ident: 10.1016/j.scico.2022.102894_br0410
  article-title: Strategy synthesis for autonomous driving in a moving block railway system with uppaal stratego
– start-page: 66
  year: 2005
  ident: 10.1016/j.scico.2022.102894_br0180
  article-title: Efficient on-the-fly algorithms for the analysis of timed games
– year: 2017
  ident: 10.1016/j.scico.2022.102894_br0100
  article-title: Belief state planning for autonomously navigating urban intersections
– year: 2019
  ident: 10.1016/j.scico.2022.102894_br0090
  article-title: TAMAA: UPPAAL-based mission planning for autonomous agents
– volume: 49
  start-page: 287
  year: 2014
  ident: 10.1016/j.scico.2022.102894_br0330
  article-title: A survey of rare event simulation methods for static input–output models
  publication-title: Simul. Model. Pract. Theory
  doi: 10.1016/j.simpat.2014.10.007
– volume: 34
  year: 2021
  ident: 10.1016/j.scico.2022.102894_br0440
  article-title: Compositional reinforcement learning from logical specifications
  publication-title: Adv. Neural Inf. Process. Syst.
– start-page: 336
  year: 2008
  ident: 10.1016/j.scico.2022.102894_br0340
  article-title: Reinforcement learning in the presence of rare events
SSID ssj0006471
Score 2.413421
Snippet Planning is a critical function of multi-agent autonomous systems, which includes path finding and task scheduling. Exhaustive search-based methods such as...
SourceID swepub
crossref
elsevier
SourceType Open Access Repository
Enrichment Source
Index Database
Publisher
StartPage 102894
SubjectTerms Multi-agent autonomous systems
Planning
Reinforcement learning
Strategy compression
Timed games
Title Correctness-guaranteed strategy synthesis and compression for multi-agent autonomous systems
URI https://dx.doi.org/10.1016/j.scico.2022.102894
https://urn.kb.se/resolve?urn=urn:nbn:se:mdh:diva-58045
Volume 224
WOSCitedRecordID wos000878770800001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVESC
  databaseName: Elsevier SD Freedom Collection Journals 2021
  customDbUrl:
  eissn: 1872-7964
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0006471
  issn: 0167-6423
  databaseCode: AIEXJ
  dateStart: 20211209
  isFulltext: true
  titleUrlDefault: https://www.sciencedirect.com
  providerName: Elsevier
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1LbxMxELZCyoELlJcoj8oHOMFGWe_Le4xCWqhQxaFUOSBZtnc3pGqdKtlU5Qfwvxk_s1CI6IHLKnK8jrXz7XjG-fwNQq95CjF_LGXEZUGiVHARCUqSKG4yTqisamHEnk8_FcfHdDotP_d6P_xZmKvzQil6fV1e_ldTQxsYWx-dvYW5w6DQAJ_B6HAFs8P1nww_1vU2ZKtdWDQDAOhHB1HlysrQftcaBRD0aR0Sd6TNUWEt49AQDCM-M9TzdauPPGiS7KojbO5CWe8VHC9dl4bwbK8Lvx5qYs_aErg3LUe1cts-hhz89nAQtnlqCasgV-aWsXE_KiwbE7WYmy8m1dpJAfjtCkI61A-3gwmeGZKepOuCCUk7TlTHPLby8Q3_brcaziDzh_dkoMcfbHr_qqb92yoXuIee1nbGzCBMD8LsIHfQDimykvbRzujjZHoUlvTcZu5h7l6-yhAFb8zlryFOV4vWxC8nu-i-SzzwyALmIerV6hF64It6YGfNx-jrn_GDPX5wwA8G_OAOfjDgB3fwgzf4wQ4_T9CXg8nJ-EPkSnBEMqFZG5lz0RkfVsOck1ImnFZlAmFhk0utYBnHtRTmv-WiKiU0Z2nDy1zoJJZDbt4kT1FfLVT9DOGkyShE46IexiJNeSqgi6QxpzLJRdXQPUT8U2PS6dPrMinnbIvF9tC7cNOllWfZ3j335mAuwrSRIwOAbb_xjTVe-BUty_5-fjpii-WMXVTfWEYhO3p-u_m8QPc278hL1G-X6_oVuiuv2vlque9Q-BOm5rBp
linkProvider Elsevier
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Correctness-guaranteed+strategy+synthesis+and+compression+for+multi-agent+autonomous+systems&rft.jtitle=Science+of+computer+programming&rft.au=Gu%2C+Rong&rft.au=Jensen%2C+Peter+G.&rft.au=Seceleanu%2C+Cristina&rft.au=Enoiu%2C+Eduard&rft.date=2022-12-01&rft.issn=0167-6423&rft.volume=224&rft.spage=102894&rft_id=info:doi/10.1016%2Fj.scico.2022.102894&rft.externalDBID=n%2Fa&rft.externalDocID=10_1016_j_scico_2022_102894
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0167-6423&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0167-6423&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0167-6423&client=summon