Optimal distributed synchronization control for continuous-time heterogeneous multi-agent differential graphical games

In this paper, a new optimal distributed synchronization control scheme for the consensus problem of heterogeneous multi-agent differential graphical games is developed by iterative adaptive dynamic programming (ADP). The main idea is to use iterative ADP technique to obtain the iterative control la...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Information sciences Jg. 317; S. 96 - 113
Hauptverfasser: Wei, Qinglai, Liu, Derong, Lewis, Frank L.
Format: Journal Article
Sprache:Englisch
Veröffentlicht: Elsevier Inc 01.10.2015
Schlagworte:
ISSN:0020-0255, 1872-6291
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Abstract In this paper, a new optimal distributed synchronization control scheme for the consensus problem of heterogeneous multi-agent differential graphical games is developed by iterative adaptive dynamic programming (ADP). The main idea is to use iterative ADP technique to obtain the iterative control law which makes all the agents track a given dynamics and simultaneously makes the iterative value function reach the Nash equilibrium. In the developed heterogeneous multi-agent differential graphical games, the agent of each node is different from one another. The dynamics and performance index function for each node depend only on local neighborhood information. A cooperative policy iteration algorithm is presented to achieve the optimal distributed synchronization control law for the agent of each node, where the coupled Hamilton–Jacobi equations for optimal synchronization control of heterogeneous multi-agent differential games can be avoided. Convergence analysis is developed to show that the iterative value functions of heterogeneous multi-agent differential graphical games can converge to the Nash equilibrium. Two simulation examples are given to show the effectiveness of the developed optimal control scheme.
AbstractList In this paper, a new optimal distributed synchronization control scheme for the consensus problem of heterogeneous multi-agent differential graphical games is developed by iterative adaptive dynamic programming (ADP). The main idea is to use iterative ADP technique to obtain the iterative control law which makes all the agents track a given dynamics and simultaneously makes the iterative value function reach the Nash equilibrium. In the developed heterogeneous multi-agent differential graphical games, the agent of each node is different from one another. The dynamics and performance index function for each node depend only on local neighborhood information. A cooperative policy iteration algorithm is presented to achieve the optimal distributed synchronization control law for the agent of each node, where the coupled Hamilton-Jacobi equations for optimal synchronization control of heterogeneous multi-agent differential games can be avoided. Convergence analysis is developed to show that the iterative value functions of heterogeneous multi-agent differential graphical games can converge to the Nash equilibrium. Two simulation examples are given to show the effectiveness of the developed optimal control scheme.
Author Lewis, Frank L.
Wei, Qinglai
Liu, Derong
Author_xml – sequence: 1
  givenname: Qinglai
  surname: Wei
  fullname: Wei, Qinglai
  email: qinglai.wei@ia.ac.cn
  organization: The State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
– sequence: 2
  givenname: Derong
  surname: Liu
  fullname: Liu, Derong
  email: derong.liu@ia.ac.cn
  organization: The State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
– sequence: 3
  givenname: Frank L.
  surname: Lewis
  fullname: Lewis, Frank L.
  email: lewis@uta.edu
  organization: UTA Research Institute, University of Texas at Arlington, Fort Worth, TX, USA
BookMark eNp9kE9r4zAQxUXpQpNsP0BvPu7F6ciWLZuelrD_oNBLexayPGomOFIqyYXsp18l6WkPhYF5Gub3GL0lu3beIWN3HNYceHu_W5OL6wp4swaRS1yxBe9kVbZVz6_ZAqCCEqqmuWHLGHcAIGTbLtj70yHRXk_FSDEFGuaEYxGPzmyDd_RXJ_KuMN6l4KfC-nDW5GY_xzKDWGwxYfCv6DCPiv08JSp1fqbsaC2GrCjbvwZ92JI5Kb3H-JV9sXqKePvRV-zl54_nze_y8enXn833x9LUNaRS9IMYQXALoDvbmW7sq0rqph14g03fwCB5L7SFEVF30GuuWznYuhttZTvZ1yv27eJ7CP5txpjUnqLBadLnexWXEuqm5n2XV_ll1QQfY0CrDiEnE46KgzplrHYqZ6xOGSsQuURm5H-MoXTOLAVN06fkw4XE_Pt3wqCiIXQGRwpokho9fUL_A_bJnKQ
CitedBy_id crossref_primary_10_1016_j_ejcon_2019_10_008
crossref_primary_10_1109_TCYB_2016_2586082
crossref_primary_10_1016_j_ins_2023_119884
crossref_primary_10_1109_TSMC_2018_2883801
crossref_primary_10_1631_FITEE_2200010
crossref_primary_10_1016_j_jfranklin_2018_11_054
crossref_primary_10_1016_j_neucom_2016_11_041
crossref_primary_10_1109_TCYB_2017_2788819
crossref_primary_10_3390_s20051302
crossref_primary_10_1016_j_neucom_2017_01_076
crossref_primary_10_1109_TCYB_2022_3196003
crossref_primary_10_1007_s00521_022_07880_4
crossref_primary_10_1007_s00521_016_2593_0
crossref_primary_10_1016_j_ins_2015_08_042
crossref_primary_10_1080_00207179_2018_1441550
crossref_primary_10_1002_rnc_4650
crossref_primary_10_1016_j_ins_2021_12_125
crossref_primary_10_1016_j_jfranklin_2022_12_021
crossref_primary_10_1109_JAS_2021_1003838
crossref_primary_10_1016_j_neucom_2020_04_119
crossref_primary_10_1109_TASE_2023_3289950
crossref_primary_10_1109_TCSI_2023_3246001
crossref_primary_10_1109_TNNLS_2023_3291542
crossref_primary_10_1109_TSMC_2025_3548114
crossref_primary_10_1109_TCYB_2024_3354945
crossref_primary_10_1109_TCSI_2025_3548900
crossref_primary_10_1109_TCYB_2016_2611613
crossref_primary_10_1016_j_ins_2015_11_019
crossref_primary_10_1109_TCYB_2018_2819695
crossref_primary_10_3390_app142210273
crossref_primary_10_1002_oca_2907
crossref_primary_10_1109_TSMC_2022_3190058
crossref_primary_10_1109_TCYB_2021_3110645
crossref_primary_10_1109_TNNLS_2017_2728622
crossref_primary_10_1016_j_neucom_2017_09_020
crossref_primary_10_1016_j_engappai_2025_110998
crossref_primary_10_1016_j_neucom_2017_09_066
crossref_primary_10_1002_rnc_7939
crossref_primary_10_1016_j_amc_2019_01_066
crossref_primary_10_1007_s12555_018_0904_1
crossref_primary_10_1016_j_ins_2021_01_056
crossref_primary_10_1002_acs_3945
crossref_primary_10_1007_s12555_016_0507_7
crossref_primary_10_1049_iet_cta_2019_0397
crossref_primary_10_1109_TASE_2023_3237770
crossref_primary_10_1002_oca_2859
crossref_primary_10_1109_TCSI_2021_3121809
crossref_primary_10_1109_TSMC_2018_2814018
crossref_primary_10_1109_TSMC_2020_3011184
crossref_primary_10_1109_TNNLS_2015_2464080
crossref_primary_10_1109_TNNLS_2018_2832025
crossref_primary_10_1016_j_neucom_2016_02_029
crossref_primary_10_1109_TITS_2022_3223303
crossref_primary_10_1109_TFUZZ_2023_3327699
crossref_primary_10_1007_s11063_021_10641_4
crossref_primary_10_1109_TCYB_2021_3140104
crossref_primary_10_1016_j_ifacol_2021_04_205
crossref_primary_10_1016_j_isatra_2016_07_004
crossref_primary_10_1016_j_neucom_2015_05_075
crossref_primary_10_1007_s12083_019_00751_1
crossref_primary_10_1016_j_isatra_2019_01_021
crossref_primary_10_1109_TCSI_2022_3166220
crossref_primary_10_1016_j_ins_2016_07_051
crossref_primary_10_1016_j_neucom_2017_07_058
crossref_primary_10_1016_j_neucom_2021_05_046
crossref_primary_10_1109_TIE_2016_2542134
crossref_primary_10_1049_iet_cta_2016_0028
crossref_primary_10_1109_TCYB_2015_2492242
crossref_primary_10_1016_j_ins_2023_118949
crossref_primary_10_1016_j_neunet_2018_06_007
crossref_primary_10_1109_ACCESS_2020_3043775
crossref_primary_10_1016_j_jfranklin_2022_02_034
crossref_primary_10_1016_j_eswa_2025_128094
crossref_primary_10_1007_s00521_019_04263_0
crossref_primary_10_1016_j_ins_2019_12_078
crossref_primary_10_1016_j_neucom_2017_01_047
crossref_primary_10_1016_j_ins_2025_122117
crossref_primary_10_1007_s11071_025_11097_0
Cites_doi 10.1016/j.automatica.2013.09.043
10.1109/TNNLS.2014.2306201
10.1109/TNNLS.2012.2234133
10.1016/j.automatica.2004.11.034
10.1109/TASE.2013.2296206
10.1109/TNNLS.2013.2247627
10.1109/9.256331
10.1016/j.ins.2013.08.037
10.1109/TSMCB.2008.920269
10.1109/TAC.2006.884959
10.1109/TNN.2003.813839
10.1016/j.ins.2012.07.006
10.1109/TSMCC.2002.801727
10.1109/TCYB.2014.2313915
10.1109/TSMCB.2012.2216523
10.1109/TNN.2008.2000204
10.1016/j.automatica.2011.03.005
10.1109/MCS.2012.2214134
10.1109/TASE.2012.2198057
10.1109/TNNLS.2013.2281663
10.1016/j.automatica.2010.10.033
10.1002/acs.2349
10.1016/j.automatica.2014.10.053
10.1109/TASE.2013.2284545
10.1016/j.neucom.2008.05.012
10.1109/TNNLS.2013.2280013
10.1109/TAC.2013.2239011
10.1137/S036301290037908X
10.1109/TIE.2014.2301770
10.1109/TNNLS.2012.2227339
10.1016/j.automatica.2014.05.011
10.1109/TASE.2014.2303139
10.1016/j.automatica.2013.11.002
10.1109/72.623201
10.1016/j.automatica.2012.05.074
10.1109/TASE.2013.2280974
10.1109/TNNLS.2013.2271778
10.1109/72.914523
10.1109/TSMCB.2012.2207718
10.1109/TASE.2014.2300532
10.1109/TSMCB.2008.926614
10.1109/TII.2012.2231085
10.1016/j.automatica.2006.09.019
10.1016/j.neunet.2012.02.027
10.1109/TSMCB.2012.2203336
10.1016/j.ins.2014.07.008
10.1016/j.automatica.2014.02.015
10.1109/TNNLS.2013.2249668
10.1109/TNNLS.2013.2294968
10.1109/TNN.2005.853408
10.1109/TPWRS.2013.2237793
10.1016/j.ins.2014.05.050
10.1109/TNNLS.2013.2292704
10.1109/TSG.2012.2233224
10.1109/JSYST.2014.2330392
10.1109/TSMCB.2006.880135
10.1049/iet-cta.2012.0486
ContentType Journal Article
Copyright 2015 Elsevier Inc.
Copyright_xml – notice: 2015 Elsevier Inc.
DBID AAYXX
CITATION
7SC
8FD
JQ2
L7M
L~C
L~D
DOI 10.1016/j.ins.2015.04.044
DatabaseName CrossRef
Computer and Information Systems Abstracts
Technology Research Database
ProQuest Computer Science Collection
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
DatabaseTitle CrossRef
Computer and Information Systems Abstracts
Technology Research Database
Computer and Information Systems Abstracts – Academic
Advanced Technologies Database with Aerospace
ProQuest Computer Science Collection
Computer and Information Systems Abstracts Professional
DatabaseTitleList Computer and Information Systems Abstracts

DeliveryMethod fulltext_linktorsrc
Discipline Engineering
Library & Information Science
EISSN 1872-6291
EndPage 113
ExternalDocumentID 10_1016_j_ins_2015_04_044
S0020025515003266
GroupedDBID --K
--M
--Z
-~X
.DC
.~1
0R~
1B1
1RT
1~.
1~5
4.4
457
4G.
5GY
5VS
7-5
71M
8P~
9JN
9JO
AAAKF
AABNK
AACTN
AAEDT
AAEDW
AAIAV
AAIKJ
AAKOC
AALRI
AAOAW
AAQFI
AARIN
AAXUO
AAYFN
ABAOU
ABBOA
ABFNM
ABJNI
ABMAC
ABUCO
ABYKQ
ACAZW
ACDAQ
ACGFS
ACRLP
ACZNC
ADBBV
ADEZE
ADGUI
ADTZH
AEBSH
AECPX
AEKER
AENEX
AFKWA
AFTJW
AGHFR
AGUBO
AGYEJ
AHHHB
AHJVU
AHZHX
AIALX
AIEXJ
AIGVJ
AIKHN
AITUG
AJBFU
AJOXV
ALMA_UNASSIGNED_HOLDINGS
AMFUW
AMRAJ
AOUOD
APLSM
ARUGR
AXJTR
BJAXD
BKOJK
BLXMC
CS3
DU5
EBS
EFJIC
EFLBG
EJD
EO8
EO9
EP2
EP3
F5P
FDB
FIRID
FNPLU
FYGXN
G-Q
GBLVA
GBOLZ
HAMUX
IHE
J1W
JJJVA
KOM
LG9
LY1
M41
MHUIS
MO0
MS~
N9A
O-L
O9-
OAUVE
OZT
P-8
P-9
P2P
PC.
Q38
RIG
ROL
RPZ
SDF
SDG
SDP
SES
SPC
SPCBC
SSB
SSD
SST
SSV
SSW
SSZ
T5K
TN5
TWZ
WH7
XPP
ZMT
~02
~G-
1OL
29I
77I
9DU
AAAKG
AAQXK
AATTM
AAXKI
AAYWO
AAYXX
ABEFU
ABWVN
ABXDB
ACLOT
ACNNM
ACRPL
ACVFH
ADCNI
ADJOM
ADMUD
ADNMO
ADVLN
AEIPS
AEUPX
AFFNX
AFJKZ
AFPUW
AGQPQ
AIGII
AIIUN
AKBMS
AKRWK
AKYEP
ANKPU
APXCP
ASPBG
AVWKF
AZFZN
CITATION
EFKBS
FEDTE
FGOYB
HLZ
HVGLF
HZ~
H~9
R2-
SBC
SDS
SEW
UHS
WUQ
YYP
ZY4
~HD
7SC
8FD
JQ2
L7M
L~C
L~D
ID FETCH-LOGICAL-c330t-49b4d041f00a8f8c8d9227a56b15e5950b7194af0deea809a1a67bf38df2f8793
ISICitedReferencesCount 107
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000358093400006&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 0020-0255
IngestDate Sun Sep 28 02:00:38 EDT 2025
Tue Nov 18 22:09:45 EST 2025
Sat Nov 29 06:24:57 EST 2025
Fri Feb 23 02:23:14 EST 2024
IsPeerReviewed true
IsScholarly true
Keywords Approximate dynamic programming
Policy iteration
Adaptive dynamic programming
Graphical games
Adaptive critic designs
Heterogeneous multi-agents
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c330t-49b4d041f00a8f8c8d9227a56b15e5950b7194af0deea809a1a67bf38df2f8793
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
PQID 1770353198
PQPubID 23500
PageCount 18
ParticipantIDs proquest_miscellaneous_1770353198
crossref_primary_10_1016_j_ins_2015_04_044
crossref_citationtrail_10_1016_j_ins_2015_04_044
elsevier_sciencedirect_doi_10_1016_j_ins_2015_04_044
PublicationCentury 2000
PublicationDate 2015-10-01
2015-10-00
20151001
PublicationDateYYYYMMDD 2015-10-01
PublicationDate_xml – month: 10
  year: 2015
  text: 2015-10-01
  day: 01
PublicationDecade 2010
PublicationTitle Information sciences
PublicationYear 2015
Publisher Elsevier Inc
Publisher_xml – name: Elsevier Inc
References Vrabie, Vamvoudakis, Lewis (b0225) 2013
Enns, Si (b0055) 2003; 14
Liang, Molina, Venayagamoorthy, Harley (b0110) 2013; 28
Liu, Wei (b0130) 2014; 28
Si, Wang (b0195) 2001; 12
Xu, Jagannathan (b0300) 2013; 24
Xu, Yang, Shi (b0305) 2014; 25
Su, Wu, Shi (b0210) 2013; 9
Zhang, Qin, Luo (b0325) 2014; 11
Jamshidi (b0085) 1982
Tang, Gao, Zou, Kurths (b0215) 2013; 43
W.-Q. Wang, Carrier frequency synchronization in distributed wireless sensor networks, IEEE Syst. J. (2015) (in press) doi:10.1109/JSYST.2014.2330392.
Wei, Liu (b0280) 2014; 61
Su, Wu, Shi, Song (b0205) 2014; 50
Jiang, Jiang (b0095) 2014; 25
Wei, Liu (b0275) 2014; 11
Owen (b0180) 1982
Ni, He, Wen (b0175) 2013; 24
Basar, Bernhard (b0040) 1995
Lugli, Franco, Santos (b0150) 2014; 10
Basar, Olsder (b0045) 1982
Murray, Cox, Lendaris, Saeks (b0170) 2002; 32
Zhang, Qin, Jiang, Luo (b0320) 2014; 44
Al-Tamimi, Lewis, Abu-Khalaf (b0025) 2007; 43
Shi (b0190) 2002; 41
Werbos (b0295) 1991
Liu, Wei (b0140) 2014; 25
Molina, Venayagamoorthy, Liang, Harley (b0165) 2013; 4
Zhang, Wei, Luo (b0335) 2008; 38
Liu, Wang, Yang (b0125) 2013; 220
Trentelman, Takaba, Monshizadeh (b0220) 2013; 58
Wang, Liu, Li, Ma (b0255) 2014; 28
Werbos (b0290) 1977; 22
Liu, Wang, Zhao, Wei, Jin (b0120) 2012; 9
Iantovics, Zamfirescu (b0075) 2013; 9
Modares, Lewis (b0155) 2014; 50
Vamvoudakis, Lewis, Hudas (b0235) 2012; 48
Modares, Lewis, Naghibi-Sistani (b0160) 2014; 50
Zhang, Wei, Liu (b0330) 2011; 47
Lewis, Vrabie, Vamvoudakis (b0100) 2012; 32
Song, Xiao, Zhang, Sun (b0200) 2014; 25
Wei, Liu (b0265) 2013; 7
Wang, Liu, Li (b0250) 2014; 11
Jiang, Jiang (b0090) 2013; 24
Vamvoudakis, Lewis (b0230) 2011; 47
Liu, Wang, Li (b0115) 2014; 25
Xu, Zuo, Huang (b0310) 2014; 261
Abu-Khalaf, Lewis, Huang (b0015) 2008; 19
Wei, Liu (b0270) 2014; 11
Liu, Zhang, Zhang (b0145) 2005; 16
Kiumarsi, Lewis, Modares, Karimpour, Naghibi-Sistani (b0080) 2014; 50
Zhang, Cui, Luo (b0315) 2013; 43
Prokhorov, Wunsch (b0185) 1997; 8
Huang, Xu, Zuo (b0065) 2014; 286
Abu-Khalaf, Lewis (b0005) 2005; 41
Li, Liu, Wang (b0105) 2014; 11
Bertsekas, Tsitsiklis (b0050) 1996
Al-Tamimi, Abu-Khalaf, Lewis (b0020) 2007; 37
Al-Tamimi, Lewis, Abu-Khalaf (b0030) 2008; 38
Aurangzeb, Lewis (b0035) 2014; 50
Van Der Schaft (b0240) 1992; 37
Liu, Wei (b0135) 2013; 43
Fairbank, Alonso, Prokhorov (b0060) 2014; 24
Heydari, Balakrishnan (b0070) 2013; 24
Wei, Liu (b0260) 2012; 32
Wei, Zhang, Dai (b0285) 2009; 72
Abu-Khalaf, Lewis, Huang (b0010) 2006; 51
Aurangzeb (10.1016/j.ins.2015.04.044_b0035) 2014; 50
Owen (10.1016/j.ins.2015.04.044_b0180) 1982
Su (10.1016/j.ins.2015.04.044_b0210) 2013; 9
Basar (10.1016/j.ins.2015.04.044_b0040) 1995
10.1016/j.ins.2015.04.044_b0245
Wei (10.1016/j.ins.2015.04.044_b0275) 2014; 11
Wei (10.1016/j.ins.2015.04.044_b0280) 2014; 61
Prokhorov (10.1016/j.ins.2015.04.044_b0185) 1997; 8
Al-Tamimi (10.1016/j.ins.2015.04.044_b0030) 2008; 38
Zhang (10.1016/j.ins.2015.04.044_b0320) 2014; 44
Wang (10.1016/j.ins.2015.04.044_b0250) 2014; 11
Huang (10.1016/j.ins.2015.04.044_b0065) 2014; 286
Modares (10.1016/j.ins.2015.04.044_b0155) 2014; 50
Iantovics (10.1016/j.ins.2015.04.044_b0075) 2013; 9
Wei (10.1016/j.ins.2015.04.044_b0270) 2014; 11
Abu-Khalaf (10.1016/j.ins.2015.04.044_b0015) 2008; 19
Jamshidi (10.1016/j.ins.2015.04.044_b0085) 1982
Wei (10.1016/j.ins.2015.04.044_b0285) 2009; 72
Wei (10.1016/j.ins.2015.04.044_b0260) 2012; 32
Zhang (10.1016/j.ins.2015.04.044_b0330) 2011; 47
Heydari (10.1016/j.ins.2015.04.044_b0070) 2013; 24
Liu (10.1016/j.ins.2015.04.044_b0140) 2014; 25
Song (10.1016/j.ins.2015.04.044_b0200) 2014; 25
Lugli (10.1016/j.ins.2015.04.044_b0150) 2014; 10
Shi (10.1016/j.ins.2015.04.044_b0190) 2002; 41
Vamvoudakis (10.1016/j.ins.2015.04.044_b0235) 2012; 48
Ni (10.1016/j.ins.2015.04.044_b0175) 2013; 24
Werbos (10.1016/j.ins.2015.04.044_b0295) 1991
Xu (10.1016/j.ins.2015.04.044_b0305) 2014; 25
Zhang (10.1016/j.ins.2015.04.044_b0325) 2014; 11
Tang (10.1016/j.ins.2015.04.044_b0215) 2013; 43
Fairbank (10.1016/j.ins.2015.04.044_b0060) 2014; 24
Liu (10.1016/j.ins.2015.04.044_b0115) 2014; 25
Liu (10.1016/j.ins.2015.04.044_b0125) 2013; 220
Werbos (10.1016/j.ins.2015.04.044_b0290) 1977; 22
Enns (10.1016/j.ins.2015.04.044_b0055) 2003; 14
Kiumarsi (10.1016/j.ins.2015.04.044_b0080) 2014; 50
Liu (10.1016/j.ins.2015.04.044_b0130) 2014; 28
Wei (10.1016/j.ins.2015.04.044_b0265) 2013; 7
Bertsekas (10.1016/j.ins.2015.04.044_b0050) 1996
Basar (10.1016/j.ins.2015.04.044_b0045) 1982
Lewis (10.1016/j.ins.2015.04.044_b0100) 2012; 32
Molina (10.1016/j.ins.2015.04.044_b0165) 2013; 4
Abu-Khalaf (10.1016/j.ins.2015.04.044_b0005) 2005; 41
Murray (10.1016/j.ins.2015.04.044_b0170) 2002; 32
Van Der Schaft (10.1016/j.ins.2015.04.044_b0240) 1992; 37
Jiang (10.1016/j.ins.2015.04.044_b0090) 2013; 24
Al-Tamimi (10.1016/j.ins.2015.04.044_b0025) 2007; 43
Liu (10.1016/j.ins.2015.04.044_b0120) 2012; 9
Su (10.1016/j.ins.2015.04.044_b0205) 2014; 50
Vrabie (10.1016/j.ins.2015.04.044_b0225) 2013
Xu (10.1016/j.ins.2015.04.044_b0310) 2014; 261
Abu-Khalaf (10.1016/j.ins.2015.04.044_b0010) 2006; 51
Liu (10.1016/j.ins.2015.04.044_b0145) 2005; 16
Liu (10.1016/j.ins.2015.04.044_b0135) 2013; 43
Vamvoudakis (10.1016/j.ins.2015.04.044_b0230) 2011; 47
Jiang (10.1016/j.ins.2015.04.044_b0095) 2014; 25
Al-Tamimi (10.1016/j.ins.2015.04.044_b0020) 2007; 37
Liang (10.1016/j.ins.2015.04.044_b0110) 2013; 28
Trentelman (10.1016/j.ins.2015.04.044_b0220) 2013; 58
Modares (10.1016/j.ins.2015.04.044_b0160) 2014; 50
Xu (10.1016/j.ins.2015.04.044_b0300) 2013; 24
Li (10.1016/j.ins.2015.04.044_b0105) 2014; 11
Wang (10.1016/j.ins.2015.04.044_b0255) 2014; 28
Zhang (10.1016/j.ins.2015.04.044_b0315) 2013; 43
Zhang (10.1016/j.ins.2015.04.044_b0335) 2008; 38
Si (10.1016/j.ins.2015.04.044_b0195) 2001; 12
References_xml – volume: 58
  start-page: 1511
  year: 2013
  end-page: 1523
  ident: b0220
  article-title: Robust synchronization of uncertain linear multi-agent systems
  publication-title: IEEE Trans. Autom. Contr.
– year: 1982
  ident: b0045
  article-title: Dynamic Noncooperative Game Theory
– volume: 24
  start-page: 2088
  year: 2014
  end-page: 2100
  ident: b0060
  article-title: An equivalence between adaptive dynamic programming with a critic and backpropagation through time
  publication-title: IEEE Trans. Neural Netw. Learn. Syst.
– reference: W.-Q. Wang, Carrier frequency synchronization in distributed wireless sensor networks, IEEE Syst. J. (2015) (in press) doi:10.1109/JSYST.2014.2330392.
– volume: 43
  start-page: 779
  year: 2013
  end-page: 789
  ident: b0135
  article-title: Finite-approximation-error-based optimal control approach for discrete-time nonlinear systems
  publication-title: IEEE Trans. Cybernet.
– volume: 25
  start-page: 882
  year: 2014
  end-page: 893
  ident: b0095
  article-title: Robust adaptive dynamic programming and feedback stabilization of nonlinear systems
  publication-title: IEEE Trans. Neural Netw. Learn. Syst.
– volume: 25
  start-page: 621
  year: 2014
  end-page: 634
  ident: b0140
  article-title: Policy iteration adaptive dynamic programming algorithm for discrete-time nonlinear systems
  publication-title: IEEE Trans. Neural Netw. Learn. Syst.
– volume: 11
  start-page: 1020
  year: 2014
  end-page: 1036
  ident: b0275
  article-title: Adaptive dynamic programming for optimal tracking control of unknown nonlinear systems with application to coal gasification
  publication-title: IEEE Trans. Autom. Sci. Eng.
– volume: 9
  start-page: 628
  year: 2012
  end-page: 634
  ident: b0120
  article-title: Neural-network-based optimal control for a class of unknown discrete-time nonlinear systems using globalized dual heuristic programming
  publication-title: IEEE Trans. Autom. Sci. Eng.
– volume: 51
  start-page: 1989
  year: 2006
  end-page: 1995
  ident: b0010
  article-title: Policy iterations on the Hamilton–Jacobi–Isaacs equation for state feedback control with input saturation
  publication-title: IEEE Trans. Autom. Contr.
– volume: 32
  start-page: 140
  year: 2002
  end-page: 153
  ident: b0170
  article-title: Adaptive dynamic programming
  publication-title: IEEE Trans. Syst. Man Cybernet. – Part C: Appl. Rev.
– volume: 25
  start-page: 1733
  year: 2014
  end-page: 1739
  ident: b0200
  article-title: Adaptive dynamic programming for a class of complex-valued nonlinear systems
  publication-title: IEEE Trans. Neural Netw. Learn. Syst.
– volume: 11
  start-page: 1176
  year: 2014
  end-page: 1190
  ident: b0270
  article-title: A novel iterative
  publication-title: IEEE Trans. Autom. Sci. Eng.
– volume: 47
  start-page: 207
  year: 2011
  end-page: 214
  ident: b0330
  article-title: An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
  publication-title: Automatica
– volume: 43
  start-page: 473
  year: 2007
  end-page: 481
  ident: b0025
  article-title: Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control
  publication-title: Automatica
– volume: 41
  start-page: 826
  year: 2002
  end-page: 850
  ident: b0190
  article-title: Limited Hamilton–Jacobi–Isaacs equations for singularly perturbed zero-sum dynamic (discrete time) games
  publication-title: SIAM J. Contr. Optimiz.
– volume: 50
  start-page: 1167
  year: 2014
  end-page: 1175
  ident: b0080
  article-title: Reinforcement-learning for optimal tracking control of linear discrete-time systems with unknown dynamics
  publication-title: Automatica
– volume: 286
  start-page: 209
  year: 2014
  end-page: 227
  ident: b0065
  article-title: Reinforcement learning with automatic basis construction based on isometric feature mapping
  publication-title: Inform. Sci.
– volume: 37
  start-page: 770
  year: 1992
  end-page: 784
  ident: b0240
  article-title: -gain analysis of nonlinear systems and nonlinear state feedback H control
  publication-title: IEEE Trans. Autom. Contr.
– year: 1995
  ident: b0040
  article-title: Optimal Control and Related Minimax Design Problems
– volume: 25
  start-page: 418
  year: 2014
  end-page: 428
  ident: b0115
  article-title: Decentralized stabilization for a class of continuous-time nonlinear interconnected systems using online learning optimal control approach
  publication-title: IEEE Trans. Neural Netw. Learn. Syst.
– volume: 14
  start-page: 929
  year: 2003
  end-page: 939
  ident: b0055
  article-title: Helicopter trimming and tracking control using direct neural dynamic programming
  publication-title: IEEE Trans. Neural Netw.
– volume: 9
  start-page: 1739
  year: 2013
  end-page: 1750
  ident: b0210
  article-title: Sensor networks with random link failures: distributed filtering for T–S fuzzy systems
  publication-title: IEEE Trans. Ind. Inform.
– year: 1996
  ident: b0050
  article-title: Neuro-Dynamic Programming
– volume: 28
  start-page: 205
  year: 2014
  end-page: 231
  ident: b0130
  article-title: Multi-person zero-sum differential games for a class of uncertain nonlinear systems
  publication-title: Int. J. Adapt. Contr. Signal Process.
– volume: 47
  start-page: 1556
  year: 2011
  end-page: 1569
  ident: b0230
  article-title: Multi-player non-zero-sum games: online adaptive learning solution of coupled Hamilton–Jacobi equations
  publication-title: Automatica
– volume: 10
  start-page: 1275
  year: 2014
  end-page: 1289
  ident: b0150
  article-title: Advances in distributed control for factory automation on ethernet technology
  publication-title: Int. J. Innovative Comput. Inform. Contr.
– volume: 9
  start-page: 1171
  year: 2013
  end-page: 1188
  ident: b0075
  article-title: ERMS: an evolutionary reorganizing multiagent system
  publication-title: Int. J. Innovative Comput. Inform. Contr.
– volume: 24
  start-page: 1150
  year: 2013
  end-page: 1156
  ident: b0090
  article-title: Robust adaptive dynamic programming with an application to power systems
  publication-title: IEEE Trans. Neural Netw. Learn. Syst.
– volume: 24
  start-page: 145
  year: 2013
  end-page: 157
  ident: b0070
  article-title: Finite-horizon control-constrained nonlinear optimal control using single network adaptive critics
  publication-title: IEEE Trans. Neural Netw. Learn. Syst.
– year: 1982
  ident: b0085
  article-title: Large-Scale Systems-Modeling and Control
– volume: 8
  start-page: 997
  year: 1997
  end-page: 1007
  ident: b0185
  article-title: Adaptive critic designs
  publication-title: IEEE Trans. Neural Netw.
– volume: 11
  start-page: 627
  year: 2014
  end-page: 632
  ident: b0250
  article-title: Policy iteration algorithm for online design of robust control for a class of continuous-time nonlinear systems
  publication-title: IEEE Trans. Autom. Sci. Eng.
– volume: 24
  start-page: 471
  year: 2013
  end-page: 484
  ident: b0300
  article-title: Stochastic optimal controller design for uncertain nonlinear networked control system via neuro dynamic programming
  publication-title: IEEE Trans. Neural Netw. Learn. Syst.
– year: 1982
  ident: b0180
  article-title: Game Theory
– volume: 28
  start-page: 167
  year: 2014
  end-page: 179
  ident: b0255
  article-title: Neural-network-based robust optimal control design for a class of uncertain nonlinear systems via adaptive dynamic programming
  publication-title: Inform. Sci.
– volume: 261
  start-page: 1
  year: 2014
  end-page: 31
  ident: b0310
  article-title: Reinforcement learning algorithms with function approximation: recent advances and applications
  publication-title: Inform. Sci.
– volume: 16
  start-page: 1219
  year: 2005
  end-page: 1228
  ident: b0145
  article-title: A self-learning call admission control scheme for CDMA cellular networks
  publication-title: IEEE Trans. Neural Netw.
– volume: 50
  start-page: 1780
  year: 2014
  end-page: 1792
  ident: b0155
  article-title: Optimal tracking control of nonlinear partially-unknown constrained-input systems using integral reinforcement learning
  publication-title: Automatica
– volume: 38
  start-page: 937
  year: 2008
  end-page: 942
  ident: b0335
  article-title: A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm
  publication-title: IEEE Trans. Syst. Man Cybernet. – Part B: Cybernet.
– year: 2013
  ident: b0225
  article-title: Optimal Adaptive Control and Differential Games by Reinforcement Learning Principles
– volume: 37
  start-page: 240
  year: 2007
  end-page: 247
  ident: b0020
  article-title: Adaptive critic designs for discrete-time zero-sum games with application to
  publication-title: IEEE Trans. Syst. Man Cybernet. – Part B: Cybernet.
– volume: 4
  start-page: 498
  year: 2013
  end-page: 508
  ident: b0165
  article-title: Intelligent local area signals based damping of power system oscillations using virtual generators and approximate dynamic programming
  publication-title: IEEE Trans. Smart Grid
– volume: 50
  start-page: 3268
  year: 2014
  end-page: 3275
  ident: b0205
  article-title: A novel approach to output feedback control of fuzzy stochastic systems
  publication-title: Automatica
– start-page: 67
  year: 1991
  end-page: 95
  ident: b0295
  article-title: A menu of designs for reinforcement learning over time
  publication-title: Neural Networks for Control
– volume: 38
  start-page: 943
  year: 2008
  end-page: 949
  ident: b0030
  article-title: Discrete-time nonlinear HJB solution using approximate dynamic programming: convergence proof
  publication-title: IEEE Trans. Syst. Man Cybernet. – Part B: Cybernet.
– volume: 50
  start-page: 193
  year: 2014
  end-page: 202
  ident: b0160
  article-title: Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems
  publication-title: Automatica
– volume: 12
  start-page: 264
  year: 2001
  end-page: 276
  ident: b0195
  article-title: On-line learning control by association and reinforcement
  publication-title: IEEE Trans. Neural Netw.
– volume: 28
  start-page: 2670
  year: 2013
  end-page: 2678
  ident: b0110
  article-title: Two-level dynamic stochastic optimal power flow control for power systems with intermittent renewable generation
  publication-title: IEEE Trans. Power Syst.
– volume: 7
  start-page: 1472
  year: 2013
  end-page: 1486
  ident: b0265
  article-title: Numerical adaptive learning control scheme for discrete-time nonlinear systems
  publication-title: IET Contr. Theory Appl.
– volume: 72
  start-page: 1839
  year: 2009
  end-page: 1848
  ident: b0285
  article-title: Model-free multiobjective approximate dynamic programming for discrete-time nonlinear systems with general performance index functions
  publication-title: Neurocomputing
– volume: 19
  start-page: 1243
  year: 2008
  end-page: 1252
  ident: b0015
  article-title: Neurodynamic programming and zero-sum games for constrained control systems
  publication-title: IEEE Trans. Neural Netw.
– volume: 32
  start-page: 76
  year: 2012
  end-page: 105
  ident: b0100
  article-title: Reinforcement learning and feedback control: using natural decision methods to design optimal adaptive controllers
  publication-title: IEEE Contr. Syst.
– volume: 44
  start-page: 2706
  year: 2014
  end-page: 2718
  ident: b0320
  article-title: Online adaptive policy learning algorithm for
  publication-title: IEEE Trans. Cybernet.
– volume: 11
  start-page: 839
  year: 2014
  end-page: 849
  ident: b0325
  article-title: Neural-network-based constrained optimal control scheme for discrete-time switched nonlinear system using dual heuristic programming
  publication-title: IEEE Trans. Autom. Sci. Eng.
– volume: 48
  start-page: 1598
  year: 2012
  end-page: 1611
  ident: b0235
  article-title: Multi-agent differential graphical games: online adaptive learning solution for synchronization with optimality
  publication-title: Automatica
– volume: 32
  start-page: 236
  year: 2012
  end-page: 244
  ident: b0260
  article-title: An iterative
  publication-title: Neural Netw.
– volume: 43
  start-page: 206
  year: 2013
  end-page: 216
  ident: b0315
  article-title: Near-optimal control for nonzero-sum differential games of continuous-time nonlinear systems using single-network ADP
  publication-title: IEEE Trans. Cybernet.
– volume: 43
  start-page: 358
  year: 2013
  end-page: 370
  ident: b0215
  article-title: Distributed synchronization in networks of agent systems with nonlinearities and random switchings
  publication-title: IEEE Trans. Cybernet.
– volume: 61
  start-page: 6399
  year: 2014
  end-page: 6408
  ident: b0280
  article-title: Data-driven neuro-optimal temperature control of water gas shift reaction using stable iterative adaptive dynamic programming
  publication-title: IEEE Trans. Ind. Electron.
– volume: 11
  start-page: 706
  year: 2014
  end-page: 714
  ident: b0105
  article-title: Integral reinforcement learning for linear continuous-time zero-sum games with completely unknown dynamics
  publication-title: IEEE Trans. Autom. Sci. Eng.
– volume: 24
  start-page: 913
  year: 2013
  end-page: 928
  ident: b0175
  article-title: Adaptive learning in tracking control based on the dual critic network design
  publication-title: IEEE Trans. Neural Netw. Learn. Syst.
– volume: 41
  start-page: 779
  year: 2005
  end-page: 791
  ident: b0005
  article-title: Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approch
  publication-title: Automatica
– volume: 25
  start-page: 635
  year: 2014
  end-page: 641
  ident: b0305
  article-title: Reinforcement learning output feedback NN control using deterministic learning technique
  publication-title: IEEE Trans. Neural Netw. Learn. Syst.
– volume: 220
  start-page: 331
  year: 2013
  end-page: 342
  ident: b0125
  article-title: An iterative adaptive dynamic programming algorithm for optimal control of unknown discrete-time nonlinear systems with constrained inputs
  publication-title: Inform. Sci.
– volume: 22
  start-page: 25
  year: 1977
  end-page: 38
  ident: b0290
  article-title: Advanced forecasting methods for global crisis warning and models of intelligence
  publication-title: Gen. Syst. Yearbook
– volume: 50
  start-page: 335
  year: 2014
  end-page: 348
  ident: b0035
  article-title: Internal structure of coalitions in competitive and altruistic graphical coalitional games
  publication-title: Automatica
– volume: 50
  start-page: 193
  year: 2014
  ident: 10.1016/j.ins.2015.04.044_b0160
  article-title: Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems
  publication-title: Automatica
  doi: 10.1016/j.automatica.2013.09.043
– volume: 25
  start-page: 1733
  year: 2014
  ident: 10.1016/j.ins.2015.04.044_b0200
  article-title: Adaptive dynamic programming for a class of complex-valued nonlinear systems
  publication-title: IEEE Trans. Neural Netw. Learn. Syst.
  doi: 10.1109/TNNLS.2014.2306201
– volume: 24
  start-page: 471
  year: 2013
  ident: 10.1016/j.ins.2015.04.044_b0300
  article-title: Stochastic optimal controller design for uncertain nonlinear networked control system via neuro dynamic programming
  publication-title: IEEE Trans. Neural Netw. Learn. Syst.
  doi: 10.1109/TNNLS.2012.2234133
– volume: 41
  start-page: 779
  year: 2005
  ident: 10.1016/j.ins.2015.04.044_b0005
  article-title: Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approch
  publication-title: Automatica
  doi: 10.1016/j.automatica.2004.11.034
– volume: 11
  start-page: 627
  year: 2014
  ident: 10.1016/j.ins.2015.04.044_b0250
  article-title: Policy iteration algorithm for online design of robust control for a class of continuous-time nonlinear systems
  publication-title: IEEE Trans. Autom. Sci. Eng.
  doi: 10.1109/TASE.2013.2296206
– volume: 24
  start-page: 913
  year: 2013
  ident: 10.1016/j.ins.2015.04.044_b0175
  article-title: Adaptive learning in tracking control based on the dual critic network design
  publication-title: IEEE Trans. Neural Netw. Learn. Syst.
  doi: 10.1109/TNNLS.2013.2247627
– volume: 37
  start-page: 770
  year: 1992
  ident: 10.1016/j.ins.2015.04.044_b0240
  article-title: L2-gain analysis of nonlinear systems and nonlinear state feedback H control
  publication-title: IEEE Trans. Autom. Contr.
  doi: 10.1109/9.256331
– volume: 261
  start-page: 1
  year: 2014
  ident: 10.1016/j.ins.2015.04.044_b0310
  article-title: Reinforcement learning algorithms with function approximation: recent advances and applications
  publication-title: Inform. Sci.
  doi: 10.1016/j.ins.2013.08.037
– volume: 38
  start-page: 937
  year: 2008
  ident: 10.1016/j.ins.2015.04.044_b0335
  article-title: A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm
  publication-title: IEEE Trans. Syst. Man Cybernet. – Part B: Cybernet.
  doi: 10.1109/TSMCB.2008.920269
– volume: 51
  start-page: 1989
  year: 2006
  ident: 10.1016/j.ins.2015.04.044_b0010
  article-title: Policy iterations on the Hamilton–Jacobi–Isaacs equation for state feedback control with input saturation
  publication-title: IEEE Trans. Autom. Contr.
  doi: 10.1109/TAC.2006.884959
– volume: 14
  start-page: 929
  year: 2003
  ident: 10.1016/j.ins.2015.04.044_b0055
  article-title: Helicopter trimming and tracking control using direct neural dynamic programming
  publication-title: IEEE Trans. Neural Netw.
  doi: 10.1109/TNN.2003.813839
– volume: 220
  start-page: 331
  year: 2013
  ident: 10.1016/j.ins.2015.04.044_b0125
  article-title: An iterative adaptive dynamic programming algorithm for optimal control of unknown discrete-time nonlinear systems with constrained inputs
  publication-title: Inform. Sci.
  doi: 10.1016/j.ins.2012.07.006
– volume: 32
  start-page: 140
  year: 2002
  ident: 10.1016/j.ins.2015.04.044_b0170
  article-title: Adaptive dynamic programming
  publication-title: IEEE Trans. Syst. Man Cybernet. – Part C: Appl. Rev.
  doi: 10.1109/TSMCC.2002.801727
– volume: 44
  start-page: 2706
  year: 2014
  ident: 10.1016/j.ins.2015.04.044_b0320
  article-title: Online adaptive policy learning algorithm for H∞ state feedback control of unknown affine nonlinear discrete-time systems
  publication-title: IEEE Trans. Cybernet.
  doi: 10.1109/TCYB.2014.2313915
– volume: 9
  start-page: 1171
  year: 2013
  ident: 10.1016/j.ins.2015.04.044_b0075
  article-title: ERMS: an evolutionary reorganizing multiagent system
  publication-title: Int. J. Innovative Comput. Inform. Contr.
– volume: 43
  start-page: 779
  year: 2013
  ident: 10.1016/j.ins.2015.04.044_b0135
  article-title: Finite-approximation-error-based optimal control approach for discrete-time nonlinear systems
  publication-title: IEEE Trans. Cybernet.
  doi: 10.1109/TSMCB.2012.2216523
– volume: 19
  start-page: 1243
  year: 2008
  ident: 10.1016/j.ins.2015.04.044_b0015
  article-title: Neurodynamic programming and zero-sum games for constrained control systems
  publication-title: IEEE Trans. Neural Netw.
  doi: 10.1109/TNN.2008.2000204
– volume: 10
  start-page: 1275
  year: 2014
  ident: 10.1016/j.ins.2015.04.044_b0150
  article-title: Advances in distributed control for factory automation on ethernet technology
  publication-title: Int. J. Innovative Comput. Inform. Contr.
– year: 1996
  ident: 10.1016/j.ins.2015.04.044_b0050
– volume: 47
  start-page: 1556
  year: 2011
  ident: 10.1016/j.ins.2015.04.044_b0230
  article-title: Multi-player non-zero-sum games: online adaptive learning solution of coupled Hamilton–Jacobi equations
  publication-title: Automatica
  doi: 10.1016/j.automatica.2011.03.005
– start-page: 67
  year: 1991
  ident: 10.1016/j.ins.2015.04.044_b0295
  article-title: A menu of designs for reinforcement learning over time
– volume: 32
  start-page: 76
  year: 2012
  ident: 10.1016/j.ins.2015.04.044_b0100
  article-title: Reinforcement learning and feedback control: using natural decision methods to design optimal adaptive controllers
  publication-title: IEEE Contr. Syst.
  doi: 10.1109/MCS.2012.2214134
– volume: 9
  start-page: 628
  year: 2012
  ident: 10.1016/j.ins.2015.04.044_b0120
  article-title: Neural-network-based optimal control for a class of unknown discrete-time nonlinear systems using globalized dual heuristic programming
  publication-title: IEEE Trans. Autom. Sci. Eng.
  doi: 10.1109/TASE.2012.2198057
– volume: 25
  start-page: 621
  year: 2014
  ident: 10.1016/j.ins.2015.04.044_b0140
  article-title: Policy iteration adaptive dynamic programming algorithm for discrete-time nonlinear systems
  publication-title: IEEE Trans. Neural Netw. Learn. Syst.
  doi: 10.1109/TNNLS.2013.2281663
– volume: 47
  start-page: 207
  year: 2011
  ident: 10.1016/j.ins.2015.04.044_b0330
  article-title: An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
  publication-title: Automatica
  doi: 10.1016/j.automatica.2010.10.033
– volume: 28
  start-page: 205
  year: 2014
  ident: 10.1016/j.ins.2015.04.044_b0130
  article-title: Multi-person zero-sum differential games for a class of uncertain nonlinear systems
  publication-title: Int. J. Adapt. Contr. Signal Process.
  doi: 10.1002/acs.2349
– volume: 50
  start-page: 3268
  year: 2014
  ident: 10.1016/j.ins.2015.04.044_b0205
  article-title: A novel approach to output feedback control of fuzzy stochastic systems
  publication-title: Automatica
  doi: 10.1016/j.automatica.2014.10.053
– volume: 11
  start-page: 1020
  year: 2014
  ident: 10.1016/j.ins.2015.04.044_b0275
  article-title: Adaptive dynamic programming for optimal tracking control of unknown nonlinear systems with application to coal gasification
  publication-title: IEEE Trans. Autom. Sci. Eng.
  doi: 10.1109/TASE.2013.2284545
– volume: 22
  start-page: 25
  year: 1977
  ident: 10.1016/j.ins.2015.04.044_b0290
  article-title: Advanced forecasting methods for global crisis warning and models of intelligence
  publication-title: Gen. Syst. Yearbook
– volume: 72
  start-page: 1839
  year: 2009
  ident: 10.1016/j.ins.2015.04.044_b0285
  article-title: Model-free multiobjective approximate dynamic programming for discrete-time nonlinear systems with general performance index functions
  publication-title: Neurocomputing
  doi: 10.1016/j.neucom.2008.05.012
– volume: 25
  start-page: 418
  year: 2014
  ident: 10.1016/j.ins.2015.04.044_b0115
  article-title: Decentralized stabilization for a class of continuous-time nonlinear interconnected systems using online learning optimal control approach
  publication-title: IEEE Trans. Neural Netw. Learn. Syst.
  doi: 10.1109/TNNLS.2013.2280013
– volume: 58
  start-page: 1511
  year: 2013
  ident: 10.1016/j.ins.2015.04.044_b0220
  article-title: Robust synchronization of uncertain linear multi-agent systems
  publication-title: IEEE Trans. Autom. Contr.
  doi: 10.1109/TAC.2013.2239011
– volume: 41
  start-page: 826
  year: 2002
  ident: 10.1016/j.ins.2015.04.044_b0190
  article-title: Limited Hamilton–Jacobi–Isaacs equations for singularly perturbed zero-sum dynamic (discrete time) games
  publication-title: SIAM J. Contr. Optimiz.
  doi: 10.1137/S036301290037908X
– volume: 61
  start-page: 6399
  year: 2014
  ident: 10.1016/j.ins.2015.04.044_b0280
  article-title: Data-driven neuro-optimal temperature control of water gas shift reaction using stable iterative adaptive dynamic programming
  publication-title: IEEE Trans. Ind. Electron.
  doi: 10.1109/TIE.2014.2301770
– year: 2013
  ident: 10.1016/j.ins.2015.04.044_b0225
– volume: 24
  start-page: 145
  year: 2013
  ident: 10.1016/j.ins.2015.04.044_b0070
  article-title: Finite-horizon control-constrained nonlinear optimal control using single network adaptive critics
  publication-title: IEEE Trans. Neural Netw. Learn. Syst.
  doi: 10.1109/TNNLS.2012.2227339
– volume: 50
  start-page: 1780
  year: 2014
  ident: 10.1016/j.ins.2015.04.044_b0155
  article-title: Optimal tracking control of nonlinear partially-unknown constrained-input systems using integral reinforcement learning
  publication-title: Automatica
  doi: 10.1016/j.automatica.2014.05.011
– volume: 11
  start-page: 839
  year: 2014
  ident: 10.1016/j.ins.2015.04.044_b0325
  article-title: Neural-network-based constrained optimal control scheme for discrete-time switched nonlinear system using dual heuristic programming
  publication-title: IEEE Trans. Autom. Sci. Eng.
  doi: 10.1109/TASE.2014.2303139
– volume: 50
  start-page: 335
  year: 2014
  ident: 10.1016/j.ins.2015.04.044_b0035
  article-title: Internal structure of coalitions in competitive and altruistic graphical coalitional games
  publication-title: Automatica
  doi: 10.1016/j.automatica.2013.11.002
– year: 1982
  ident: 10.1016/j.ins.2015.04.044_b0085
– volume: 8
  start-page: 997
  year: 1997
  ident: 10.1016/j.ins.2015.04.044_b0185
  article-title: Adaptive critic designs
  publication-title: IEEE Trans. Neural Netw.
  doi: 10.1109/72.623201
– volume: 48
  start-page: 1598
  year: 2012
  ident: 10.1016/j.ins.2015.04.044_b0235
  article-title: Multi-agent differential graphical games: online adaptive learning solution for synchronization with optimality
  publication-title: Automatica
  doi: 10.1016/j.automatica.2012.05.074
– volume: 11
  start-page: 1176
  year: 2014
  ident: 10.1016/j.ins.2015.04.044_b0270
  article-title: A novel iterative θ-adaptive dynamic programming for discrete-time nonlinear systems
  publication-title: IEEE Trans. Autom. Sci. Eng.
  doi: 10.1109/TASE.2013.2280974
– volume: 24
  start-page: 2088
  year: 2014
  ident: 10.1016/j.ins.2015.04.044_b0060
  article-title: An equivalence between adaptive dynamic programming with a critic and backpropagation through time
  publication-title: IEEE Trans. Neural Netw. Learn. Syst.
  doi: 10.1109/TNNLS.2013.2271778
– volume: 12
  start-page: 264
  year: 2001
  ident: 10.1016/j.ins.2015.04.044_b0195
  article-title: On-line learning control by association and reinforcement
  publication-title: IEEE Trans. Neural Netw.
  doi: 10.1109/72.914523
– volume: 43
  start-page: 358
  year: 2013
  ident: 10.1016/j.ins.2015.04.044_b0215
  article-title: Distributed synchronization in networks of agent systems with nonlinearities and random switchings
  publication-title: IEEE Trans. Cybernet.
  doi: 10.1109/TSMCB.2012.2207718
– volume: 11
  start-page: 706
  year: 2014
  ident: 10.1016/j.ins.2015.04.044_b0105
  article-title: Integral reinforcement learning for linear continuous-time zero-sum games with completely unknown dynamics
  publication-title: IEEE Trans. Autom. Sci. Eng.
  doi: 10.1109/TASE.2014.2300532
– year: 1982
  ident: 10.1016/j.ins.2015.04.044_b0180
– volume: 38
  start-page: 943
  year: 2008
  ident: 10.1016/j.ins.2015.04.044_b0030
  article-title: Discrete-time nonlinear HJB solution using approximate dynamic programming: convergence proof
  publication-title: IEEE Trans. Syst. Man Cybernet. – Part B: Cybernet.
  doi: 10.1109/TSMCB.2008.926614
– volume: 9
  start-page: 1739
  year: 2013
  ident: 10.1016/j.ins.2015.04.044_b0210
  article-title: Sensor networks with random link failures: distributed filtering for T–S fuzzy systems
  publication-title: IEEE Trans. Ind. Inform.
  doi: 10.1109/TII.2012.2231085
– volume: 43
  start-page: 473
  year: 2007
  ident: 10.1016/j.ins.2015.04.044_b0025
  article-title: Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control
  publication-title: Automatica
  doi: 10.1016/j.automatica.2006.09.019
– year: 1982
  ident: 10.1016/j.ins.2015.04.044_b0045
– volume: 32
  start-page: 236
  year: 2012
  ident: 10.1016/j.ins.2015.04.044_b0260
  article-title: An iterative ∊-optimal control scheme for a class of discrete-time nonlinear systems with unfixed initial state
  publication-title: Neural Netw.
  doi: 10.1016/j.neunet.2012.02.027
– volume: 43
  start-page: 206
  year: 2013
  ident: 10.1016/j.ins.2015.04.044_b0315
  article-title: Near-optimal control for nonzero-sum differential games of continuous-time nonlinear systems using single-network ADP
  publication-title: IEEE Trans. Cybernet.
  doi: 10.1109/TSMCB.2012.2203336
– volume: 286
  start-page: 209
  year: 2014
  ident: 10.1016/j.ins.2015.04.044_b0065
  article-title: Reinforcement learning with automatic basis construction based on isometric feature mapping
  publication-title: Inform. Sci.
  doi: 10.1016/j.ins.2014.07.008
– volume: 50
  start-page: 1167
  year: 2014
  ident: 10.1016/j.ins.2015.04.044_b0080
  article-title: Reinforcement-learning for optimal tracking control of linear discrete-time systems with unknown dynamics
  publication-title: Automatica
  doi: 10.1016/j.automatica.2014.02.015
– volume: 24
  start-page: 1150
  year: 2013
  ident: 10.1016/j.ins.2015.04.044_b0090
  article-title: Robust adaptive dynamic programming with an application to power systems
  publication-title: IEEE Trans. Neural Netw. Learn. Syst.
  doi: 10.1109/TNNLS.2013.2249668
– volume: 25
  start-page: 882
  year: 2014
  ident: 10.1016/j.ins.2015.04.044_b0095
  article-title: Robust adaptive dynamic programming and feedback stabilization of nonlinear systems
  publication-title: IEEE Trans. Neural Netw. Learn. Syst.
  doi: 10.1109/TNNLS.2013.2294968
– year: 1995
  ident: 10.1016/j.ins.2015.04.044_b0040
– volume: 16
  start-page: 1219
  year: 2005
  ident: 10.1016/j.ins.2015.04.044_b0145
  article-title: A self-learning call admission control scheme for CDMA cellular networks
  publication-title: IEEE Trans. Neural Netw.
  doi: 10.1109/TNN.2005.853408
– volume: 28
  start-page: 2670
  year: 2013
  ident: 10.1016/j.ins.2015.04.044_b0110
  article-title: Two-level dynamic stochastic optimal power flow control for power systems with intermittent renewable generation
  publication-title: IEEE Trans. Power Syst.
  doi: 10.1109/TPWRS.2013.2237793
– volume: 28
  start-page: 167
  year: 2014
  ident: 10.1016/j.ins.2015.04.044_b0255
  article-title: Neural-network-based robust optimal control design for a class of uncertain nonlinear systems via adaptive dynamic programming
  publication-title: Inform. Sci.
  doi: 10.1016/j.ins.2014.05.050
– volume: 25
  start-page: 635
  year: 2014
  ident: 10.1016/j.ins.2015.04.044_b0305
  article-title: Reinforcement learning output feedback NN control using deterministic learning technique
  publication-title: IEEE Trans. Neural Netw. Learn. Syst.
  doi: 10.1109/TNNLS.2013.2292704
– volume: 4
  start-page: 498
  year: 2013
  ident: 10.1016/j.ins.2015.04.044_b0165
  article-title: Intelligent local area signals based damping of power system oscillations using virtual generators and approximate dynamic programming
  publication-title: IEEE Trans. Smart Grid
  doi: 10.1109/TSG.2012.2233224
– ident: 10.1016/j.ins.2015.04.044_b0245
  doi: 10.1109/JSYST.2014.2330392
– volume: 37
  start-page: 240
  year: 2007
  ident: 10.1016/j.ins.2015.04.044_b0020
  article-title: Adaptive critic designs for discrete-time zero-sum games with application to H∞ control
  publication-title: IEEE Trans. Syst. Man Cybernet. – Part B: Cybernet.
  doi: 10.1109/TSMCB.2006.880135
– volume: 7
  start-page: 1472
  year: 2013
  ident: 10.1016/j.ins.2015.04.044_b0265
  article-title: Numerical adaptive learning control scheme for discrete-time nonlinear systems
  publication-title: IET Contr. Theory Appl.
  doi: 10.1049/iet-cta.2012.0486
SSID ssj0004766
Score 2.4841194
Snippet In this paper, a new optimal distributed synchronization control scheme for the consensus problem of heterogeneous multi-agent differential graphical games is...
SourceID proquest
crossref
elsevier
SourceType Aggregation Database
Enrichment Source
Index Database
Publisher
StartPage 96
SubjectTerms Adaptive critic designs
Adaptive dynamic programming
Algorithms
Approximate dynamic programming
Dynamics
Games
Graphical games
Heterogeneous multi-agents
Multiagent systems
Optimization
Policies
Policy iteration
Synchronism
Synchronization
Title Optimal distributed synchronization control for continuous-time heterogeneous multi-agent differential graphical games
URI https://dx.doi.org/10.1016/j.ins.2015.04.044
https://www.proquest.com/docview/1770353198
Volume 317
WOSCitedRecordID wos000358093400006&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVESC
  databaseName: Elsevier SD Freedom Collection Journals 2021
  customDbUrl:
  eissn: 1872-6291
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0004766
  issn: 0020-0255
  databaseCode: AIEXJ
  dateStart: 19950101
  isFulltext: true
  titleUrlDefault: https://www.sciencedirect.com
  providerName: Elsevier
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3da9UwFA_XOx_0QXQqTp1EEB8clbRNmuRxyERFpsLE-1aSph29zN6x--H8D_yzPfnquopDBeFSSrlJS84v55wkv3MOQs8K1lBp8jypJUw3AAVLtKEiKUymOLgjUmjhik3ww0Mxm8mPk8mPGAuzOeFdJ87P5el_FTU8A2Hb0Nm_EHffKTyAexA6XEHscP0jwX8AJfDVH734albgUi6_d5XLguuDLnuCuuUY2vu2Wy_Wy8QWmgfXEYZ6Ad3Xlh3rCIeJsgFYfTGVld1ld5munYSPVYwimUdafB8SuRcsbO-5f6kdfeBTa4uHtD0dqF175QffeHzBEfrmEyC4wvJ7718OdyhS1nPdwrZZDJ25xOy0fmpiFzTeEHntK3iWFJkv3xXVc-5jO4OClcXAVKc-jPUXK-A3JOawdLEJ2VPmktn6NJOj5Nr2rNotq8AvJuDJFtfQVsaZFFO0tf_2YPbuIsaW-3Pv-NnxhNxxBUcv-p2PM7L2zoU5uo1uhbUH3veYuYMmdbeNbg4yUm6j3RDHgp_jgRRxsAB30SagCw_QhUfowgFdGNrjEbrwJXThAbrwEF24Rxd26LqHPr8-OHr1JgmlO5Iqz8kqoVJTQ2jaEKJEIyphZJZxxQqdsppJRjRPJVUNMXWtBJEqVQXXTS5MkzUCbMZ9NO0WXf0AYa1UkzFNOTMFTY2WlldZGWOzLAndmB1E4mCXVchrb8urnJSRwDgvQT6llU9JKPzoDnrRNzn1SV2u-jONEizDnPHeZglwu6rZ0yjtEjS2PYZTbmjLlIOVtaZPPPy3rh-hG5mFoZtmj9F0dbaud9H1arNql2dPAnB_AvJgw0w
linkProvider Elsevier
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Optimal+distributed+synchronization+control+for+continuous-time+heterogeneous+multi-agent+differential+graphical+games&rft.jtitle=Information+sciences&rft.au=Wei%2C+Qinglai&rft.au=Liu%2C+Derong&rft.au=Lewis%2C+Frank+L.&rft.date=2015-10-01&rft.pub=Elsevier+Inc&rft.issn=0020-0255&rft.eissn=1872-6291&rft.volume=317&rft.spage=96&rft.epage=113&rft_id=info:doi/10.1016%2Fj.ins.2015.04.044&rft.externalDocID=S0020025515003266
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0020-0255&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0020-0255&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0020-0255&client=summon