Optimal distributed synchronization control for continuous-time heterogeneous multi-agent differential graphical games

Bibliographic Details
Published in:	Information sciences Vol. 317; pp. 96 - 113
Main Authors:	Wei, Qinglai; Liu, Derong; Lewis, Frank L.
Format:	Journal Article
Language:	English
Published:	Elsevier Inc 01.10.2015
Subjects:	Adaptive critic designs; Adaptive dynamic programming; Algorithms; Approximate dynamic programming; Dynamics; Games; Graphical games; Heterogeneous multi-agents; Multiagent systems; Optimization; Policies; Policy iteration; Synchronism; Synchronization
ISSN:	0020-0255, 1872-6291

Abstract	This paper develops a new optimal distributed synchronization control scheme for the consensus problem of heterogeneous multi-agent differential graphical games using iterative adaptive dynamic programming (ADP). The main idea is to use the iterative ADP technique to obtain an iterative control law that makes all agents track a given dynamics while the iterative value functions converge to the Nash equilibrium. In the heterogeneous multi-agent differential graphical games considered, the agents at different nodes differ from one another, and the dynamics and performance index function of each node depend only on local neighborhood information. A cooperative policy iteration algorithm is presented to obtain the optimal distributed synchronization control law for the agent at each node, so that the coupled Hamilton–Jacobi equations for optimal synchronization control of heterogeneous multi-agent differential games are avoided. A convergence analysis shows that the iterative value functions of the heterogeneous multi-agent differential graphical games converge to the Nash equilibrium. Two simulation examples demonstrate the effectiveness of the developed optimal control scheme.
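Illustrative note (not taken from the article): the abstract names policy iteration as the core technique. The sketch below shows the basic evaluate-then-improve loop on a single scalar continuous-time linear-quadratic problem. It is not the cooperative multi-agent algorithm of the paper. The system dx/dt = a*x + b*u, the cost integrand q*x^2 + r*u^2, and the function names `policy_iteration_scalar_lqr` and `riccati_solution` are all assumptions chosen for illustration.

```python
import math


def policy_iteration_scalar_lqr(a, b, q, r, k0, iterations=20, tol=1e-12):
    """Policy iteration for dx/dt = a*x + b*u with cost integrand q*x^2 + r*u^2.

    Hypothetical single-agent illustration; the policy is u = -k*x.
    Returns the final gain and the list of value coefficients p per iteration.
    """
    if a - b * k0 >= 0:
        raise ValueError("initial gain k0 must be stabilizing (a - b*k0 < 0)")
    k = k0
    history = []
    for _ in range(iterations):
        # Policy evaluation: V(x) = p*x^2 solves 2*(a - b*k)*p + q + r*k^2 = 0.
        p = (q + r * k * k) / (2.0 * (b * k - a))
        history.append(p)
        # Policy improvement: minimizing the Hamiltonian gives k = b*p/r.
        k_next = b * p / r
        converged = abs(k_next - k) < tol
        k = k_next
        if converged:
            break
    return k, history


def riccati_solution(a, b, q, r):
    """Positive root of the scalar Riccati equation 2*a*p - (b^2/r)*p^2 + q = 0."""
    return r * (a + math.sqrt(a * a + b * b * q / r)) / (b * b)


if __name__ == "__main__":
    gain, values = policy_iteration_scalar_lqr(1.0, 1.0, 1.0, 1.0, k0=2.0)
    print("value coefficients per iteration:", values)
    print("Riccati solution:", riccati_solution(1.0, 1.0, 1.0, 1.0))
```

In this toy setting the value coefficients decrease toward the Riccati solution, which mirrors, in a much simpler form, the kind of convergence property the abstract claims for the iterative value functions.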
Author	Lewis, Frank L.; Wei, Qinglai; Liu, Derong
Author_xml	– sequence: 1 givenname: Qinglai surname: Wei fullname: Wei, Qinglai email: qinglai.wei@ia.ac.cn organization: The State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China – sequence: 2 givenname: Derong surname: Liu fullname: Liu, Derong email: derong.liu@ia.ac.cn organization: The State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China – sequence: 3 givenname: Frank L. surname: Lewis fullname: Lewis, Frank L. email: lewis@uta.edu organization: UTA Research Institute, University of Texas at Arlington, Fort Worth, TX, USA
ContentType	Journal Article
Copyright	2015 Elsevier Inc.
Copyright_xml	– notice: 2015 Elsevier Inc.
DBID	AAYXX CITATION 7SC 8FD JQ2 L7M L~C L~D
DOI	10.1016/j.ins.2015.04.044
DatabaseName	CrossRef Computer and Information Systems Abstracts Technology Research Database ProQuest Computer Science Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional
DatabaseTitle	CrossRef Computer and Information Systems Abstracts Technology Research Database Computer and Information Systems Abstracts – Academic Advanced Technologies Database with Aerospace ProQuest Computer Science Collection Computer and Information Systems Abstracts Professional
DatabaseTitleList	Computer and Information Systems Abstracts
DeliveryMethod	fulltext_linktorsrc
Discipline	Engineering Library & Information Science
EISSN	1872-6291
EndPage	113
ExternalDocumentID	10_1016_j_ins_2015_04_044 S0020025515003266
ISICitedReferencesCount	107
ISSN	0020-0255
IsPeerReviewed	true
IsScholarly	true
Keywords	Approximate dynamic programming; Policy iteration; Adaptive dynamic programming; Graphical games; Adaptive critic designs; Heterogeneous multi-agents
Language	English
LinkModel	OpenURL
PageCount	18
PublicationCentury	2000
PublicationDate	2015-10-01 2015-10-00 20151001
PublicationDateYYYYMMDD	2015-10-01
PublicationDate_xml	– month: 10 year: 2015 text: 2015-10-01 day: 01
PublicationDecade	2010
PublicationTitle	Information sciences
PublicationYear	2015
Publisher	Elsevier Inc
Publisher_xml	– name: Elsevier Inc
doi: 10.1109/TAC.2006.884959 – volume: 14 start-page: 929 year: 2003 ident: 10.1016/j.ins.2015.04.044_b0055 article-title: Helicopter trimming and tracking control using direct neural dynamic programming publication-title: IEEE Trans. Neural Netw. doi: 10.1109/TNN.2003.813839 – volume: 220 start-page: 331 year: 2013 ident: 10.1016/j.ins.2015.04.044_b0125 article-title: An iterative adaptive dynamic programming algorithm for optimal control of unknown discrete-time nonlinear systems with constrained inputs publication-title: Inform. Sci. doi: 10.1016/j.ins.2012.07.006 – volume: 32 start-page: 140 year: 2002 ident: 10.1016/j.ins.2015.04.044_b0170 article-title: Adaptive dynamic programming publication-title: IEEE Trans. Syst. Man Cybernet. – Part C: Appl. Rev. doi: 10.1109/TSMCC.2002.801727 – volume: 44 start-page: 2706 year: 2014 ident: 10.1016/j.ins.2015.04.044_b0320 article-title: Online adaptive policy learning algorithm for H∞ state feedback control of unknown affine nonlinear discrete-time systems publication-title: IEEE Trans. Cybernet. doi: 10.1109/TCYB.2014.2313915 – volume: 9 start-page: 1171 year: 2013 ident: 10.1016/j.ins.2015.04.044_b0075 article-title: ERMS: an evolutionary reorganizing multiagent system publication-title: Int. J. Innovative Comput. Inform. Contr. – volume: 43 start-page: 779 year: 2013 ident: 10.1016/j.ins.2015.04.044_b0135 article-title: Finite-approximation-error-based optimal control approach for discrete-time nonlinear systems publication-title: IEEE Trans. Cybernet. doi: 10.1109/TSMCB.2012.2216523 – volume: 19 start-page: 1243 year: 2008 ident: 10.1016/j.ins.2015.04.044_b0015 article-title: Neurodynamic programming and zero-sum games for constrained control systems publication-title: IEEE Trans. Neural Netw. doi: 10.1109/TNN.2008.2000204 – volume: 10 start-page: 1275 year: 2014 ident: 10.1016/j.ins.2015.04.044_b0150 article-title: Advances in distributed control for factory automation on ethernet technology publication-title: Int. 
J. Innovative Comput. Inform. Contr. – year: 1996 ident: 10.1016/j.ins.2015.04.044_b0050 – volume: 47 start-page: 1556 year: 2011 ident: 10.1016/j.ins.2015.04.044_b0230 article-title: Multi-player non-zero-sum games: online adaptive learning solution of coupled Hamilton–Jacobi equations publication-title: Automatica doi: 10.1016/j.automatica.2011.03.005 – start-page: 67 year: 1991 ident: 10.1016/j.ins.2015.04.044_b0295 article-title: A menu of designs for reinforcement learning over time – volume: 32 start-page: 76 year: 2012 ident: 10.1016/j.ins.2015.04.044_b0100 article-title: Reinforcement learning and feedback control: using natural decision methods to design optimal adaptive controllers publication-title: IEEE Contr. Syst. doi: 10.1109/MCS.2012.2214134 – volume: 9 start-page: 628 year: 2012 ident: 10.1016/j.ins.2015.04.044_b0120 article-title: Neural-network-based optimal control for a class of unknown discrete-time nonlinear systems using globalized dual heuristic programming publication-title: IEEE Trans. Autom. Sci. Eng. doi: 10.1109/TASE.2012.2198057 – volume: 25 start-page: 621 year: 2014 ident: 10.1016/j.ins.2015.04.044_b0140 article-title: Policy iteration adaptive dynamic programming algorithm for discrete-time nonlinear systems publication-title: IEEE Trans. Neural Netw. Learn. Syst. doi: 10.1109/TNNLS.2013.2281663 – volume: 47 start-page: 207 year: 2011 ident: 10.1016/j.ins.2015.04.044_b0330 article-title: An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games publication-title: Automatica doi: 10.1016/j.automatica.2010.10.033 – volume: 28 start-page: 205 year: 2014 ident: 10.1016/j.ins.2015.04.044_b0130 article-title: Multi-person zero-sum differential games for a class of uncertain nonlinear systems publication-title: Int. J. Adapt. Contr. Signal Process. 
doi: 10.1002/acs.2349 – volume: 50 start-page: 3268 year: 2014 ident: 10.1016/j.ins.2015.04.044_b0205 article-title: A novel approach to output feedback control of fuzzy stochastic systems publication-title: Automatica doi: 10.1016/j.automatica.2014.10.053 – volume: 11 start-page: 1020 year: 2014 ident: 10.1016/j.ins.2015.04.044_b0275 article-title: Adaptive dynamic programming for optimal tracking control of unknown nonlinear systems with application to coal gasification publication-title: IEEE Trans. Autom. Sci. Eng. doi: 10.1109/TASE.2013.2284545 – volume: 22 start-page: 25 year: 1977 ident: 10.1016/j.ins.2015.04.044_b0290 article-title: Advanced forecasting methods for global crisis warning and models of intelligence publication-title: Gen. Syst. Yearbook – volume: 72 start-page: 1839 year: 2009 ident: 10.1016/j.ins.2015.04.044_b0285 article-title: Model-free multiobjective approximate dynamic programming for discrete-time nonlinear systems with general performance index functions publication-title: Neurocomputing doi: 10.1016/j.neucom.2008.05.012 – volume: 25 start-page: 418 year: 2014 ident: 10.1016/j.ins.2015.04.044_b0115 article-title: Decentralized stabilization for a class of continuous-time nonlinear interconnected systems using online learning optimal control approach publication-title: IEEE Trans. Neural Netw. Learn. Syst. doi: 10.1109/TNNLS.2013.2280013 – volume: 58 start-page: 1511 year: 2013 ident: 10.1016/j.ins.2015.04.044_b0220 article-title: Robust synchronization of uncertain linear multi-agent systems publication-title: IEEE Trans. Autom. Contr. doi: 10.1109/TAC.2013.2239011 – volume: 41 start-page: 826 year: 2002 ident: 10.1016/j.ins.2015.04.044_b0190 article-title: Limited Hamilton–Jacobi–Isaacs equations for singularly perturbed zero-sum dynamic (discrete time) games publication-title: SIAM J. Contr. Optimiz. 
doi: 10.1137/S036301290037908X – volume: 61 start-page: 6399 year: 2014 ident: 10.1016/j.ins.2015.04.044_b0280 article-title: Data-driven neuro-optimal temperature control of water gas shift reaction using stable iterative adaptive dynamic programming publication-title: IEEE Trans. Ind. Electron. doi: 10.1109/TIE.2014.2301770 – year: 2013 ident: 10.1016/j.ins.2015.04.044_b0225 – volume: 24 start-page: 145 year: 2013 ident: 10.1016/j.ins.2015.04.044_b0070 article-title: Finite-horizon control-constrained nonlinear optimal control using single network adaptive critics publication-title: IEEE Trans. Neural Netw. Learn. Syst. doi: 10.1109/TNNLS.2012.2227339 – volume: 50 start-page: 1780 year: 2014 ident: 10.1016/j.ins.2015.04.044_b0155 article-title: Optimal tracking control of nonlinear partially-unknown constrained-input systems using integral reinforcement learning publication-title: Automatica doi: 10.1016/j.automatica.2014.05.011 – volume: 11 start-page: 839 year: 2014 ident: 10.1016/j.ins.2015.04.044_b0325 article-title: Neural-network-based constrained optimal control scheme for discrete-time switched nonlinear system using dual heuristic programming publication-title: IEEE Trans. Autom. Sci. Eng. doi: 10.1109/TASE.2014.2303139 – volume: 50 start-page: 335 year: 2014 ident: 10.1016/j.ins.2015.04.044_b0035 article-title: Internal structure of coalitions in competitive and altruistic graphical coalitional games publication-title: Automatica doi: 10.1016/j.automatica.2013.11.002 – year: 1982 ident: 10.1016/j.ins.2015.04.044_b0085 – volume: 8 start-page: 997 year: 1997 ident: 10.1016/j.ins.2015.04.044_b0185 article-title: Adaptive critic designs publication-title: IEEE Trans. Neural Netw. 
doi: 10.1109/72.623201 – volume: 48 start-page: 1598 year: 2012 ident: 10.1016/j.ins.2015.04.044_b0235 article-title: Multi-agent differential graphical games: online adaptive learning solution for synchronization with optimality publication-title: Automatica doi: 10.1016/j.automatica.2012.05.074 – volume: 11 start-page: 1176 year: 2014 ident: 10.1016/j.ins.2015.04.044_b0270 article-title: A novel iterative θ-adaptive dynamic programming for discrete-time nonlinear systems publication-title: IEEE Trans. Autom. Sci. Eng. doi: 10.1109/TASE.2013.2280974 – volume: 24 start-page: 2088 year: 2014 ident: 10.1016/j.ins.2015.04.044_b0060 article-title: An equivalence between adaptive dynamic programming with a critic and backpropagation through time publication-title: IEEE Trans. Neural Netw. Learn. Syst. doi: 10.1109/TNNLS.2013.2271778 – volume: 12 start-page: 264 year: 2001 ident: 10.1016/j.ins.2015.04.044_b0195 article-title: On-line learning control by association and reinforcement publication-title: IEEE Trans. Neural Netw. doi: 10.1109/72.914523 – volume: 43 start-page: 358 year: 2013 ident: 10.1016/j.ins.2015.04.044_b0215 article-title: Distributed synchronization in networks of agent systems with nonlinearities and random switchings publication-title: IEEE Trans. Cybernet. doi: 10.1109/TSMCB.2012.2207718 – volume: 11 start-page: 706 year: 2014 ident: 10.1016/j.ins.2015.04.044_b0105 article-title: Integral reinforcement learning for linear continuous-time zero-sum games with completely unknown dynamics publication-title: IEEE Trans. Autom. Sci. Eng. doi: 10.1109/TASE.2014.2300532 – year: 1982 ident: 10.1016/j.ins.2015.04.044_b0180 – volume: 38 start-page: 943 year: 2008 ident: 10.1016/j.ins.2015.04.044_b0030 article-title: Discrete-time nonlinear HJB solution using approximate dynamic programming: convergence proof publication-title: IEEE Trans. Syst. Man Cybernet. – Part B: Cybernet. 
doi: 10.1109/TSMCB.2008.926614 – volume: 9 start-page: 1739 year: 2013 ident: 10.1016/j.ins.2015.04.044_b0210 article-title: Sensor networks with random link failures: distributed filtering for T–S fuzzy systems publication-title: IEEE Trans. Ind. Inform. doi: 10.1109/TII.2012.2231085 – volume: 43 start-page: 473 year: 2007 ident: 10.1016/j.ins.2015.04.044_b0025 article-title: Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control publication-title: Automatica doi: 10.1016/j.automatica.2006.09.019 – year: 1982 ident: 10.1016/j.ins.2015.04.044_b0045 – volume: 32 start-page: 236 year: 2012 ident: 10.1016/j.ins.2015.04.044_b0260 article-title: An iterative ∊-optimal control scheme for a class of discrete-time nonlinear systems with unfixed initial state publication-title: Neural Netw. doi: 10.1016/j.neunet.2012.02.027 – volume: 43 start-page: 206 year: 2013 ident: 10.1016/j.ins.2015.04.044_b0315 article-title: Near-optimal control for nonzero-sum differential games of continuous-time nonlinear systems using single-network ADP publication-title: IEEE Trans. Cybernet. doi: 10.1109/TSMCB.2012.2203336 – volume: 286 start-page: 209 year: 2014 ident: 10.1016/j.ins.2015.04.044_b0065 article-title: Reinforcement learning with automatic basis construction based on isometric feature mapping publication-title: Inform. Sci. doi: 10.1016/j.ins.2014.07.008 – volume: 50 start-page: 1167 year: 2014 ident: 10.1016/j.ins.2015.04.044_b0080 article-title: Reinforcement-learning for optimal tracking control of linear discrete-time systems with unknown dynamics publication-title: Automatica doi: 10.1016/j.automatica.2014.02.015 – volume: 24 start-page: 1150 year: 2013 ident: 10.1016/j.ins.2015.04.044_b0090 article-title: Robust adaptive dynamic programming with an application to power systems publication-title: IEEE Trans. Neural Netw. Learn. Syst. 
doi: 10.1109/TNNLS.2013.2249668 – volume: 25 start-page: 882 year: 2014 ident: 10.1016/j.ins.2015.04.044_b0095 article-title: Robust adaptive dynamic programming and feedback stabilization of nonlinear systems publication-title: IEEE Trans. Neural Netw. Learn. Syst. doi: 10.1109/TNNLS.2013.2294968 – year: 1995 ident: 10.1016/j.ins.2015.04.044_b0040 – volume: 16 start-page: 1219 year: 2005 ident: 10.1016/j.ins.2015.04.044_b0145 article-title: A self-learning call admission control scheme for CDMA cellular networks publication-title: IEEE Trans. Neural Netw. doi: 10.1109/TNN.2005.853408 – volume: 28 start-page: 2670 year: 2013 ident: 10.1016/j.ins.2015.04.044_b0110 article-title: Two-level dynamic stochastic optimal power flow control for power systems with intermittent renewable generation publication-title: IEEE Trans. Power Syst. doi: 10.1109/TPWRS.2013.2237793 – volume: 28 start-page: 167 year: 2014 ident: 10.1016/j.ins.2015.04.044_b0255 article-title: Neural-network-based robust optimal control design for a class of uncertain nonlinear systems via adaptive dynamic programming publication-title: Inform. Sci. doi: 10.1016/j.ins.2014.05.050 – volume: 25 start-page: 635 year: 2014 ident: 10.1016/j.ins.2015.04.044_b0305 article-title: Reinforcement learning output feedback NN control using deterministic learning technique publication-title: IEEE Trans. Neural Netw. Learn. Syst. doi: 10.1109/TNNLS.2013.2292704 – volume: 4 start-page: 498 year: 2013 ident: 10.1016/j.ins.2015.04.044_b0165 article-title: Intelligent local area signals based damping of power system oscillations using virtual generators and approximate dynamic programming publication-title: IEEE Trans. 
Smart Grid doi: 10.1109/TSG.2012.2233224 – ident: 10.1016/j.ins.2015.04.044_b0245 doi: 10.1109/JSYST.2014.2330392 – volume: 37 start-page: 240 year: 2007 ident: 10.1016/j.ins.2015.04.044_b0020 article-title: Adaptive critic designs for discrete-time zero-sum games with application to H∞ control publication-title: IEEE Trans. Syst. Man Cybernet. – Part B: Cybernet. doi: 10.1109/TSMCB.2006.880135 – volume: 7 start-page: 1472 year: 2013 ident: 10.1016/j.ins.2015.04.044_b0265 article-title: Numerical adaptive learning control scheme for discrete-time nonlinear systems publication-title: IET Contr. Theory Appl. doi: 10.1049/iet-cta.2012.0486
StartPage	96
SubjectTerms	Adaptive critic designs Adaptive dynamic programming Algorithms Approximate dynamic programming Dynamics Games Graphical games Heterogeneous multi-agents Multiagent systems Optimization Policies Policy iteration Synchronism Synchronization
Title	Optimal distributed synchronization control for continuous-time heterogeneous multi-agent differential graphical games
URI	https://dx.doi.org/10.1016/j.ins.2015.04.044 https://www.proquest.com/docview/1770353198
Volume	317