Coordination in Large Multiagent Reinforcement Learning Problems
Large distributed systems often require intelligent behavior. Although multiagent reinforcement learning can be applied to such systems, several yet unsolved challenges arise due to the large number of simultaneous learners. Among others, these include exponential growth of state-action spaces and c...
Uloženo v:
| Vydáno v: | 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology Ročník 2; s. 236 - 239 |
|---|---|
| Hlavní autoři: | , |
| Médium: | Konferenční příspěvek |
| Jazyk: | angličtina |
| Vydáno: |
IEEE
01.08.2011
|
| Témata: | |
| ISBN: | 9781457713736, 145771373X |
| On-line přístup: | Získat plný text |
| Tagy: |
Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
|
| Abstract | Large distributed systems often require intelligent behavior. Although multiagent reinforcement learning can be applied to such systems, several yet unsolved challenges arise due to the large number of simultaneous learners. Among others, these include exponential growth of state-action spaces and coordination. In this work, we deal with these two issues. Therefore, we consider a subclass of stochastic games called cooperative sequential stage games. With the help of a stateless distributed learning algorithm we solve the problem of growing state-action spaces. Then, we present six different techniques to coordinate action selection during the learning process. We prove a property of the learning algorithm that helps to reduce computational costs of one technique. An experimental analysis in a distributed agent partitioning problem with hundreds of agents reveals that the proposed techniques can lead to higher quality solutions and increase convergence speed compared to the basic approach. Some techniques even outperform a state-of-the-art special purpose approach. |
|---|---|
| AbstractList | Large distributed systems often require intelligent behavior. Although multiagent reinforcement learning can be applied to such systems, several yet unsolved challenges arise due to the large number of simultaneous learners. Among others, these include exponential growth of state-action spaces and coordination. In this work, we deal with these two issues. Therefore, we consider a subclass of stochastic games called cooperative sequential stage games. With the help of a stateless distributed learning algorithm we solve the problem of growing state-action spaces. Then, we present six different techniques to coordinate action selection during the learning process. We prove a property of the learning algorithm that helps to reduce computational costs of one technique. An experimental analysis in a distributed agent partitioning problem with hundreds of agents reveals that the proposed techniques can lead to higher quality solutions and increase convergence speed compared to the basic approach. Some techniques even outperform a state-of-the-art special purpose approach. |
| Author | Buning, H. K. Kemmerich, T. |
| Author_xml | – sequence: 1 givenname: T. surname: Kemmerich fullname: Kemmerich, T. email: kemmerich@upb.de organization: Int. Grad. Sch. Dynamic Intell. Syst., Univ. of Paderborn, Paderborn, Germany – sequence: 2 givenname: H. K. surname: Buning fullname: Buning, H. K. email: kbcsl@upb.de organization: Dept. of Comput. Sci., Univ. of Paderborn, Paderborn, Germany |
| BookMark | eNotjM1KxDAURiMqqGO3btz0BVpzkzTp3TkMjhYqioy4HNL2tkTaRNK68O0df87m48DHuWAnPnhi7Ap4DsDx5q3KqvUuFxwgV-qIJWhKbjQWqgCpjn8dVGEMSCP1GUvm-Z0f0BoR4ZzdbkKInfN2ccGnzqe1jQOlj5_j4uxAfklfyPk-xJamH6vJRu_8kD7H0Iw0zZfstLfjTMn_rtjr9m63ecjqp_tqs64zC5ovmYSO9500KJumKUm1aIQVfWGlpEYo7K2wCB0VCg1XBpG4AJIoqe11d7it2PVf1xHR_iO6ycavveaKm1LKb_47TK0 |
| ContentType | Conference Proceeding |
| DBID | 6IE 6IL CBEJK RIE RIL |
| DOI | 10.1109/WI-IAT.2011.44 |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE/IET Electronic Library (IEL) (UW System Shared) url: https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Computer Science |
| EISBN | 9780769545134 0769545130 |
| EndPage | 239 |
| ExternalDocumentID | 6040783 |
| Genre | orig-research |
| GroupedDBID | 6IE 6IL ACM ALMA_UNASSIGNED_HOLDINGS APO CBEJK GUFHI LHSKQ RIB RIC RIE RIL |
| ID | FETCH-LOGICAL-a160t-31d0fd3793bbb8e4c972a2f5a33eb249fa2a91de549704799e021e393ecf6d5a3 |
| IEDL.DBID | RIE |
| ISBN | 9781457713736 145771373X |
| IngestDate | Wed Aug 27 02:48:18 EDT 2025 |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-a160t-31d0fd3793bbb8e4c972a2f5a33eb249fa2a91de549704799e021e393ecf6d5a3 |
| PageCount | 4 |
| ParticipantIDs | ieee_primary_6040783 |
| PublicationCentury | 2000 |
| PublicationDate | 2011-Aug. |
| PublicationDateYYYYMMDD | 2011-08-01 |
| PublicationDate_xml | – month: 08 year: 2011 text: 2011-Aug. |
| PublicationDecade | 2010 |
| PublicationTitle | 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology |
| PublicationTitleAbbrev | wi-iat |
| PublicationYear | 2011 |
| Publisher | IEEE |
| Publisher_xml | – name: IEEE |
| SSID | ssj0000669991 ssj0001120470 |
| Score | 1.4712393 |
| Snippet | Large distributed systems often require intelligent behavior. Although multiagent reinforcement learning can be applied to such systems, several yet unsolved... |
| SourceID | ieee |
| SourceType | Publisher |
| StartPage | 236 |
| SubjectTerms | Convergence Cooperative Stochastic Games Coordination DSL Educational institutions Electronic mail Games Joints Learning Multiagent Reinforcement Learning |
| Title | Coordination in Large Multiagent Reinforcement Learning Problems |
| URI | https://ieeexplore.ieee.org/document/6040783 |
| Volume | 2 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3PS8MwFH5sw4OnqZvo_EEOHq1L07RpbspwOJAxZOJuI2leZJdO9sO_3yTtNgQv3tpSQkl5ee99yfd9AHeupHZFrZWRpKmKOLMs0jGKSBuXTlKh89zaYDYhxuN8NpOTBtzvuTCIGA6f4YO_DHv5ZllsPVTWz2jYdWpCU4is4mrt8RSXOn2tc8BXYka5oIHLlQrXiolktpN4qu-zWsQxprL_MYpGT9NK0pPzX1YrIdMM2__7xhPoHih7ZLJPRqfQwPIM2jvPBlKHcAceB0vXbi4qDJAsSvLqj4KTwMNVnmZF3jCIqRYBNyS1_uqnH9w7z6y78D58ng5eotpFIVJxRjdukTXUmsTFodY6R15IwRSzqUoS11VzaRVTMjboGkXh9eYlurSPiUywsJlxr51Dq1yWeAHE2xTTTFtlbMzRUO3i3RhXgLCCFZKnl9DxEzL_qoQy5vVc9P5-fAXHO4CWxtfQ2qy2eANHxfdmsV7dhr_7A2Z5oB4 |
| linkProvider | IEEE |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3PS8MwFH7MKehp6ib-NgePxqVp2jQ3RRwbzjFk4m4jaRLZpZP98O83SbsNwYu3tpRQUl7ee1_yfR_ArSupXVFrBRYkkZhRS7GKDMdKu3SScJVl1gazCT4YZOOxGNbgbsOFMcaEw2fm3l-GvXw9y1ceKmunJOw67cBuwhglJVtrg6i45OmrnS3CElHCOAlsroS7ZozH47XIU3WfVjKOERHtjx7uPY5KUU_GfpmthFzTafzvKw-htSXtoeEmHR1BzRTH0Fi7NqAqiJvw8DRzDee0RAHRtEB9fxgcBSau9EQr9GaCnGoekENUKbB--sG998yiBe-d59FTF1c-ClhGKVm6ZVYTq2MXiUqpzLBccCqpTWQcu76aCSupFJE2rlXkXnFeGJf4TSxik9tUu9dOoF7MCnMKyBsVk1RZqW3EjCbKRbzWrgShOc0FS86g6Sdk8lVKZUyquTj_-_EN7HdHr_1Jvzd4uYCDNVxLokuoL-crcwV7-fdyuphfhz_9A--Vo2U |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2011+IEEE%2FWIC%2FACM+International+Conferences+on+Web+Intelligence+and+Intelligent+Agent+Technology&rft.atitle=Coordination+in+Large+Multiagent+Reinforcement+Learning+Problems&rft.au=Kemmerich%2C+T.&rft.au=Buning%2C+H.+K.&rft.date=2011-08-01&rft.pub=IEEE&rft.isbn=9781457713736&rft.volume=2&rft.spage=236&rft.epage=239&rft_id=info:doi/10.1109%2FWI-IAT.2011.44&rft.externalDocID=6040783 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781457713736/lc.gif&client=summon&freeimage=true |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781457713736/mc.gif&client=summon&freeimage=true |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781457713736/sc.gif&client=summon&freeimage=true |

