Gradient-Based Algorithms With Intermediate Observations in Static and Differential Games

In two-player static and differential games, strategic players often use available or delayed information about the other player's decisions and solve an optimization or optimal control problem to determine their strategic choices. Without this information, the player's ability to determin...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE access Jg. 13; S. 2694 - 2704
Hauptverfasser: Hossain, Mohammad Safayet, Simaan, Marwan A., Qu, Zhihua
Format: Journal Article
Sprache:Englisch
Veröffentlicht: Piscataway IEEE 2025
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Schlagworte:
ISSN:2169-3536, 2169-3536
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:In two-player static and differential games, strategic players often use available or delayed information about the other player's decisions and solve an optimization or optimal control problem to determine their strategic choices. Without this information, the player's ability to determine its optimal decisions becomes problematic. In this paper, we propose an approach in which each player implements an iterative discrete-time gradient-based algorithm that relies only on intermediate either current or prior observations about the other player's actions. We explore the implementation of such gradient play algorithms in the case of non-zero-sum static games and in the more complex case of differential games. We discuss the properties of these algorithms with heterogeneous stepsizes and derive explicit necessary and sufficient conditions on the game parameters in the objective functions and stepsizes that guarantee convergence to the Nash equilibrium in static games with quadratic objective functions. Examples in both static and differential games are presented to illustrate the results.
Bibliographie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:2169-3536
2169-3536
DOI:10.1109/ACCESS.2024.3523258