Adaptive generalized ZEM-ZEV feedback guidance for planetary landing via a deep reinforcement learning approach
Precision landing on large and small planetary bodies is a technology of utmost importance for future human and robotic exploration of the solar system. In this context, the Zero-Effort-Miss/Zero-Effort-Velocity (ZEM/ZEV) feedback guidance algorithm has been studied extensively and is still a field...
Gespeichert in:
| Veröffentlicht in: | Acta astronautica Jg. 171; S. 156 - 171 |
|---|---|
| Hauptverfasser: | , , , |
| Format: | Journal Article |
| Sprache: | Englisch |
| Veröffentlicht: |
Elmsford
Elsevier Ltd
01.06.2020
Elsevier BV |
| Schlagworte: | |
| ISSN: | 0094-5765, 1879-2030 |
| Online-Zugang: | Volltext |
| Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
| Zusammenfassung: | Precision landing on large and small planetary bodies is a technology of utmost importance for future human and robotic exploration of the solar system. In this context, the Zero-Effort-Miss/Zero-Effort-Velocity (ZEM/ZEV) feedback guidance algorithm has been studied extensively and is still a field of active research. The algorithm, although powerful in terms of accuracy and ease of implementation, has some limitations. Therefore with this paper we present an adaptive guidance algorithm based on classical ZEM/ZEV in which machine learning is used to overcome its limitations and create a closed loop guidance algorithm that is sufficiently lightweight to be implemented on board spacecraft and flexible enough to be able to adapt to the given constraint scenario. The adopted methodology is an actor-critic reinforcement learning algorithm that learns the parameters of the above-mentioned guidance architecture according to the given problem constraints.
•A Deep Reinforcement Learning (DRL) framework for structured planetary landing guidance is presented.•The DRL framework is based on a customized actor-critic approach that employs Extreme Learning Machines (ELM).•The algorithm learns the ZEM/ZEV generalized feedback guidance gains as function of the spacecraft state.•The gains are adapted to ensure both quasi-optimality and flight-constraints.•Stability analysis shows the proposed approach is globally stable. |
|---|---|
| Bibliographie: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
| ISSN: | 0094-5765 1879-2030 |
| DOI: | 10.1016/j.actaastro.2020.02.051 |