Adaptive generalized ZEM-ZEV feedback guidance for planetary landing via a deep reinforcement learning approach

Precision landing on large and small planetary bodies is a technology of utmost importance for future human and robotic exploration of the solar system. In this context, the Zero-Effort-Miss/Zero-Effort-Velocity (ZEM/ZEV) feedback guidance algorithm has been studied extensively and is still a field...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Acta astronautica Ročník 171; s. 156 - 171
Hlavní autoři: Furfaro, Roberto, Scorsoglio, Andrea, Linares, Richard, Massari, Mauro
Médium: Journal Article
Jazyk:angličtina
Vydáno: Elmsford Elsevier Ltd 01.06.2020
Elsevier BV
Témata:
ISSN:0094-5765, 1879-2030
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:Precision landing on large and small planetary bodies is a technology of utmost importance for future human and robotic exploration of the solar system. In this context, the Zero-Effort-Miss/Zero-Effort-Velocity (ZEM/ZEV) feedback guidance algorithm has been studied extensively and is still a field of active research. The algorithm, although powerful in terms of accuracy and ease of implementation, has some limitations. Therefore with this paper we present an adaptive guidance algorithm based on classical ZEM/ZEV in which machine learning is used to overcome its limitations and create a closed loop guidance algorithm that is sufficiently lightweight to be implemented on board spacecraft and flexible enough to be able to adapt to the given constraint scenario. The adopted methodology is an actor-critic reinforcement learning algorithm that learns the parameters of the above-mentioned guidance architecture according to the given problem constraints. •A Deep Reinforcement Learning (DRL) framework for structured planetary landing guidance is presented.•The DRL framework is based on a customized actor-critic approach that employs Extreme Learning Machines (ELM).•The algorithm learns the ZEM/ZEV generalized feedback guidance gains as function of the spacecraft state.•The gains are adapted to ensure both quasi-optimality and flight-constraints.•Stability analysis shows the proposed approach is globally stable.
Bibliografie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:0094-5765
1879-2030
DOI:10.1016/j.actaastro.2020.02.051