Transfer Learning for Dynamical Systems Models via Autoencoders and GANs

Transfer learning has seen significant progress in the domains of supervised learning and reinforcement learning, yet there remains a noticeable gap in the area of regression. In supervised learning, various transfer learning methods have been developed to leverage knowledge from one task to improve...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:Proceedings of the American Control Conference s. 8 - 14
Hlavní autori: Damiani, Angelo, Lopez, Gustavo Viera, Manganini, Giorgio, Metelli, Alberto Maria, Restelli, Marcello
Médium: Konferenčný príspevok..
Jazyk:English
Vydavateľské údaje: AACC 10.07.2024
Predmet:
ISSN:2378-5861
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Popis
Shrnutí:Transfer learning has seen significant progress in the domains of supervised learning and reinforcement learning, yet there remains a noticeable gap in the area of regression. In supervised learning, various transfer learning methods have been developed to leverage knowledge from one task to improve performance on another, but these approaches are primarily designed for classification tasks. In the realm of reinforcement learning, many techniques focus on policy transfer, and those that do address samples transfer predominantly apply it within homogeneous contexts. This paper presents a novel algorithm that bridges the transfer learning gap between supervised learning and reinforcement learning while specifically addressing the regression problem of dynamical systems' model estimation. Our approach harnesses the feature extraction capabilities of autoencoders and the generative power of Generative Adversarial Networks (GANs) to train a mapping that facilitates the seamless transformation of samples between dynamical systems. This approach represents a significant advancement in the field of transfer learning, as it offers a versatile and effective solution for transferring regression models across heterogeneous domains. Our experimental results demonstrate the algorithm's efficacy and its potential to improve model generalization and adaptability in diverse scenarios with varying data distributions and dynamics.
ISSN:2378-5861
DOI:10.23919/ACC60939.2024.10644658