Multiagent-based deep reinforcement learning framework for multi-asset adaptive trading and portfolio management

The highly dynamic nature of stock markets has motivated researchers to propose various supervised learning models to assist investors to optimize financial performance. Machine learning models have been used to predict price trends, and approaches have been proposed for portfolio management. Howeve...

Celý popis

Uložené v:

Podrobná bibliografia
Vydané v:	Neurocomputing (Amsterdam) Ročník 594; s. 127800
Hlavní autori:	Cheng, Li-Chen, Sun, Jian-Shiou
Médium:	Journal Article
Jazyk:	English
Vydavateľské údaje:	Elsevier B.V 14.08.2024
Predmet:	Multi agent Portfolio optimization Quantitative trading Reinforcement learning Twin delayed deep deterministic policy Gradient algorithm Portfolio optimization Quantitative trading Twin delayed deep deterministic policy Gradient algorithm Multi agent Reinforcement learning
ISSN:	0925-2312, 1872-8286
On-line prístup:	Získať plný text
Tagy:	Pridať tag Žiadne tagy, Buďte prvý, kto otaguje tento záznam!

Popis
Shrnutí:	The highly dynamic nature of stock markets has motivated researchers to propose various supervised learning models to assist investors to optimize financial performance. Machine learning models have been used to predict price trends, and approaches have been proposed for portfolio management. However, these studies focus on only one kind of financial issue, and the methods proposed exhibit poor generalizability. We address these problems with a multi-agent portfolio adaptive trading framework based on reinforcement learning to create an automated trading system with the best trading strategy that can be achieved by long-short situation judgment and adaptive capital allocation. We use the TD3 algorithm in the multi-agent algorithm to mitigate the overestimation and overfitting exhibited by traditional value functions and improve training stability. Experimental results show that the proposed framework outperforms single-agent reinforcement learning algorithms while achieving more stable returns. •This study seeks to optimize trading strategy and portfolio management issues using RL.•The multi-agent architecture allows agents to explore chaotic environments aside from their own capital assets.•The adaptive TPM autonomously rates assets, optimizing Sortino ratio rewards through dynamic trading weights.•Experimental results show that the proposed framework outperforms single-agent reinforcement learning algorithms.
ISSN:	0925-2312 1872-8286
DOI:	10.1016/j.neucom.2024.127800