CSR: Cascade Conditional Variational Auto Encoder with Socially-aware Regression for Pedestrian Trajectory Prediction

•The proposed trajectory prediction method consists of a cascaded CVAE module and a socially aware regression module.•The cascaded CVAE module decouples and balances the loss function with respect to time steps and minimizes the losses at every time steps independently.•The socially aware regression...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:Pattern recognition Ročník 133; s. 109030
Hlavní autori: Zhou, Hao, Ren, Dongchun, Yang, Xu, Fan, Mingyu, Huang, Hai
Médium: Journal Article
Jazyk:English
Vydavateľské údaje: Elsevier Ltd 01.01.2023
Predmet:
ISSN:0031-3203, 1873-5142
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Popis
Shrnutí:•The proposed trajectory prediction method consists of a cascaded CVAE module and a socially aware regression module.•The cascaded CVAE module decouples and balances the loss function with respect to time steps and minimizes the losses at every time steps independently.•The socially aware regression module corrects the predictions by checking the compatibility between the interaction coding and the crude predicted trajectories. Pedestrian trajectory prediction is a key technology in many real applications such as video surveillance, social robot navigation, and autonomous driving, and significant progress has been made in this research topic. However, there remain two limitations of previous studies. First, the losses of the last time steps are heavier weighted than that of the beginning time steps in the objective function at the learning stage, causing the prediction errors generated at the beginning to accumulate to large errors at the last time steps at the inference stage. Second, the prediction results of multiple pedestrians in the prediction horizon might be socially incompatible with the interactions modeled by past trajectories. To overcome these limitations, this work proposes a novel trajectory prediction method called CSR, which consists of a cascaded conditional variational autoencoder (CVAE) module and a socially-aware regression module. The CVAE module estimates the future trajectories in a cascaded sequential manner. Specifically, each CVAE concatenates the past trajectories and the predicted location points so far as the input and predicts the adjacent location at the following time step. The socially-aware regression module generates offsets from the estimated future trajectories to produce the corrected predictions, which are more reasonable and accurate than the estimated trajectories. Experiments results demonstrate that the proposed method exhibits significant improvements over state-of-the-art methods on the Stanford Drone Dataset (SDD) and the ETH/UCY dataset of approximately 38.0% and 22.2%, respectively. The code is available at https://github.com/zhouhao94/CSR.
ISSN:0031-3203
1873-5142
DOI:10.1016/j.patcog.2022.109030