CSR: Cascade Conditional Variational Auto Encoder with Socially-aware Regression for Pedestrian Trajectory Prediction

•The proposed trajectory prediction method consists of a cascaded CVAE module and a socially aware regression module.•The cascaded CVAE module decouples and balances the loss function with respect to time steps and minimizes the losses at every time steps independently.•The socially aware regression...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Pattern recognition Ročník 133; s. 109030
Hlavní autoři: Zhou, Hao, Ren, Dongchun, Yang, Xu, Fan, Mingyu, Huang, Hai
Médium: Journal Article
Jazyk:angličtina
Vydáno: Elsevier Ltd 01.01.2023
Témata:
ISSN:0031-3203, 1873-5142
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:•The proposed trajectory prediction method consists of a cascaded CVAE module and a socially aware regression module.•The cascaded CVAE module decouples and balances the loss function with respect to time steps and minimizes the losses at every time steps independently.•The socially aware regression module corrects the predictions by checking the compatibility between the interaction coding and the crude predicted trajectories. Pedestrian trajectory prediction is a key technology in many real applications such as video surveillance, social robot navigation, and autonomous driving, and significant progress has been made in this research topic. However, there remain two limitations of previous studies. First, the losses of the last time steps are heavier weighted than that of the beginning time steps in the objective function at the learning stage, causing the prediction errors generated at the beginning to accumulate to large errors at the last time steps at the inference stage. Second, the prediction results of multiple pedestrians in the prediction horizon might be socially incompatible with the interactions modeled by past trajectories. To overcome these limitations, this work proposes a novel trajectory prediction method called CSR, which consists of a cascaded conditional variational autoencoder (CVAE) module and a socially-aware regression module. The CVAE module estimates the future trajectories in a cascaded sequential manner. Specifically, each CVAE concatenates the past trajectories and the predicted location points so far as the input and predicts the adjacent location at the following time step. The socially-aware regression module generates offsets from the estimated future trajectories to produce the corrected predictions, which are more reasonable and accurate than the estimated trajectories. Experiments results demonstrate that the proposed method exhibits significant improvements over state-of-the-art methods on the Stanford Drone Dataset (SDD) and the ETH/UCY dataset of approximately 38.0% and 22.2%, respectively. The code is available at https://github.com/zhouhao94/CSR.
ISSN:0031-3203
1873-5142
DOI:10.1016/j.patcog.2022.109030