Interpretable Multi-Pedestrian Trajectory Prediction Using Social GAN and Social GCNN
Shin, Kazuki
This item is only available for download by members of the University of Illinois community. Students, faculty, and staff at the U of I may log in with your NetID and password to view the item. If you are trying to access an Illinois-restricted dissertation or thesis, you can request a copy through your library's Inter-Library Loan office or purchase a copy directly from ProQuest.
Permalink
https://hdl.handle.net/2142/110295
Description
Title
Interpretable Multi-Pedestrian Trajectory Prediction Using Social GAN and Social GCNN
Author(s)
Shin, Kazuki
Contributor(s)
Driggs-Campbell
Issue Date
2021-05
Keyword(s)
trajectory prediction
deep neural networks
Abstract
Multi-pedestrian trajectory prediction is a challenging problem that has a wide variety of real-world
applications ranging from crowd navigation to self-driving cars. Though much research is done in
this field, most methods are focused on showing qualitative results which capture the interaction. As
a consequence, the interpretability of these prediction algorithms is not well studied. Interpretability
helps us to inspect the dynamics between input features and output predictions. The block box
property of neural networks makes models, such as generative adversarial networks (GAN) and
graph convolutional neural networks (GCNN), hard to visualize and understand what is going on
internally. This thesis presents explanations in addition to the predictions as a way to improve
transparency in these trajectory predictions from prior works. The presentation is twofold: (1)
Based on the original Social GAN training model, we explore the interpretability of the latent code,
and (2) We improve the prediction performance of Social-STGCNN, based on the integration of
physical intuition and attention-based mechanisms. Our parametric study of these physical intuitions
shows that including both the velocity and position of neighboring pedestrians in the attention
mechanism improves model performance. Furthermore, we demonstrate that single attributes
of multi-pedestrian trajectories can be explored without affecting other attributes through latent
space disentanglement. Through this technique, various pedestrian behaviors can be identified and controlled by finding meaningful representations from the manifolds. We evaluate our approach on
the ETH/UCY pedestrian datasets using average displacement error (ADE) and final displacement
error (FDE) metrics. The results show social interactions that are intuitive in understanding why
exactly these algorithms make the decisions they do.
Use this login method if you
don't
have an
@illinois.edu
email address.
(Oops, I do have one)
IDEALS migrated to a new platform on June 23, 2022. If you created
your account prior to this date, you will have to reset your password
using the forgot-password link below.