Yang, Delong and Zhong, Xunyu and Gu, Dongbing and Peng, Xiafu and Yang, Gongliu and Zou, Chaosheng (2020) Unsupervised learning of depth estimation, camera motion prediction and dynamic object localization from video. International Journal of Advanced Robotic Systems, 17 (2). p. 172988142090965. DOI https://doi.org/10.1177/1729881420909653
Yang, Delong and Zhong, Xunyu and Gu, Dongbing and Peng, Xiafu and Yang, Gongliu and Zou, Chaosheng (2020) Unsupervised learning of depth estimation, camera motion prediction and dynamic object localization from video. International Journal of Advanced Robotic Systems, 17 (2). p. 172988142090965. DOI https://doi.org/10.1177/1729881420909653
Yang, Delong and Zhong, Xunyu and Gu, Dongbing and Peng, Xiafu and Yang, Gongliu and Zou, Chaosheng (2020) Unsupervised learning of depth estimation, camera motion prediction and dynamic object localization from video. International Journal of Advanced Robotic Systems, 17 (2). p. 172988142090965. DOI https://doi.org/10.1177/1729881420909653
Abstract
Estimating scene depth, predicting camera motion and localizing dynamic objects from monocular videos are fundamental but challenging research topics in computer vision. Deep learning has demonstrated an amazing performance for these tasks recently. This article presents a novel unsupervised deep learning framework for scene depth estimation, camera motion prediction and dynamic object localization from videos. Consecutive stereo image pairs are used to train the system while only monocular images are needed for inference. The supervisory signals for the training stage come from various forms of image synthesis. Due to the use of consecutive stereo video, both spatial and temporal photometric errors are used to synthesize the images. Furthermore, to relieve the impacts of occlusions, adaptive left-right consistency and forward-backward consistency losses are added to the objective function. Experimental results on the KITTI and Cityscapes datasets demonstrate that our method is more effective in depth estimation, camera motion prediction and dynamic object localization compared to previous models.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Deep learning, CNN, depth estimation, camera motion prediction, dynamic object localization |
Divisions: | Faculty of Science and Health Faculty of Science and Health > Computer Science and Electronic Engineering, School of |
SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
Depositing User: | Unnamed user with email elements@essex.ac.uk |
Date Deposited: | 30 Mar 2020 09:04 |
Last Modified: | 23 Sep 2022 19:38 |
URI: | http://repository.essex.ac.uk/id/eprint/27182 |
Available files
Filename: 1729881420909653.pdf
Licence: Creative Commons: Attribution 3.0