Not logged in.

Quick Search - Contribution

Contribution Details

Type	Journal Article
Scope	Discipline-based scholarship
Title	Towards domain independence for learning-based monocular depth estimation
Organization Unit	Robotics and Perception Group (Davide Scaramuzza)
Authors	Michele Mancini Gabriele Costante Paolo Valigi Thomas Alessandro Ciarfuglia Jeffrey Delmerico Davide Scaramuzza
Item Subtype	Original Work
Refereed	Yes
Status	Published in final form
Language	English
Journal Title	IEEE Robotics and Automation Letters
Publisher	Institute of Electrical and Electronics Engineers
Geographical Reach	international
ISSN	2377-3766
Volume	2
Number	3
Page Range	1778 - 1785
Date	2017
Abstract Text	Modern autonomous mobile robots require a strong understanding of their surroundings in order to safely operate in cluttered and dynamic environments. Monocular depth estimation offers a geometry-independent paradigm to detect free, navigable space with minimum space, and power consumption. These represent highly desirable features, especially for microaerial vehicles. In order to guarantee robust operation in real-world scenarios, the estimator is required to generalize well in diverse environments. Most of the existent depth estimators do not consider generalization, and only benchmark their performance on publicly available datasets after specific fine tuning. Generalization can be achieved by training on several heterogeneous datasets, but their collection and labeling is costly. In this letter, we propose a deep neural network for scene depth estimation that is trained on synthetic datasets, which allow inexpensive generation of ground truth data. We show how this approach is able to generalize well across different scenarios. In addition, we show how the addition of long short-term memory layers in the network helps to alleviate, in sequential image streams, some of the intrinsic limitations of monocular vision, such as global scale estimation, with low computational overhead. We demonstrate that the network is able to generalize well with respect to different real-world environments without any fine tuning, achieving comparable performance to state-of-the-art methods on the KITTI dataset.
Free access at	DOI
Digital Object Identifier	10.1109/lra.2017.2657002
Other Identification Number	merlin-id:15100
PDF File	Download from ZORA
Export	BibTeX EP3 XML (ZORA)