Not logged in.

Contribution Details

Type Journal Article
Scope Discipline-based scholarship
Title Towards domain independence for learning-based monocular depth estimation
Organization Unit
Authors
  • Michele Mancini
  • Gabriele Costante
  • Paolo Valigi
  • Thomas Alessandro Ciarfuglia
  • Jeffrey Delmerico
  • Davide Scaramuzza
Item Subtype Original Work
Refereed Yes
Status Published in final form
Language
  • English
Journal Title IEEE Robotics and Automation Letters
Publisher Institute of Electrical and Electronics Engineers
Geographical Reach international
ISSN 2377-3766
Volume 2
Number 3
Page Range 1778 - 1785
Date 2017
Abstract Text Modern autonomous mobile robots require a strong understanding of their surroundings in order to safely operate in cluttered and dynamic environments. Monocular depth estimation offers a geometry-independent paradigm to detect free, navigable space with minimum space, and power consumption. These represent highly desirable features, especially for microaerial vehicles. In order to guarantee robust operation in real-world scenarios, the estimator is required to generalize well in diverse environments. Most of the existent depth estimators do not consider generalization, and only benchmark their performance on publicly available datasets after specific fine tuning. Generalization can be achieved by training on several heterogeneous datasets, but their collection and labeling is costly. In this letter, we propose a deep neural network for scene depth estimation that is trained on synthetic datasets, which allow inexpensive generation of ground truth data. We show how this approach is able to generalize well across different scenarios. In addition, we show how the addition of long short-term memory layers in the network helps to alleviate, in sequential image streams, some of the intrinsic limitations of monocular vision, such as global scale estimation, with low computational overhead. We demonstrate that the network is able to generalize well with respect to different real-world environments without any fine tuning, achieving comparable performance to state-of-the-art methods on the KITTI dataset.
Free access at DOI
Digital Object Identifier 10.1109/lra.2017.2657002
Other Identification Number merlin-id:15100
PDF File Download from ZORA
Export BibTeX
EP3 XML (ZORA)