Deep-Learning Assisted High-Resolution Binocular Stereo Depth Reconstruction

Hu, Yaoyu; Zhen, Weikun; Scherer, Sebastian

Computer Science > Computer Vision and Pattern Recognition

arXiv:1912.05012 (cs)

[Submitted on 23 Nov 2019 (v1), last revised 28 Feb 2020 (this version, v2)]

Title:Deep-Learning Assisted High-Resolution Binocular Stereo Depth Reconstruction

Authors:Yaoyu Hu, Weikun Zhen, Sebastian Scherer

View PDF

Abstract:This work presents dense stereo reconstruction using high-resolution images for infrastructure inspections. The state-of-the-art stereo reconstruction methods, both learning and non-learning ones, consume too much computational resource on high-resolution data. Recent learning-based methods achieve top ranks on most benchmarks. However, they suffer from the generalization issue due to lack of task-specific training data. We propose to use a less resource demanding non-learning method, guided by a learning-based model, to handle high-resolution images and achieve accurate stereo reconstruction. The deep-learning model produces an initial disparity prediction with uncertainty for each pixel of the down-sampled stereo image pair. The uncertainty serves as a self-measurement of its generalization ability and the per-pixel searching range around the initially predicted disparity. The downstream process performs a modified version of the Semi-Global Block Matching method with the up-sampled per-pixel searching range. The proposed deep-learning assisted method is evaluated on the Middlebury dataset and high-resolution stereo images collected by our customized binocular stereo camera. The combination of learning and non-learning methods achieves better performance on 12 out of 15 cases of the Middlebury dataset. In our infrastructure inspection experiments, the average 3D reconstruction error is less than 0.004m.

Comments:	Submitted to International Conference on Robotics and Automation (ICRA2020)
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
Cite as:	arXiv:1912.05012 [cs.CV]
	(or arXiv:1912.05012v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1912.05012

Submission history

From: Yaoyu Hu [view email]
[v1] Sat, 23 Nov 2019 00:55:28 UTC (5,431 KB)
[v2] Fri, 28 Feb 2020 20:11:08 UTC (5,497 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Deep-Learning Assisted High-Resolution Binocular Stereo Depth Reconstruction

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Deep-Learning Assisted High-Resolution Binocular Stereo Depth Reconstruction

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators