ActiveStereoNet: End-to-End Self-supervised Learning for Active Stereo Systems

Zhang, Yinda; Khamis, Sameh; Rhemann, Christoph; Valentin, Julien; Kowdle, Adarsh; Tankovich, Vladimir; Schoenberg, Michael; Izadi, Shahram; Funkhouser, Thomas; Fanello, Sean

To refer to this page use: http://arks.princeton.edu/ark:/88435/pr18g2m

Full metadata record

DC Field	Value	Language
dc.contributor.author	Zhang, Yinda	-
dc.contributor.author	Khamis, Sameh	-
dc.contributor.author	Rhemann, Christoph	-
dc.contributor.author	Valentin, Julien	-
dc.contributor.author	Kowdle, Adarsh	-
dc.contributor.author	Tankovich, Vladimir	-
dc.contributor.author	Schoenberg, Michael	-
dc.contributor.author	Izadi, Shahram	-
dc.contributor.author	Funkhouser, Thomas	-
dc.contributor.author	Fanello, Sean	-
dc.date.accessioned	2021-10-08T19:46:23Z	-
dc.date.available	2021-10-08T19:46:23Z	-
dc.date.issued	2018	en_US
dc.identifier.citation	Zhang, Yinda, Sameh Khamis, Christoph Rhemann, Julien Valentin, Adarsh Kowdle, Vladimir Tankovich, Michael Schoenberg, Shahram Izadi, Thomas Funkhouser, and Sean Fanello. "ActiveStereoNet: End-to-End Self-supervised Learning for Active Stereo Systems." In European Conference on Computer Vision (ECCV) (2018): pp. 802-819. doi: 10.1007/978-3-030-01237-3_48	en_US
dc.identifier.issn	0302-9743	-
dc.identifier.uri	https://arxiv.org/pdf/1807.06009.pdf	-
dc.identifier.uri	http://arks.princeton.edu/ark:/88435/pr18g2m	-
dc.description.abstract	In this paper we present ActiveStereoNet, the first deep learning solution for active stereo systems. Due to the lack of ground truth, our method is fully self-supervised, yet it produces precise depth with a subpixel precision of 1/30th of a pixel; it does not suffer from the common over-smoothing issues; it preserves the edges; and it explicitly handles occlusions. We introduce a novel reconstruction loss that is more robust to noise and texture-less patches, and is invariant to illumination changes. The proposed loss is optimized using a window-based cost aggregation with an adaptive support weight scheme. This cost aggregation is edge-preserving and smooths the loss function, which is key to allowing the network to reach compelling results. Finally, we show how the task of predicting invalid regions, such as occlusions, can be trained end-to-end without ground truth. This component is crucial to reduce blur and particularly improves predictions along depth discontinuities. Extensive quantitative and qualitative evaluations on real and synthetic data demonstrate state-of-the-art results in many challenging scenes.	en_US
dc.format.extent	802 - 819	en_US
dc.language.iso	en_US	en_US
dc.relation.ispartof	European Conference on Computer Vision (ECCV)	en_US
dc.rights	Author's manuscript	en_US
dc.title	ActiveStereoNet: End-to-End Self-supervised Learning for Active Stereo Systems	en_US
dc.type	Conference Article	en_US
dc.identifier.doi	10.1007/978-3-030-01237-3_48	-
dc.identifier.eissn	1611-3349	-
pu.type.symplectic	http://www.symplectic.co.uk/publications/atom-terms/1.0/conference-proceeding	en_US

Files in This Item:

File	Description	Size	Format
SupervisedLearningActiveStereoSystems.pdf		14.72 MB	Adobe PDF