Skip to main content

Rel3D: A Minimally Contrastive Benchmark for Grounding Spatial Relations in 3D

Author(s): Goyal, Anit; Yang, Kaiyu; Yang, Dawei; Deng, Jia

Download
To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1k55j
Full metadata record
DC FieldValueLanguage
dc.contributor.authorGoyal, Anit-
dc.contributor.authorYang, Kaiyu-
dc.contributor.authorYang, Dawei-
dc.contributor.authorDeng, Jia-
dc.date.accessioned2021-10-08T19:50:44Z-
dc.date.available2021-10-08T19:50:44Z-
dc.date.issued2020en_US
dc.identifier.citationGoyal, Ankit, Kaiyu Yang, Dawei Yang, and Jia Deng. "Rel3D: A Minimally Contrastive Benchmark for Grounding Spatial Relations in 3D." Advances in Neural Information Processing Systems 33 (2020): pp. 10514-10525en_US
dc.identifier.issn1049-5258-
dc.identifier.urihttps://proceedings.neurips.cc/paper/2020/file/76dc611d6ebaafc66cc0879c71b5db5c-Paper.pdf-
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/pr1k55j-
dc.description.abstractUnderstanding spatial relations (e.g., laptop on table) in visual input is important for both humans and robots. Existing datasets are insufficient as they lack large-scale, high-quality 3D ground truth information, which is critical for learning spatial relations. In this paper, we fill this gap by constructing Rel3D: the first large-scale, human-annotated dataset for grounding spatial relations in 3D. Rel3D enables quantifying the effectiveness of 3D information in predicting spatial relations on large-scale human data. Moreover, we propose minimally contrastive data collection---a novel crowdsourcing method for reducing dataset bias. The 3D scenes in our dataset come in minimally contrastive pairs: two scenes in a pair are almost identical, but a spatial relation holds in one and fails in the other. We empirically validate that minimally contrastive examples can diagnose issues with current relation detection models as well as lead to sample-efficient training. Code and data are available at https://github.com/princeton-vl/Rel3D.en_US
dc.format.extent10514 - 10525en_US
dc.language.isoen_USen_US
dc.relation.ispartofAdvances in Neural Information Processing Systemsen_US
dc.rightsFinal published version. Article is made available in OAR by the publisher's permission or policy.en_US
dc.titleRel3D: A Minimally Contrastive Benchmark for Grounding Spatial Relations in 3Den_US
dc.typeConference Articleen_US
pu.type.symplectichttp://www.symplectic.co.uk/publications/atom-terms/1.0/conference-proceedingen_US

Files in This Item:
File Description SizeFormat 
Rel3D.pdf3.75 MBAdobe PDFView/Download


Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.