DeepDriving: Learning Affordance for Direct Perception in Autonomous Driving

Chen, Chenyi; Seff, Ari; Kornhauser, Alain; Xiao, Jianxiong

DeepDriving: Learning Affordance for Direct Perception in Autonomous Driving

Author(s): Chen, Chenyi; Seff, Ari; Kornhauser, Alain; Xiao, Jianxiong

Download

To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1355q

Full metadata record

DC Field	Value	Language
dc.contributor.author	Chen, Chenyi	-
dc.contributor.author	Seff, Ari	-
dc.contributor.author	Kornhauser, Alain	-
dc.contributor.author	Xiao, Jianxiong	-
dc.date.accessioned	2021-10-08T19:48:58Z	-
dc.date.available	2021-10-08T19:48:58Z	-
dc.date.issued	2015	en_US
dc.identifier.citation	Chen, Chenyi, Ari Seff, Alain Kornhauser, and Jianxiong Xiao. "DeepDriving: Learning Affordance for Direct Perception in Autonomous Driving." In IEEE International Conference on Computer Vision (ICCV) (2015): pp. 2722-2730. doi:10.1109/ICCV.2015.312	en_US
dc.identifier.uri	https://openaccess.thecvf.com/content_iccv_2015/papers/Chen_DeepDriving_Learning_Affordance_ICCV_2015_paper.pdf	-
dc.identifier.uri	http://arks.princeton.edu/ark:/88435/pr1355q	-
dc.description.abstract	Today, there are two major paradigms for vision-based autonomous driving systems: mediated perception approaches that parse an entire scene to make a driving decision, and behavior reflex approaches that directly map an input image to a driving action by a regressor. In this paper, we propose a third paradigm: a direct perception approach to estimate the affordance for driving. We propose to map an input image to a small number of key perception indicators that directly relate to the affordance of a road/traffic state for driving. Our representation provides a set of compact yet complete descriptions of the scene to enable a simple controller to drive autonomously. Falling in between the two extremes of mediated perception and behavior reflex, we argue that our direct perception representation provides the right level of abstraction. To demonstrate this, we train a deep Convolutional Neural Network using recording from 12 hours of human driving in a video game and show that our model can work well to drive a car in a very diverse set of virtual environments. We also train a model for car distance estimation on the KITTI dataset. Results show that our direct perception approach can generalize well to real driving images. Source code and data are available on our project website.	en_US
dc.format.extent	2722 - 2730	en_US
dc.language.iso	en_US	en_US
dc.relation.ispartof	Proceedings of the IEEE International Conference on Computer Vision	en_US
dc.rights	Author's manuscript	en_US
dc.title	DeepDriving: Learning Affordance for Direct Perception in Autonomous Driving	en_US
dc.type	Conference Article	en_US
dc.identifier.doi	10.1109/ICCV.2015.312	-
dc.identifier.eissn	2380-7504	-
pu.type.symplectic	http://www.symplectic.co.uk/publications/atom-terms/1.0/conference-proceeding	en_US

Files in This Item:

File	Description	Size	Format
DeepDrivingAffordanceDirect.pdf		1.8 MB	Adobe PDF	View/Download

Show Simple Item Record