Task-agnostic dynamics priors for deep reinforcement learning

Du, Y; Narasimhan, Karthik

To refer to this page use: http://arks.princeton.edu/ark:/88435/pr14c23

Full metadata record

DC Field	Value	Language
dc.contributor.author	Du, Y	-
dc.contributor.author	Narasimhan, Karthik	-
dc.date.accessioned	2021-10-08T19:46:55Z	-
dc.date.available	2021-10-08T19:46:55Z	-
dc.date.issued	2019-01-01	en_US
dc.identifier.citation	Du, Y, Narasimhan, K. (2019). Task-agnostic dynamics priors for deep reinforcement learning. 36th International Conference on Machine Learning, ICML 2019, 2019-June (3063 - 3078)	en_US
dc.identifier.uri	http://arks.princeton.edu/ark:/88435/pr14c23	-
dc.description.abstract	Copyright 2019 by the author(s). While model-based deep reinforcement learning (RL) holds great promise for sample efficiency and generalization, learning an accurate dynamics model is often challenging and requires substantial interaction with the environment. A wide variety of domains have dynamics that share common foundations like the laws of classical mechanics, which are rarely exploited by existing algorithms. In fact, humans continuously acquire and use such dynamics priors to easily adapt to operating in new environments. In this work, we propose an approach to learn task-agnostic dynamics priors from videos and incorporate them into an RL agent. Our method involves pre-training a frame predictor on task-agnostic physics videos to initialize dynamics models (and fine-tune them) for unseen target environments. Our frame prediction architecture, SpatialNet, is designed specifically to capture localized physical phenomena and interactions. Our approach allows for both faster policy learning and convergence to better policies, outperforming competitive approaches on several different environments. We also demonstrate that incorporating this prior allows for more effective transfer between environments.	en_US
dc.format.extent	3063 - 3078	en_US
dc.language.iso	en_US	en_US
dc.relation.ispartof	36th International Conference on Machine Learning, ICML 2019	en_US
dc.rights	Author's manuscript	en_US
dc.title	Task-agnostic dynamics priors for deep reinforcement learning	en_US
dc.type	Journal Article	en_US
pu.type.symplectic	http://www.symplectic.co.uk/publications/atom-terms/1.0/conference-proceeding	en_US

Files in This Item:

File	Description	Size	Format
DynamicsPriorsDeepReinforcementLearning.pdf		3.8 MB	Adobe PDF	View/Download
