Provable Representation Learning for Imitation Learning via Bi-level Optimization

Arora, Sanjeev; Du, Simon; Kakade, Sham; Luo, Yuping; Saunshi, Nikunj

To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1xg1r

Full metadata record

DC Field	Value	Language
dc.contributor.author	Arora, Sanjeev	-
dc.contributor.author	Du, Simon	-
dc.contributor.author	Kakade, Sham	-
dc.contributor.author	Luo, Yuping	-
dc.contributor.author	Saunshi, Nikunj	-
dc.date.accessioned	2021-10-08T19:50:46Z	-
dc.date.available	2021-10-08T19:50:46Z	-
dc.date.issued	2020	en_US
dc.identifier.citation	Arora, Sanjeev, Simon Du, Sham Kakade, Yuping Luo, and Nikunj Saunshi. "Provable Representation Learning for Imitation Learning via Bi-level Optimization." In International Conference on Machine Learning (2020): pp. 367-376.	en_US
dc.identifier.issn	2640-3498	-
dc.identifier.uri	http://proceedings.mlr.press/v119/arora20a/arora20a.pdf	-
dc.identifier.uri	http://arks.princeton.edu/ark:/88435/pr1xg1r	-
dc.description.abstract	A common strategy in modern learning systems is to learn a representation that is useful for many tasks, a.k.a. representation learning. We study this strategy in the imitation learning setting for Markov decision processes (MDPs) where multiple experts’ trajectories are available. We formulate representation learning as a bi-level optimization problem where the “outer” optimization tries to learn the joint representation and the “inner” optimization encodes the imitation learning setup and tries to learn task-specific parameters. We instantiate this framework for the imitation learning settings of behavior cloning and observation-alone. Theoretically, we show using our framework that representation learning can provide sample complexity benefits for imitation learning in both settings. We also provide proof-of-concept experiments to verify our theory.	en_US
dc.format.extent	367 - 376	en_US
dc.language.iso	en_US	en_US
dc.relation.ispartof	International Conference on Machine Learning	en_US
dc.rights	Final published version. Article is made available in OAR by the publisher's permission or policy.	en_US
dc.title	Provable Representation Learning for Imitation Learning via Bi-level Optimization	en_US
dc.type	Conference Article	en_US
pu.type.symplectic	http://www.symplectic.co.uk/publications/atom-terms/1.0/conference-proceeding	en_US

Files in This Item:

File	Description	Size	Format
RepLearning.pdf		943.47 kB	Adobe PDF