Skip to main content

Provable Representation Learning for Imitation Learning via Bi-level Optimization

Author(s): Arora, Sanjeev; Du, Simon; Kakade, Sham; Luo, Yuping; Saunshi, Nikunj

Download
To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1xg1r
Full metadata record
DC FieldValueLanguage
dc.contributor.authorArora, Sanjeev-
dc.contributor.authorDu, Simon-
dc.contributor.authorKakade, Sham-
dc.contributor.authorLuo, Yuping-
dc.contributor.authorSaunshi, Nikunj-
dc.date.accessioned2021-10-08T19:50:46Z-
dc.date.available2021-10-08T19:50:46Z-
dc.date.issued2020en_US
dc.identifier.citationArora, Sanjeev, Simon Du, Sham Kakade, Yuping Luo, and Nikunj Saunshi. "Provable Representation Learning for Imitation Learning via Bi-level Optimization." In International Conference on Machine Learning (2020): pp. 367-376.en_US
dc.identifier.issn2640-3498-
dc.identifier.urihttp://proceedings.mlr.press/v119/arora20a/arora20a.pdf-
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/pr1xg1r-
dc.description.abstractA common strategy in modern learning systems is to learn a representation that is useful for many tasks, a.k.a. representation learning. We study this strategy in the imitation learning setting for Markov decision processes (MDPs) where multiple experts’ trajectories are available. We formulate representation learning as a bi-level optimization problem where the “outer" optimization tries to learn the joint representation and the “inner" optimization encodes the imitation learning setup and tries to learn task-specific parameters. We instantiate this framework for the imitation learning settings of behavior cloning and observation-alone. Theoretically, we show using our framework that representation learning can provide sample complexity benefits for imitation learning in both settings. We also provide proof-of-concept experiments to verify our theory.en_US
dc.format.extent367 - 376en_US
dc.language.isoen_USen_US
dc.relation.ispartofInternational Conference on Machine Learningen_US
dc.rightsFinal published version. Article is made available in OAR by the publisher's permission or policy.en_US
dc.titleProvable Representation Learning for Imitation Learning via Bi-level Optimizationen_US
dc.typeConference Articleen_US
pu.type.symplectichttp://www.symplectic.co.uk/publications/atom-terms/1.0/conference-proceedingen_US

Files in This Item:
File Description SizeFormat 
RepLearning.pdf943.47 kBAdobe PDFView/Download


Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.