A Theoretical Analysis of Contrastive Unsupervised Representation Learning

Saunshi, Nikunj; Plevrakis, Orestis; Arora, Sanjeev; Khodak, Mikhail; Khandeparkar, Hrishikesh

A Theoretical Analysis of Contrastive Unsupervised Representation Learning

Author(s): Saunshi, Nikunj; Plevrakis, Orestis; Arora, Sanjeev; Khodak, Mikhail; Khandeparkar, Hrishikesh

Download

To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1w84c

Full metadata record

DC Field	Value	Language
dc.contributor.author	Saunshi, Nikunj	-
dc.contributor.author	Plevrakis, Orestis	-
dc.contributor.author	Arora, Sanjeev	-
dc.contributor.author	Khodak, Mikhail	-
dc.contributor.author	Khandeparkar, Hrishikesh	-
dc.date.accessioned	2021-10-08T19:51:06Z	-
dc.date.available	2021-10-08T19:51:06Z	-
dc.date.issued	2019	en_US
dc.identifier.citation	Saunshi, Nikunj, Orestis Plevrakis, Sanjeev Arora, Mikhail Khodak, and Hrishikesh Khandeparkar. "A Theoretical Analysis of Contrastive Unsupervised Representation Learning." In Proceedings of the 36th International Conference on Machine Learning (2019): pp. 5628-5637.	en_US
dc.identifier.issn	2640-3498	-
dc.identifier.uri	http://proceedings.mlr.press/v97/saunshi19a/saunshi19a.pdf	-
dc.identifier.uri	http://arks.princeton.edu/ark:/88435/pr1w84c	-
dc.description.abstract	Recent empirical works have successfully used unlabeled data to learn feature representations that are broadly useful in downstream classification tasks. Several of these methods are reminiscent of the well-known word2vec embedding algorithm: leveraging availability of pairs of semantically “similar" data points and “negative samples," the learner forces the inner product of representations of similar pairs with each other to be higher on average than with negative samples. The current paper uses the term contrastive learning for such algorithms and presents a theoretical framework for analyzing them by introducing latent classes and hypothesizing that semantically similar points are sampled from the same latent class. This framework allows us to show provable guarantees on the performance of the learned representations on the average classification task that is comprised of a subset of the same set of latent classes. Our generalization bound also shows that learned representations can reduce (labeled) sample complexity on downstream tasks. We conduct controlled experiments in both the text and image domains to support the theory.	en_US
dc.format.extent	5628 - 5637	en_US
dc.language.iso	en_US	en_US
dc.relation.ispartof	Proceedings of the 36th International Conference on Machine Learning	en_US
dc.rights	Final published version. Article is made available in OAR by the publisher's permission or policy.	en_US
dc.title	A Theoretical Analysis of Contrastive Unsupervised Representation Learning	en_US
dc.type	Conference Article	en_US
pu.type.symplectic	http://www.symplectic.co.uk/publications/atom-terms/1.0/conference-proceeding	en_US

Files in This Item:

File	Description	Size	Format
TheoreticalAnalysisContrastive.pdf		520.42 kB	Adobe PDF	View/Download

Show Simple Item Record