# Lipschitz Density-Ratios, Structured Data, and Data-driven Tuning

## Author(s): Kpotufe, S

To refer to this page use: http://arks.princeton.edu/ark:/88435/pr15c5p
DC FieldValueLanguage
dc.contributor.authorKpotufe, S-
dc.date.accessioned2021-10-11T14:17:12Z-
dc.date.available2021-10-11T14:17:12Z-
dc.date.issued2017en_US
dc.identifier.citationKpotufe, Samory. "Lipschitz density-ratios, structured data, and data-driven tuning." In Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, PMLR 54, pp. 1320-1328. 2017.en_US
dc.identifier.issn2640-3498-
dc.identifier.urihttp://proceedings.mlr.press/v54/kpotufe17a.html-
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/pr15c5p-
dc.description.abstractDensity-ratio estimation (i.e. estimating f=fQ/fP for two unknown distributions Q and P) has proved useful in many Machine Learning tasks, e.g., risk-calibration in transfer-learning, two-sample tests, and also useful in common techniques such importance sampling and bias correction. While there are many important analyses of this estimation problem, the present paper derives convergence rates in other practical settings that are less understood, namely, extensions of traditional Lipschitz smoothness conditions, and common high-dimensional settings with structured data (e.g. manifold data, sparse data). Various interesting facts, which hold in earlier settings, are shown to extend to these settings. Namely, (1) optimal rates depend only on the smoothness of the ratio f, and not on the densities fQ, fP, supporting the belief that plugging in estimates for fQ, fP is suboptimal; (2) optimal rates depend only on the intrinsic dimension of data, i.e. this problem – unlike density estimation – escapes the curse of dimension. We further show that near-optimal rates are attainable by estimators tuned from data alone, i.e. with no prior distributional information. This last fact is of special interest in unsupervised settings such as this one, where only oracle rates seem to be known, i.e., rates which assume critical distributional information usually unavailable in practice.en_US
dc.format.extent1320-1328en_US
dc.language.isoen_USen_US
dc.relation.ispartofProceedings of the 20th International Conference on Artificial Intelligence and Statistics, PMLRen_US
dc.relation.ispartofseriesProceedings of Machine Learning Research;-
dc.rightsFinal published version. Article is made available in OAR by the publisher's permission or policy.en_US
dc.titleLipschitz Density-Ratios, Structured Data, and Data-driven Tuningen_US
dc.typeConference Articleen_US
pu.type.symplectichttp://www.symplectic.co.uk/publications/atom-terms/1.0/conference-proceedingen_US

Files in This Item:
File Description SizeFormat