Consistent procedures for cluster tree estimation and pruning
Author(s): Chaudhuri, Kamalika; Dasgupta, Sanjoy; Kpotufe, Samory; Luxburg, Ulrike von
To refer to this page use:
http://arks.princeton.edu/ark:/88435/pr1jg3s
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Chaudhuri, Kamalika | - |
dc.contributor.author | Dasgupta, Sanjoy | - |
dc.contributor.author | Kpotufe, Samory | - |
dc.contributor.author | Luxburg, Ulrike von | - |
dc.date.accessioned | 2021-10-11T14:17:11Z | - |
dc.date.available | 2021-10-11T14:17:11Z | - |
dc.date.issued | 2014-12 | en_US |
dc.identifier.citation | Chaudhuri, Kamalika, Sanjoy Dasgupta, Samory Kpotufe, and Ulrike Von Luxburg. "Consistent procedures for cluster tree estimation and pruning." IEEE Transactions on Information Theory 60, no. 12 (2014): 7900-7912. doi:10.1109/TIT.2014.2361055 | en_US |
dc.identifier.issn | 0018-9448 | - |
dc.identifier.issn | 1557-9654 | - |
dc.identifier.uri | http://www.columbia.edu/~skk2175/allpapers.html | - |
dc.identifier.uri | http://arks.princeton.edu/ark:/88435/pr1jg3s | - |
dc.description.abstract | For a density $f$ on $\mathbb{R}^d$, a high-density cluster is any connected component of $\{x : f(x) \geq \lambda\}$, for some $\lambda > 0$. The set of all high-density clusters forms a hierarchy called the cluster tree of $f$. We present two procedures for estimating the cluster tree given samples from $f$. The first is a robust variant of the single linkage algorithm for hierarchical clustering. The second is based on the $k$-nearest neighbor graph of the samples. We give finite-sample convergence rates for these algorithms, which also imply consistency, and we derive lower bounds on the sample complexity of cluster tree estimation. Finally, we study a tree pruning procedure that guarantees, under milder conditions than usual, to remove clusters that are spurious while recovering those that are salient. | en_US |
dc.format.extent | 7900 - 7912 | en_US |
dc.language.iso | en_US | en_US |
dc.relation.ispartof | IEEE Transactions on Information Theory | en_US |
dc.rights | Author's manuscript | en_US |
dc.title | Consistent procedures for cluster tree estimation and pruning | en_US |
dc.type | Journal Article | en_US |
dc.identifier.doi | 10.1109/TIT.2014.2361055 | - |
pu.type.symplectic | http://www.symplectic.co.uk/publications/atom-terms/1.0/journal-article | en_US |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
ClusterTreeEstimationPruning.pdf | | 185.66 kB | Adobe PDF | View/Download |
Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.