The Hardness of Approximation of Euclidean k-Means
Author(s): Awasthi, Pranjal; Charikar, Moses; Krishnaswamy, Ravishankar; Sinop, Ali K
DownloadTo refer to this page use:
http://arks.princeton.edu/ark:/88435/pr18g16
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Awasthi, Pranjal | - |
dc.contributor.author | Charikar, Moses | - |
dc.contributor.author | Krishnaswamy, Ravishankar | - |
dc.contributor.author | Sinop, Ali K | - |
dc.date.accessioned | 2021-10-08T19:44:38Z | - |
dc.date.available | 2021-10-08T19:44:38Z | - |
dc.date.issued | 2015 | en_US |
dc.identifier.citation | Awasthi, Pranjal, Moses Charikar, Ravishankar Krishnaswamy, and Ali Kemal Sinop. "The Hardness of Approximation of Euclidean k-Means." In 31st International Symposium on Computational Geometry (SoCG 2015), 34 (2015): 754-767. doi:10.4230/LIPIcs.SOCG.2015.754 | en_US |
dc.identifier.issn | 1868-8969 | - |
dc.identifier.uri | http://arks.princeton.edu/ark:/88435/pr18g16 | - |
dc.description.abstract | The Euclidean k-means problem is a classical problem that has been extensively studied in the theoretical computer science, machine learning and the computational geometry communities. In this problem, we are given a set of n points in Euclidean space R^d, and the goal is to choose k center points in R^d so that the sum of squared distances of each point to its nearest center is minimized. The best approximation algorithms for this problem include a polynomial time constant factor approximation for general k and a (1+c)-approximation which runs in time poly(n) exp(k/c). At the other extreme, the only known computational complexity result for this problem is NP-hardness [Aloise et al.'09]. The main difficulty in obtaining hardness results stems from the Euclidean nature of the problem, and the fact that any point in R^d can be a potential center. This gap in understanding left open the intriguing possibility that the problem might admit a PTAS for all k, d. In this paper we provide the first hardness of approximation for the Euclidean k-means problem. Concretely, we show that there exists a constant c > 0 such that it is NP-hard to approximate the k-means objective to within a factor of (1+c). We show this via an efficient reduction from the vertex cover problem on triangle-free graphs: given a triangle-free graph, the goal is to choose the fewest number of vertices which are incident on all the edges. Additionally, we give a proof that the current best hardness results for vertex cover can be carried over to triangle-free graphs. To show this we transform G, a known hard vertex cover instance, by taking a graph product with a suitably chosen graph H, and showing that the size of the (normalized) maximum independent set is almost exactly preserved in the product graph using a spectral analysis, which might be of independent interest. | en_US |
dc.format.extent | 754 - 767 | en_US |
dc.language.iso | en_US | en_US |
dc.relation.ispartof | 31st International Symposium on Computational Geometry (SoCG 2015) | en_US |
dc.rights | Final published version. This is an open access article. | en_US |
dc.title | The Hardness of Approximation of Euclidean k-Means | en_US |
dc.type | Conference Article | en_US |
dc.identifier.doi | 10.4230/LIPIcs.SOCG.2015.754 | - |
pu.type.symplectic | http://www.symplectic.co.uk/publications/atom-terms/1.0/conference-proceeding | en_US |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
HardnessApproximationEuclideankMeans.pdf | 529.14 kB | Adobe PDF | View/Download |
Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.