A local search approximation algorithm for k-means clustering
In k-means clustering we are given a set of n data points in d-dimensional space R^d and an integer k, and the problem is to determine a set of k points in R^d, called centers, to minimize the mean squared distance from each data point to its nearest center. No exact polynomial-time algorithms a...
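The objective stated above can be made concrete with a small sketch. The function names and the toy data below are illustrative assumptions, not taken from the paper; the code only evaluates the mean squared distance for a given set of centers and does not search for good centers.

```python
from typing import Sequence

Point = Sequence[float]


def squared_distance(p: Point, q: Point) -> float:
    """Squared Euclidean distance between two points in R^d."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def mean_squared_distance(points: Sequence[Point], centers: Sequence[Point]) -> float:
    """k-means objective: mean over all points of the squared
    distance from the point to its nearest center."""
    total = sum(min(squared_distance(p, c) for c in centers) for p in points)
    return total / len(points)


# Toy data (hypothetical): two pairs of points, one center per pair.
points = [(0.0, 0.0), (0.0, 2.0), (10.0, 0.0), (10.0, 2.0)]
centers = [(0.0, 1.0), (10.0, 1.0)]
print(mean_squared_distance(points, centers))  # 1.0
```

Every point here lies at distance 1 from its nearest center, so the mean squared distance is 1.0; replacing the two centers with the single center (5, 1) raises it to 26.0.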
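| Published in: | Computational geometry : theory and applications, Volume 28, Issue 2, pp. 89-112 |
|---|---|
| Main authors: | Kanungo, Tapas; Mount, David M.; Netanyahu, Nathan S.; Piatko, Christine D.; Silverman, Ruth; Wu, Angela Y. |
| Format: | Journal Article |
| Language: | English |
| Published: | Elsevier B.V, 2004-06-01 |
| Subjects: | Approximation algorithms; Clustering; Computational geometry; k-means; Local search |
| ISSN: | 0925-7721 |
| Online access: | https://dx.doi.org/10.1016/j.comgeo.2004.03.003 |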
| Abstract | In k-means clustering we are given a set of n data points in d-dimensional space R^d and an integer k, and the problem is to determine a set of k points in R^d, called centers, to minimize the mean squared distance from each data point to its nearest center. No exact polynomial-time algorithms are known for this problem. Although asymptotically efficient approximation algorithms exist, these algorithms are not practical due to the very high constant factors involved. There are many heuristics that are used in practice, but we know of no bounds on their performance. We consider the question of whether there exists a simple and practical approximation algorithm for k-means clustering. We present a local improvement heuristic based on swapping centers in and out. We prove that this yields a (9+ε)-approximation algorithm. We present an example showing that any approach based on performing a fixed number of swaps achieves an approximation factor of at least (9−ε) in all sufficiently high dimensions. Thus, our approximation factor is almost tight for algorithms based on performing a fixed number of swaps. To establish the practical value of the heuristic, we present an empirical study that shows that, when combined with Lloyd's algorithm, this heuristic performs quite well in practice. |
|---|---|
| Author | Silverman, Ruth; Wu, Angela Y.; Piatko, Christine D.; Kanungo, Tapas; Mount, David M.; Netanyahu, Nathan S. |
| Author_xml | 1. Kanungo, Tapas (kanungo@almaden.ibm.com), IBM Almaden Research Center, San Jose, CA 95120, USA; 2. Mount, David M. (mount@cs.umd.edu), Department of Computer Science, University of Maryland, College Park, MD, USA; 3. Netanyahu, Nathan S. (nathan@macs.biu.ac.il), Department of Mathematics and Computer Science, Bar-Ilan University, Ramat-Gan 52900, Israel; 4. Piatko, Christine D. (christine.piatko@jhuapl.edu), The Johns Hopkins University Applied Physics Laboratory, Laurel, MD, USA; 5. Silverman, Ruth (ruth@cfar.umd.edu), Center for Automation Research, University of Maryland, College Park, MD, USA; 6. Wu, Angela Y. (awu@american.edu), Department of Computer Science, American University, Washington, DC, USA |
| ContentType | Journal Article |
| Copyright | 2004 Elsevier B.V. |
| DOI | 10.1016/j.comgeo.2004.03.003 |
| Discipline | Mathematics |
| EndPage | 112 |
| ISICitedReferencesCount | 377 |
| ISSN | 0925-7721 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 2 |
| Keywords | k-means; Local search; Computational geometry; Approximation algorithms; Clustering |
| Language | English |
| License | http://www.elsevier.com/open-access/userlicense/1.0 https://www.elsevier.com/tdm/userlicense/1.0 https://www.elsevier.com/open-access/userlicense/1.0 |
| OpenAccessLink | https://dx.doi.org/10.1016/j.comgeo.2004.03.003 |
| PageCount | 24 |
| PublicationDate | 2004-06-01 |
| PublicationTitle | Computational geometry : theory and applications |
| PublicationYear | 2004 |
| Publisher | Elsevier B.V |
| StartPage | 89 |
| SubjectTerms | Approximation algorithms; Clustering; Computational geometry; k-means; Local search |
| Title | A local search approximation algorithm for k-means clustering |
| URI | https://dx.doi.org/10.1016/j.comgeo.2004.03.003 |
| Volume | 28 |
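
The abstract describes a local improvement heuristic that swaps centers in and out. The sketch below illustrates the general idea of a single-swap local search and is not the authors' implementation. It assumes, as a simplification, that candidate centers are the data points themselves; the function names, the `min_gain` threshold, the starting solution, and the toy data are all hypothetical. No approximation guarantee is claimed for this simplified version, and the Lloyd's-algorithm refinement mentioned in the abstract is not included.

```python
from itertools import product
from typing import List, Optional, Sequence, Tuple

Point = Tuple[float, ...]


def cost(points: Sequence[Point], centers: Sequence[Point]) -> float:
    """Sum of squared distances from each point to its nearest center."""
    return sum(
        min(sum((a - b) ** 2 for a, b in zip(p, c)) for c in centers)
        for p in points
    )


def single_swap_search(
    points: Sequence[Point],
    k: int,
    candidates: Optional[Sequence[Point]] = None,
    min_gain: float = 1e-9,
) -> Tuple[List[Point], float]:
    """Hypothetical single-swap local search.

    Repeatedly replaces one current center with one candidate whenever
    the swap lowers the cost by at least a relative factor `min_gain`,
    and stops when no such swap exists.
    """
    # Assumption: candidates default to the data points themselves.
    pool = list(candidates) if candidates is not None else list(points)
    centers = pool[:k]  # arbitrary starting solution
    best = cost(points, centers)
    improved = True
    while improved:
        improved = False
        for i, cand in product(range(k), pool):
            if cand in centers:
                continue
            # Swap center i out and the candidate in.
            trial = centers[:i] + [cand] + centers[i + 1:]
            trial_cost = cost(points, trial)
            if trial_cost < (1.0 - min_gain) * best:
                centers, best = trial, trial_cost
                improved = True
                break  # restart the scan from the new solution
    return centers, best


# Toy data (hypothetical): two well-separated groups of three points.
points = [
    (0.0, 0.0), (0.0, 1.0), (1.0, 0.0),
    (10.0, 10.0), (10.0, 11.0), (11.0, 10.0),
]
centers, best = single_swap_search(points, k=2)
print(sorted(centers), best)
```

On this toy input the search starts with both centers in the first group and ends with one center per group, at (0, 0) and (10, 10), with total cost 4.0. Requiring a relative gain for every accepted swap is one simple way to make the loop terminate; the threshold value used here is arbitrary.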