A local search approximation algorithm for k-means clustering

In k-means clustering we are given a set of n data points in d-dimensional space R d and an integer  k, and the problem is to determine a set of  k points in  R d , called centers, to minimize the mean squared distance from each data point to its nearest center. No exact polynomial-time algorithms a...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:Computational geometry : theory and applications Ročník 28; číslo 2; s. 89 - 112
Hlavní autori: Kanungo, Tapas, Mount, David M., Netanyahu, Nathan S., Piatko, Christine D., Silverman, Ruth, Wu, Angela Y.
Médium: Journal Article
Jazyk:English
Vydavateľské údaje: Elsevier B.V 01.06.2004
Predmet:
ISSN:0925-7721
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Abstract In k-means clustering we are given a set of n data points in d-dimensional space R d and an integer  k, and the problem is to determine a set of  k points in  R d , called centers, to minimize the mean squared distance from each data point to its nearest center. No exact polynomial-time algorithms are known for this problem. Although asymptotically efficient approximation algorithms exist, these algorithms are not practical due to the very high constant factors involved. There are many heuristics that are used in practice, but we know of no bounds on their performance. We consider the question of whether there exists a simple and practical approximation algorithm for k-means clustering. We present a local improvement heuristic based on swapping centers in and out. We prove that this yields a (9+ ε)-approximation algorithm. We present an example showing that any approach based on performing a fixed number of swaps achieves an approximation factor of at least (9− ε) in all sufficiently high dimensions. Thus, our approximation factor is almost tight for algorithms based on performing a fixed number of swaps. To establish the practical value of the heuristic, we present an empirical study that shows that, when combined with Lloyd's algorithm, this heuristic performs quite well in practice.
AbstractList In k-means clustering we are given a set of n data points in d-dimensional space R d and an integer  k, and the problem is to determine a set of  k points in  R d , called centers, to minimize the mean squared distance from each data point to its nearest center. No exact polynomial-time algorithms are known for this problem. Although asymptotically efficient approximation algorithms exist, these algorithms are not practical due to the very high constant factors involved. There are many heuristics that are used in practice, but we know of no bounds on their performance. We consider the question of whether there exists a simple and practical approximation algorithm for k-means clustering. We present a local improvement heuristic based on swapping centers in and out. We prove that this yields a (9+ ε)-approximation algorithm. We present an example showing that any approach based on performing a fixed number of swaps achieves an approximation factor of at least (9− ε) in all sufficiently high dimensions. Thus, our approximation factor is almost tight for algorithms based on performing a fixed number of swaps. To establish the practical value of the heuristic, we present an empirical study that shows that, when combined with Lloyd's algorithm, this heuristic performs quite well in practice.
Author Silverman, Ruth
Wu, Angela Y.
Piatko, Christine D.
Kanungo, Tapas
Mount, David M.
Netanyahu, Nathan S.
Author_xml – sequence: 1
  givenname: Tapas
  surname: Kanungo
  fullname: Kanungo, Tapas
  email: kanungo@almaden.ibm.com
  organization: IBM Almaden Research Center, San Jose, CA 95120, USA
– sequence: 2
  givenname: David M.
  surname: Mount
  fullname: Mount, David M.
  email: mount@cs.umd.edu
  organization: Department of Computer Science, University of Maryland, College Park, MD, USA
– sequence: 3
  givenname: Nathan S.
  surname: Netanyahu
  fullname: Netanyahu, Nathan S.
  email: nathan@macs.biu.ac.il
  organization: Department of Mathematics and Computer Science, Bar-Ilan University, Ramat-Gan 52900, Israel
– sequence: 4
  givenname: Christine D.
  surname: Piatko
  fullname: Piatko, Christine D.
  email: christine.piatko@jhuapl.edu
  organization: The Johns Hopkins University Applied Physics Laboratory, Laurel, MD, USA
– sequence: 5
  givenname: Ruth
  surname: Silverman
  fullname: Silverman, Ruth
  email: ruth@cfar.umd.edu
  organization: Center for Automation Research, University of Maryland, College Park, MD, USA
– sequence: 6
  givenname: Angela Y.
  surname: Wu
  fullname: Wu, Angela Y.
  email: awu@american.edu
  organization: Department of Computer Science, American University, Washington, DC, USA
BookMark eNqFz81KAzEQwPEcKthW38BDXmDXSbLZdAWFUvyCghc9h-xs0qbubkqyir69W-vJg54GBv7D_GZk0ofeEnLBIGfAystdjqHb2JBzgCIHkQOICZlCxWWmFGenZJbSDgA4l9WUXC9pG9C0NFkTcUvNfh_Dh-_M4ENPTbsJ0Q_bjroQ6WvWWdMniu1bGmz0_eaMnDjTJnv-M-fk5e72efWQrZ_uH1fLdYZCwZCVxUIhK-pGKlWJigsjK6wlt6JWyhjLanTjgpXYQIXgCrOQ6Mpa8YVpUDoxJ1fHuxhDStE6jX74fnGIxreagT7g9U4f8fqA1yD0iB_j4le8jyMwfv6X3RwzO8LevY06obc92sZHi4Nugv_7wBdFT3tD
CitedBy_id crossref_primary_10_1007_s10878_018_0340_4
crossref_primary_10_1016_j_str_2014_08_007
crossref_primary_10_1137_17M1127181
crossref_primary_10_1145_3749982
crossref_primary_10_1007_s10878_018_0261_2
crossref_primary_10_1109_TIA_2023_3249143
crossref_primary_10_3390_computation8040090
crossref_primary_10_1137_17M112717X
crossref_primary_10_1080_14942119_2022_2139586
crossref_primary_10_3390_rs11070875
crossref_primary_10_1109_ACCESS_2020_2975449
crossref_primary_10_1007_s10107_018_1269_1
crossref_primary_10_1016_j_is_2021_101804
crossref_primary_10_1145_2395116_2395117
crossref_primary_10_1007_s00521_019_04673_0
crossref_primary_10_1007_s12145_024_01676_x
crossref_primary_10_1109_TIT_2021_3122465
crossref_primary_10_1190_geo2015_0220_1
crossref_primary_10_1016_j_neuroimage_2009_06_014
crossref_primary_10_1016_j_adhoc_2016_11_009
crossref_primary_10_1007_s11440_023_01803_w
crossref_primary_10_1109_TEVC_2023_3296645
crossref_primary_10_1137_23M1551936
crossref_primary_10_1016_j_ipl_2008_03_013
crossref_primary_10_1016_j_ipl_2022_106251
crossref_primary_10_1016_j_ipl_2016_11_009
crossref_primary_10_1016_j_tcs_2020_07_022
crossref_primary_10_1016_j_ins_2020_07_010
crossref_primary_10_1016_j_conbuildmat_2023_131141
crossref_primary_10_1145_1498698_1537601
crossref_primary_10_1016_j_neucom_2010_11_013
crossref_primary_10_1142_S012905412246008X
crossref_primary_10_1142_S0217595922400073
crossref_primary_10_1186_s40537_018_0122_y
crossref_primary_10_1007_s00453_015_0043_5
crossref_primary_10_1007_s10878_020_00550_y
crossref_primary_10_1016_j_ins_2018_02_001
crossref_primary_10_1007_s00180_007_0090_8
crossref_primary_10_1142_S0217595920400059
crossref_primary_10_1016_j_patcog_2009_05_016
crossref_primary_10_3390_electronics11152396
crossref_primary_10_1007_s00180_019_00871_5
crossref_primary_10_1155_2014_506480
crossref_primary_10_1007_s10518_024_02007_7
crossref_primary_10_3390_rs13183628
crossref_primary_10_1007_s11761_018_00253_7
crossref_primary_10_26552_com_C_2024_010
crossref_primary_10_3758_s13415_019_00763_7
crossref_primary_10_1007_s10844_011_0158_3
crossref_primary_10_1007_s10878_020_00569_1
crossref_primary_10_1016_j_entcs_2019_08_050
crossref_primary_10_1017_S0960129521000104
crossref_primary_10_3390_s19132842
crossref_primary_10_1007_s10878_021_00737_x
crossref_primary_10_1145_2027216_2027217
crossref_primary_10_1016_j_tcs_2010_05_034
crossref_primary_10_1109_TPDS_2013_186
crossref_primary_10_1109_TKDE_2020_3018744
crossref_primary_10_1109_TKDE_2013_113
crossref_primary_10_1145_2450142_2450144
crossref_primary_10_1186_s40537_020_00325_6
crossref_primary_10_1007_s10489_018_1238_7
crossref_primary_10_1007_s11081_020_09503_0
crossref_primary_10_1016_j_energy_2020_119134
crossref_primary_10_1142_S0217595922400097
crossref_primary_10_1371_journal_pone_0217050
crossref_primary_10_1007_s10878_018_0278_6
crossref_primary_10_1007_s00224_024_10211_w
crossref_primary_10_23919_JSEE_2023_000023
crossref_primary_10_1080_01430750_2019_1630307
crossref_primary_10_1016_j_aei_2024_102799
crossref_primary_10_1049_iet_gtd_2018_6820
crossref_primary_10_1287_moor_2021_1216
crossref_primary_10_1137_18M1171321
crossref_primary_10_1142_S0217595922400140
crossref_primary_10_1016_j_datak_2010_12_002
crossref_primary_10_1587_transinf_E94_D_2271
crossref_primary_10_1007_s10898_018_00733_2
crossref_primary_10_57120_yalvac_1636844
crossref_primary_10_1016_j_ipl_2013_02_003
crossref_primary_10_1016_j_physa_2019_122992
crossref_primary_10_1109_ACCESS_2019_2943166
crossref_primary_10_1016_j_patcog_2014_03_017
crossref_primary_10_1016_j_tcs_2022_11_027
crossref_primary_10_1109_TPDS_2014_2306193
crossref_primary_10_1007_s40305_022_00394_9
crossref_primary_10_1016_j_jpdc_2024_104966
crossref_primary_10_3389_frai_2023_1156269
crossref_primary_10_1287_ijoc_2022_1166
crossref_primary_10_1142_S0217595919500064
crossref_primary_10_1214_21_AOS2140
crossref_primary_10_1145_3301446
crossref_primary_10_1111_itor_12808
crossref_primary_10_1016_j_compbiomed_2021_105193
crossref_primary_10_1016_j_neucom_2021_04_028
crossref_primary_10_1145_3311953
crossref_primary_10_1007_s13218_017_0519_3
crossref_primary_10_1016_j_ref_2022_06_007
crossref_primary_10_1177_1550147717707112
crossref_primary_10_1002_2050_7038_12757
crossref_primary_10_1111_j_1467_8659_2012_03137_x
crossref_primary_10_1002_cpe_7447
crossref_primary_10_1145_3392720
crossref_primary_10_1049_enc2_12089
crossref_primary_10_1016_j_tcs_2020_06_029
crossref_primary_10_21272_mmi_2019_3_10
crossref_primary_10_1007_s00454_011_9340_1
crossref_primary_10_1145_2133803_2184450
crossref_primary_10_1007_s10878_021_00734_0
crossref_primary_10_1145_3674508
crossref_primary_10_4028_www_scientific_net_AMM_209_211_925
crossref_primary_10_1137_070683921
crossref_primary_10_3390_a13060146
crossref_primary_10_2478_acss_2023_0001
crossref_primary_10_1007_s00453_025_01338_4
crossref_primary_10_1002_sam_10097
crossref_primary_10_1162_netn_a_00050
crossref_primary_10_1371_journal_pone_0049946
crossref_primary_10_1016_j_jclepro_2019_117669
crossref_primary_10_1007_s12667_022_00535_2
crossref_primary_10_1007_s10878_019_00450_w
crossref_primary_10_1016_j_tcs_2018_04_048
crossref_primary_10_1145_3477541
crossref_primary_10_1007_s00371_017_1352_2
Cites_doi 10.1145/331499.331504
10.1007/BF02716805
10.1142/S0218001401000927
10.1126/science.220.4598.671
10.1109/TPAMI.1984.4767478
10.1109/TPAMI.2002.1017616
10.1007/PL00009311
10.1109/TIT.1987.1057277
10.1109/TIT.1982.1056489
10.1109/34.824819
10.1016/0196-6774(91)90007-L
10.1137/S0036144599352836
10.1007/s004540010019
ContentType Journal Article
Copyright 2004 Elsevier B.V.
Copyright_xml – notice: 2004 Elsevier B.V.
DBID 6I.
AAFTH
AAYXX
CITATION
DOI 10.1016/j.comgeo.2004.03.003
DatabaseName ScienceDirect Open Access Titles
Elsevier:ScienceDirect:Open Access
CrossRef
DatabaseTitle CrossRef
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
Discipline Mathematics
EndPage 112
ExternalDocumentID 10_1016_j_comgeo_2004_03_003
S0925772104000215
GroupedDBID --K
--M
-DZ
.DC
.~1
0R~
1B1
1RT
1~.
1~5
29F
4.4
457
4G.
5GY
5VS
6I.
7-5
71M
8P~
9JN
AACTN
AAEDT
AAEDW
AAFTH
AAIAV
AAIKJ
AAKOC
AALRI
AAOAW
AAQFI
AAQXK
AAXUO
AAYFN
ABAOU
ABBOA
ABFNM
ABMAC
ABVKL
ABXDB
ABYKQ
ACAZW
ACDAQ
ACGFS
ACNNM
ACRLP
ACZNC
ADBBV
ADEZE
ADMUD
AEBSH
AEKER
AEXQZ
AFKWA
AFTJW
AGHFR
AGUBO
AGYEJ
AHHHB
AHZHX
AIALX
AIEXJ
AIGVJ
AIKHN
AITUG
AJBFU
AJOXV
ALMA_UNASSIGNED_HOLDINGS
AMFUW
AMRAJ
AOUOD
ARUGR
ASPBG
AVWKF
AXJTR
AZFZN
BKOJK
BLXMC
CS3
EBS
EFJIC
EFLBG
EJD
EO8
EO9
EP2
EP3
F5P
FDB
FEDTE
FGOYB
FIRID
FNPLU
FYGXN
G-2
G-Q
GBLVA
GBOLZ
HVGLF
HZ~
IHE
IXB
J1W
KOM
LG9
M26
M41
MHUIS
MO0
N9A
NCXOZ
O-L
O9-
OAUVE
OK1
OZT
P-8
P-9
P2P
PC.
Q38
R2-
RIG
RNS
ROL
RPZ
SDF
SDG
SDP
SES
SEW
SPC
SPCBC
SSV
SSW
SSZ
T5K
UHS
WUQ
XPP
ZMT
~G-
9DU
AATTM
AAXKI
AAYWO
AAYXX
ABJNI
ABWVN
ACLOT
ACRPL
ADNMO
ADVLN
AEIPS
AFJKZ
AGQPQ
AIIUN
ANKPU
APXCP
CITATION
EFKBS
~HD
ID FETCH-LOGICAL-c370t-6487c14bd57793923a59cb52e3b77aae1bcf9cb16cd09c0f4a85cf6b728adc5f3
ISICitedReferencesCount 377
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000221974900003&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 0925-7721
IngestDate Tue Nov 18 22:15:30 EST 2025
Sat Nov 29 03:12:50 EST 2025
Fri Feb 23 02:30:56 EST 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 2
Keywords k-means
Local search
Computational geometry
Approximation algorithms
Clustering
Language English
License http://www.elsevier.com/open-access/userlicense/1.0
https://www.elsevier.com/tdm/userlicense/1.0
https://www.elsevier.com/open-access/userlicense/1.0
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c370t-6487c14bd57793923a59cb52e3b77aae1bcf9cb16cd09c0f4a85cf6b728adc5f3
OpenAccessLink https://dx.doi.org/10.1016/j.comgeo.2004.03.003
PageCount 24
ParticipantIDs crossref_citationtrail_10_1016_j_comgeo_2004_03_003
crossref_primary_10_1016_j_comgeo_2004_03_003
elsevier_sciencedirect_doi_10_1016_j_comgeo_2004_03_003
PublicationCentury 2000
PublicationDate 2004-06-01
PublicationDateYYYYMMDD 2004-06-01
PublicationDate_xml – month: 06
  year: 2004
  text: 2004-06-01
  day: 01
PublicationDecade 2000
PublicationTitle Computational geometry : theory and applications
PublicationYear 2004
Publisher Elsevier B.V
Publisher_xml – name: Elsevier B.V
References Fayyad, Piatetsky-Shapiro, Smyth, Uthurusamy (BIB015) 1996
Matoušek (BIB032) 2000; 24
Jain, Dubes (BIB021) 1988
Du, Faber, Gunzburger (BIB010) 1999; 41
Faber (BIB014) 1994; 22
Feller (BIB016) 1968
Kirkpatrick, Gelatt, Vecchi (BIB026) 1983; 220
Korupolu, Plaxton, Rajaraman (BIB029) 1998
Phillips (BIB037) 2002; vol. 2409
Garey, Johnson (BIB018) 1979
Arora, Raghavan, Rao (BIB003) 1998
Inaba, Katoh, Imai (BIB020) 1994
Vaisey, Gersho (BIB041) 1988
Capoyleas, Rote, Woeginger (BIB008) 1991; 12
Agarwal, Procopiuc (BIB001) 1998
Selim, Ismail (BIB038) 1984; 6
Forgey (BIB017) 1965; 21
Duda, Hart (BIB011) 1973
Eppstein (BIB013) 1997
Arya, Mount, Narayan (BIB004) 1996; 16
Arya, Garg, Khandekar, Pandit, Meyerson, Munagala (BIB005) 2001
Jain, Murty, Flynn (BIB023) 1999; 31
Kohonen (BIB027) 1989
Lloyd (BIB030) 1982; 28
Bandyopadhyay, Maulik, Pakhira (BIB007) 2001; 15
Pelleg, Moore (BIB035) 1999
Thorup (BIB040) 2001; vol. 2076
Charikar, Guha (BIB009) 1999
Pelleg, Moore (BIB036) 2000
Alsabti, Ranka, Singh (BIB002) 1998
Mettu, Plaxton (BIB033) 2002
Wesolowsky (BIB042) 1993; 1
Kaufman, Rousseeuw (BIB025) 1990
Gersho, Gray (BIB019) 1992
ElGamal, Hemanchandra, Shperling, Wei (BIB012) 1987; 33
Kanungo, Mount, Netanyahu, Piatko, Silverman, Wu (BIB024) 2002; 24
Ng, Han (BIB034) 1994
Jain, Duin, Mao (BIB022) 2000; 22
Kolliopoulos, Rao (BIB028) 1999; vol. 1643
Sharir (BIB039) 1997; 18
Ball, Hall (BIB006) 1964
MacQueen (BIB031) 1967
Alsabti (10.1016/j.comgeo.2004.03.003_BIB002) 1998
Capoyleas (10.1016/j.comgeo.2004.03.003_BIB008) 1991; 12
Charikar (10.1016/j.comgeo.2004.03.003_BIB009) 1999
Ng (10.1016/j.comgeo.2004.03.003_BIB034) 1994
Jain (10.1016/j.comgeo.2004.03.003_BIB021) 1988
Garey (10.1016/j.comgeo.2004.03.003_BIB018) 1979
Agarwal (10.1016/j.comgeo.2004.03.003_BIB001) 1998
Du (10.1016/j.comgeo.2004.03.003_BIB010) 1999; 41
Faber (10.1016/j.comgeo.2004.03.003_BIB014) 1994; 22
Kaufman (10.1016/j.comgeo.2004.03.003_BIB025) 1990
Bandyopadhyay (10.1016/j.comgeo.2004.03.003_BIB007) 2001; 15
Kirkpatrick (10.1016/j.comgeo.2004.03.003_BIB026) 1983; 220
Selim (10.1016/j.comgeo.2004.03.003_BIB038) 1984; 6
Gersho (10.1016/j.comgeo.2004.03.003_BIB019) 1992
Matoušek (10.1016/j.comgeo.2004.03.003_BIB032) 2000; 24
Ball (10.1016/j.comgeo.2004.03.003_BIB006) 1964
Jain (10.1016/j.comgeo.2004.03.003_BIB023) 1999; 31
Kanungo (10.1016/j.comgeo.2004.03.003_BIB024) 2002; 24
Wesolowsky (10.1016/j.comgeo.2004.03.003_BIB042) 1993; 1
MacQueen (10.1016/j.comgeo.2004.03.003_BIB031) 1967
Pelleg (10.1016/j.comgeo.2004.03.003_BIB035) 1999
Arya (10.1016/j.comgeo.2004.03.003_BIB005) 2001
Mettu (10.1016/j.comgeo.2004.03.003_BIB033) 2002
Vaisey (10.1016/j.comgeo.2004.03.003_BIB041) 1988
Duda (10.1016/j.comgeo.2004.03.003_BIB011) 1973
Forgey (10.1016/j.comgeo.2004.03.003_BIB017) 1965; 21
Kolliopoulos (10.1016/j.comgeo.2004.03.003_BIB028) 1999; vol. 1643
Eppstein (10.1016/j.comgeo.2004.03.003_BIB013) 1997
ElGamal (10.1016/j.comgeo.2004.03.003_BIB012) 1987; 33
Inaba (10.1016/j.comgeo.2004.03.003_BIB020) 1994
Fayyad (10.1016/j.comgeo.2004.03.003_BIB015) 1996
Arya (10.1016/j.comgeo.2004.03.003_BIB004) 1996; 16
Korupolu (10.1016/j.comgeo.2004.03.003_BIB029) 1998
Lloyd (10.1016/j.comgeo.2004.03.003_BIB030) 1982; 28
Pelleg (10.1016/j.comgeo.2004.03.003_BIB036) 2000
Kohonen (10.1016/j.comgeo.2004.03.003_BIB027) 1989
Phillips (10.1016/j.comgeo.2004.03.003_BIB037) 2002; vol. 2409
Jain (10.1016/j.comgeo.2004.03.003_BIB022) 2000; 22
Thorup (10.1016/j.comgeo.2004.03.003_BIB040) 2001; vol. 2076
Feller (10.1016/j.comgeo.2004.03.003_BIB016) 1968
Arora (10.1016/j.comgeo.2004.03.003_BIB003) 1998
Sharir (10.1016/j.comgeo.2004.03.003_BIB039) 1997; 18
References_xml – year: 1989
  ident: BIB027
  article-title: Self-Organization and Associative Memory
– start-page: 277
  year: 1999
  end-page: 281
  ident: BIB035
  article-title: Accelerating exact
  publication-title: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Diego, CA
– volume: 1
  start-page: 5
  year: 1993
  end-page: 23
  ident: BIB042
  article-title: The Weber problem: History and perspective
  publication-title: Location Sci.
– volume: 12
  start-page: 341
  year: 1991
  end-page: 356
  ident: BIB008
  article-title: Geometric clusterings
  publication-title: J. Algorithms
– volume: 22
  start-page: 138
  year: 1994
  end-page: 144
  ident: BIB014
  article-title: Clustering and the continuous
  publication-title: Los Alamos Sci.
– year: 1997
  ident: BIB013
  article-title: Faster construction of planar two-centers
  publication-title: Proc. 8th ACM-SIAM Sympos. Discrete Algorithms
– year: 1998
  ident: BIB002
  article-title: An efficient
  publication-title: Proceedings of the First Workshop on High Performance Data Mining, Orlando, FL
– year: 1964
  ident: BIB006
  article-title: Some fundamental concepts and synthesis procedures for pattern recognition preprocessors
  publication-title: International Conference on Microwaves, Circuit Theory, and Information Theory, Tokyo, Japan
– volume: vol. 1643
  start-page: 362
  year: 1999
  end-page: 371
  ident: BIB028
  article-title: A nearly linear-time approximation scheme for the Euclidean
  publication-title: Proceedings of the Seventh Annual European Symposium on Algorithms
– start-page: 1
  year: 1998
  end-page: 10
  ident: BIB029
  article-title: Analysis of a local search heuristic for facility location problems
  publication-title: Proceedings of the Ninth Annual ACM-SIAM Symposium on Discrete Algorithms, San Francisco, CA
– volume: 18
  start-page: 125
  year: 1997
  end-page: 134
  ident: BIB039
  article-title: A near-linear algorithm for the planar 2-center problem
  publication-title: Discrete Comput. Geom.
– year: 2000
  ident: BIB036
  publication-title: Proceedings of the Seventeenth International Conference on Machine Learning, Palo Alto, CA
– start-page: 378
  year: 1999
  end-page: 388
  ident: BIB009
  article-title: Improved combinatorial algorithms for the facility location and
  publication-title: Proceedings of the 4th Annual IEEE Symposium on Foundations of Computer Science
– year: 1968
  ident: BIB016
  article-title: An Introduction to Probability Theory and its Applications
– volume: 15
  start-page: 269
  year: 2001
  end-page: 285
  ident: BIB007
  article-title: Clustering using simulated annealing with probabilistic redistribution
  publication-title: Internat. J. Patt. Recog. Artif. Intell.
– volume: 28
  start-page: 129
  year: 1982
  end-page: 137
  ident: BIB030
  article-title: Least squares quantization in PCM
  publication-title: IEEE Trans. Inform. Theory
– volume: vol. 2076
  start-page: 249
  year: 2001
  end-page: 260
  ident: BIB040
  article-title: Quick
  publication-title: Proc. 28th Intl. Colloq. on Automata, Languages and Programming (ICALP)
– start-page: 144
  year: 1994
  end-page: 155
  ident: BIB034
  article-title: Efficient and effective clustering methods for spatial data mining
  publication-title: Proceedings of the Twentieth International Conference on Very Large Databases, Santiago, Chile
– volume: 21
  start-page: 768
  year: 1965
  ident: BIB017
  article-title: Cluster analysis of multivariate data: Efficiency vs. interpretability of classification
  publication-title: Biometrics
– volume: 24
  start-page: 61
  year: 2000
  end-page: 84
  ident: BIB032
  article-title: On approximate geometric
  publication-title: Discrete Comput. Geom.
– year: 1992
  ident: BIB019
  article-title: Vector Quantization and Signal Compression
– year: 1988
  ident: BIB021
  article-title: Algorithms for Clustering Data
– volume: 31
  start-page: 264
  year: 1999
  end-page: 323
  ident: BIB023
  article-title: Data clustering: A review
  publication-title: ACM Comput. Surv.
– volume: 22
  start-page: 4
  year: 2000
  end-page: 37
  ident: BIB022
  article-title: Statistical pattern recognition: A review
  publication-title: IEEE Trans. Patt. Anal. Mach. Intell.
– volume: 24
  year: 2002
  ident: BIB024
  article-title: An efficient
  publication-title: IEEE Trans. Patt. Anal. Mach. Intell.
– volume: vol. 2409
  year: 2002
  ident: BIB037
  article-title: Acceleration of
  publication-title: Algorithm Engineering and Experiments (Proc. ALENEX '02)
– volume: 16
  start-page: 155
  year: 1996
  end-page: 176
  ident: BIB004
  article-title: Accounting for boundary effects in nearest-neighbor searching
  publication-title: Discrete Comput. Geom.
– volume: 220
  start-page: 671
  year: 1983
  end-page: 680
  ident: BIB026
  article-title: Optimization by simulated annealing
  publication-title: Science
– start-page: 1176
  year: 1988
  end-page: 1179
  ident: BIB041
  article-title: Simulated annealing and codebook design
  publication-title: Proc. IEEE Internat. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
– year: 1973
  ident: BIB011
  article-title: Pattern Classification and Scene Analysis
– volume: 33
  start-page: 116
  year: 1987
  end-page: 123
  ident: BIB012
  article-title: Using simulated annealing to design good codes
  publication-title: IEEE Trans. Inform. Theory
– volume: 6
  start-page: 81
  year: 1984
  end-page: 87
  ident: BIB038
  publication-title: IEEE Trans. Patt. Anal. Mach. Intell.
– year: 1979
  ident: BIB018
  article-title: Computers and Intractability: A Guide to the Theory of NP-Completeness
– start-page: 658
  year: 1998
  end-page: 667
  ident: BIB001
  article-title: Exact and approximation algorithms for clustering
  publication-title: Proceedings of the Ninth Annual ACM-SIAM Symposium on Discrete Algorithms, San Francisco, CA
– year: 1996
  ident: BIB015
  article-title: Advances in Knowledge Discovery and Data Mining
– start-page: 339
  year: 2002
  end-page: 348
  ident: BIB033
  article-title: Optimal time bounds for approximate clustering
  publication-title: Proc. 18th Conf. on Uncertainty in Artif. Intell., Edmonton, Canada
– start-page: 21
  year: 2001
  end-page: 29
  ident: BIB005
  article-title: Local search heuristics for
  publication-title: Proceedings of the 33rd Annual Symposium on Theory of Computing, Crete, Greece
– volume: 41
  start-page: 637
  year: 1999
  end-page: 676
  ident: BIB010
  article-title: Centroidal Voronoi tesselations: Applications and algorithms
  publication-title: SIAM Rev.
– start-page: 332
  year: 1994
  end-page: 339
  ident: BIB020
  article-title: Applications of weighted Voronoi diagrams and randomization to variance-based
  publication-title: Proceedings of the Tenth Annual ACM Symposium on Computational Geometry, Stony Brook, NY
– year: 1990
  ident: BIB025
  article-title: Finding Groups in Data: An Introduction to Cluster Analysis
– start-page: 106
  year: 1998
  end-page: 113
  ident: BIB003
  article-title: Approximation schemes for Euclidean
  publication-title: Proceedings of the Thirtieth Annual ACM Symposium on Theory of Computing, Dallas, TX
– start-page: 281
  year: 1967
  end-page: 296
  ident: BIB031
  article-title: Some methods for classification and analysis of multivariate observations
  publication-title: Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, vol. 1, Berkeley, CA
– start-page: 332
  year: 1994
  ident: 10.1016/j.comgeo.2004.03.003_BIB020
  article-title: Applications of weighted Voronoi diagrams and randomization to variance-based k-clustering
– volume: vol. 2076
  start-page: 249
  year: 2001
  ident: 10.1016/j.comgeo.2004.03.003_BIB040
  article-title: Quick k-median, k-center, and facility location for sparse graphs
– volume: 31
  start-page: 264
  issue: 3
  year: 1999
  ident: 10.1016/j.comgeo.2004.03.003_BIB023
  article-title: Data clustering: A review
  publication-title: ACM Comput. Surv.
  doi: 10.1145/331499.331504
– volume: 1
  start-page: 5
  year: 1993
  ident: 10.1016/j.comgeo.2004.03.003_BIB042
  article-title: The Weber problem: History and perspective
  publication-title: Location Sci.
– start-page: 339
  year: 2002
  ident: 10.1016/j.comgeo.2004.03.003_BIB033
  article-title: Optimal time bounds for approximate clustering
– start-page: 658
  year: 1998
  ident: 10.1016/j.comgeo.2004.03.003_BIB001
  article-title: Exact and approximation algorithms for clustering
– year: 1988
  ident: 10.1016/j.comgeo.2004.03.003_BIB021
– volume: 16
  start-page: 155
  year: 1996
  ident: 10.1016/j.comgeo.2004.03.003_BIB004
  article-title: Accounting for boundary effects in nearest-neighbor searching
  publication-title: Discrete Comput. Geom.
  doi: 10.1007/BF02716805
– volume: 15
  start-page: 269
  year: 2001
  ident: 10.1016/j.comgeo.2004.03.003_BIB007
  article-title: Clustering using simulated annealing with probabilistic redistribution
  publication-title: Internat. J. Patt. Recog. Artif. Intell.
  doi: 10.1142/S0218001401000927
– volume: 220
  start-page: 671
  year: 1983
  ident: 10.1016/j.comgeo.2004.03.003_BIB026
  article-title: Optimization by simulated annealing
  publication-title: Science
  doi: 10.1126/science.220.4598.671
– volume: 6
  start-page: 81
  year: 1984
  ident: 10.1016/j.comgeo.2004.03.003_BIB038
  article-title: K-means-type algorithms: A generalized convergence theorem and characterization of local optimality
  publication-title: IEEE Trans. Patt. Anal. Mach. Intell.
  doi: 10.1109/TPAMI.1984.4767478
– volume: 24
  year: 2002
  ident: 10.1016/j.comgeo.2004.03.003_BIB024
  article-title: An efficient k-means clustering algorithm: Analysis and implementation
  publication-title: IEEE Trans. Patt. Anal. Mach. Intell.
  doi: 10.1109/TPAMI.2002.1017616
– volume: 18
  start-page: 125
  year: 1997
  ident: 10.1016/j.comgeo.2004.03.003_BIB039
  article-title: A near-linear algorithm for the planar 2-center problem
  publication-title: Discrete Comput. Geom.
  doi: 10.1007/PL00009311
– volume: 33
  start-page: 116
  year: 1987
  ident: 10.1016/j.comgeo.2004.03.003_BIB012
  article-title: Using simulated annealing to design good codes
  publication-title: IEEE Trans. Inform. Theory
  doi: 10.1109/TIT.1987.1057277
– year: 1992
  ident: 10.1016/j.comgeo.2004.03.003_BIB019
– volume: 28
  start-page: 129
  year: 1982
  ident: 10.1016/j.comgeo.2004.03.003_BIB030
  article-title: Least squares quantization in PCM
  publication-title: IEEE Trans. Inform. Theory
  doi: 10.1109/TIT.1982.1056489
– start-page: 378
  year: 1999
  ident: 10.1016/j.comgeo.2004.03.003_BIB009
  article-title: Improved combinatorial algorithms for the facility location and k-medians problem
– year: 1964
  ident: 10.1016/j.comgeo.2004.03.003_BIB006
  article-title: Some fundamental concepts and synthesis procedures for pattern recognition preprocessors
– year: 1979
  ident: 10.1016/j.comgeo.2004.03.003_BIB018
– year: 1998
  ident: 10.1016/j.comgeo.2004.03.003_BIB002
  article-title: An efficient k-means clustering algorithm
– start-page: 1176
  year: 1988
  ident: 10.1016/j.comgeo.2004.03.003_BIB041
  article-title: Simulated annealing and codebook design
– year: 1996
  ident: 10.1016/j.comgeo.2004.03.003_BIB015
– year: 1968
  ident: 10.1016/j.comgeo.2004.03.003_BIB016
– year: 2000
  ident: 10.1016/j.comgeo.2004.03.003_BIB036
  article-title: x-means: Extending k-means with efficient estimation of the number of clusters
– start-page: 144
  year: 1994
  ident: 10.1016/j.comgeo.2004.03.003_BIB034
  article-title: Efficient and effective clustering methods for spatial data mining
– volume: 22
  start-page: 138
  year: 1994
  ident: 10.1016/j.comgeo.2004.03.003_BIB014
  article-title: Clustering and the continuous k-means algorithm
  publication-title: Los Alamos Sci.
– year: 1997
  ident: 10.1016/j.comgeo.2004.03.003_BIB013
  article-title: Faster construction of planar two-centers
– year: 1989
  ident: 10.1016/j.comgeo.2004.03.003_BIB027
– year: 1990
  ident: 10.1016/j.comgeo.2004.03.003_BIB025
– volume: vol. 1643
  start-page: 362
  year: 1999
  ident: 10.1016/j.comgeo.2004.03.003_BIB028
  article-title: A nearly linear-time approximation scheme for the Euclidean k-median problem
– start-page: 21
  year: 2001
  ident: 10.1016/j.comgeo.2004.03.003_BIB005
  article-title: Local search heuristics for k-median and facility location problems
– start-page: 281
  year: 1967
  ident: 10.1016/j.comgeo.2004.03.003_BIB031
  article-title: Some methods for classification and analysis of multivariate observations
– volume: 22
  start-page: 4
  issue: 1
  year: 2000
  ident: 10.1016/j.comgeo.2004.03.003_BIB022
  article-title: Statistical pattern recognition: A review
  publication-title: IEEE Trans. Patt. Anal. Mach. Intell.
  doi: 10.1109/34.824819
– volume: 12
  start-page: 341
  year: 1991
  ident: 10.1016/j.comgeo.2004.03.003_BIB008
  article-title: Geometric clusterings
  publication-title: J. Algorithms
  doi: 10.1016/0196-6774(91)90007-L
– volume: 41
  start-page: 637
  year: 1999
  ident: 10.1016/j.comgeo.2004.03.003_BIB010
  article-title: Centroidal Voronoi tesselations: Applications and algorithms
  publication-title: SIAM Rev.
  doi: 10.1137/S0036144599352836
– volume: 24
  start-page: 61
  year: 2000
  ident: 10.1016/j.comgeo.2004.03.003_BIB032
  article-title: On approximate geometric k-clustering
  publication-title: Discrete Comput. Geom.
  doi: 10.1007/s004540010019
– start-page: 277
  year: 1999
  ident: 10.1016/j.comgeo.2004.03.003_BIB035
  article-title: Accelerating exact k-means algorithms with geometric reasoning
– volume: vol. 2409
  year: 2002
  ident: 10.1016/j.comgeo.2004.03.003_BIB037
  article-title: Acceleration of k-means and related clustering problems
– start-page: 106
  year: 1998
  ident: 10.1016/j.comgeo.2004.03.003_BIB003
  article-title: Approximation schemes for Euclidean k-median and related problems
– start-page: 1
  year: 1998
  ident: 10.1016/j.comgeo.2004.03.003_BIB029
  article-title: Analysis of a local search heuristic for facility location problems
– year: 1973
  ident: 10.1016/j.comgeo.2004.03.003_BIB011
– volume: 21
  start-page: 768
  year: 1965
  ident: 10.1016/j.comgeo.2004.03.003_BIB017
  article-title: Cluster analysis of multivariate data: Efficiency vs. interpretability of classification
  publication-title: Biometrics
SSID ssj0002259
Score 2.2344437
Snippet In k-means clustering we are given a set of n data points in d-dimensional space R d and an integer  k, and the problem is to determine a set of  k points in ...
SourceID crossref
elsevier
SourceType Enrichment Source
Index Database
Publisher
StartPage 89
SubjectTerms Approximation algorithms
Clustering
Computational geometry
k-means
Local search
Title A local search approximation algorithm for k-means clustering
URI https://dx.doi.org/10.1016/j.comgeo.2004.03.003
Volume 28
WOSCitedRecordID wos000221974900003&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVESC
  databaseName: Elsevier SD Freedom Collection Journals 2021
  issn: 0925-7721
  databaseCode: AIEXJ
  dateStart: 19950301
  customDbUrl:
  isFulltext: true
  dateEnd: 20180131
  titleUrlDefault: https://www.sciencedirect.com
  omitProxy: false
  ssIdentifier: ssj0002259
  providerName: Elsevier
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1Lb9NAEF5FKQc4IMpDtEDlAzdrK9u79q4PHCIogkqpKhGk3Kz1ep2mTewqdar01_BXGe_DTmlV6IHLKtrEa8fzaWZ2duYbhD4WRRGHqixxSHKJKS9ynCrBYc8ThgkPSslKoZtNsJMTPp2mp4PBL1cLc71gVcU3m_Tyv4oa5kDYbensI8TdLQoT8BmEDiOIHcZ_EvzI1_bJt_EMTRq-mZsKRV8sZvVq3pwtdXrhBV4qMFW-XKxbvgRnxRxxgW744IKFM1UvVbO68W0mSHs2r4let07AO-0tKtAhOgg7AWPczY9rS3OgM-n98WEfigYf9UacrY3Cb8P5_o_u21OAz0XdUyG0jvGXw1vxCtrnVbnAY9tAl5m6aKeDI76FtQiTLZVqOgxZ4xyanOs7et-EIM5bsc1MTSc13LWkt3PubP8P89clJbp8t_PMrNJ26KRZQDLNJrsTsTjlQ7Qz-n40Pe6MPahDQ-do_5SrztQphHef5n7vZ8ujmbxAz-1WxBsZCO2igapeomfjjsf36hX6NPI0mDwDJu8WmLwOTB6AybNg8nowvUY_vx5NPn_DtuEGloQFDU5g9ypDmhcxA7UNrr-IU5nHkSI5Y0KoMJclTISJLIJUBiUVPJZlkrOIi0LGJXmDhlVdqbfIo4GANRIhwPehlIpU5SwkomSRlIoUag8R9yIyadno26Yoi-whMewh3F11adhY_vJ75t5xZj1K4ylmAJwHr9x_5J3eoac92N-jYbNaqw_oibxu5lerA4ua3wyQnFg
linkProvider Elsevier
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=A+local+search+approximation+algorithm+for+k-means+clustering&rft.jtitle=Computational+geometry+%3A+theory+and+applications&rft.au=Kanungo%2C+Tapas&rft.au=Mount%2C+David+M.&rft.au=Netanyahu%2C+Nathan+S.&rft.au=Piatko%2C+Christine+D.&rft.date=2004-06-01&rft.issn=0925-7721&rft.volume=28&rft.issue=2-3&rft.spage=89&rft.epage=112&rft_id=info:doi/10.1016%2Fj.comgeo.2004.03.003&rft.externalDBID=n%2Fa&rft.externalDocID=10_1016_j_comgeo_2004_03_003
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0925-7721&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0925-7721&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0925-7721&client=summon