A Graph Theoretic Criterion for Determining the Number of Clusters in a Data Set

This article is concerned with procedures for determining the number of clusters in a data set. Most of the procedures or stopping rules currently in use involve finding internally coherent and externally isolated clusters, but do not derive from the formal structure of the respective clustering mod...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:Multivariate behavioral research Ročník 27; číslo 4; s. 541 - 565
Hlavní autori: Krolak-Schwedt, Sabine, Eckes, Thomas
Médium: Journal Article
Jazyk:English
Vydavateľské údaje: Fort Worth, TX Lawrence Erlbaum Associates, Inc 01.10.1992
Society of Multivariate Experimental Psychology
Predmet:
ISSN:0027-3171, 1532-7906
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Abstract This article is concerned with procedures for determining the number of clusters in a data set. Most of the procedures or stopping rules currently in use involve finding internally coherent and externally isolated clusters, but do not derive from the formal structure of the respective clustering model. Based on the graph theoretic concepts of minimal spanning tree, maximal spanning tree, and homomorphic function, a new criterion is advanced that yields a well-defined clustering solution. Its performance in determining the number of clusters in several empirical data sets is evaluated by comparing it to four prominent stopping rules. It is shown that the proposed criterion not only possesses mathematically attractive properties but also may contribute to solving the number-of-clusters problem.
AbstractList This article is concerned with procedures for determining the number of clusters in a data set. Most of the procedures or stopping rules currently in use involve finding internally coherent and externally isolated clusters, but do not derive from the formal structure of the respective clustering model. Based on the graph theoretic concepts of minimal spanning tree, maximal spanning tree, and homomorphic function, a new criterion is advanced that yields a well-defined clustering solution. Its performance in determining the number of clusters in several empirical data sets is evaluated by comparing it to four prominent stopping rules. It is shown that the proposed criterion not only possesses mathematically attractive properties but also may contribute to solving the number-of-clusters problem.
Procedures for determining the number of clusters in a data set are explored. A proposed stopping rule, the GRAPH criterion, is compared to four stopping rules currently in use. The GRAPH criterion's mathematically attractive properties and utility in solving the number-of-clusters problem are demonstrated. (SLD)
Author Eckes, Thomas
Krolak-Schwedt, Sabine
Author_xml – sequence: 1
  givenname: Sabine
  surname: Krolak-Schwedt
  fullname: Krolak-Schwedt, Sabine
– sequence: 2
  givenname: Thomas
  surname: Eckes
  fullname: Eckes, Thomas
BackLink http://eric.ed.gov/ERICWebPortal/detail?accno=EJ464856$$DView record in ERIC
http://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=4754570$$DView record in Pascal Francis
https://www.ncbi.nlm.nih.gov/pubmed/26811133$$D View this record in MEDLINE/PubMed
BookMark eNp9kU9vEzEQxS1URNOWL4AQ8oFDLwv-s2vvHjhUaSmgCiqRuzV2bGK0awfbK9RvX0dJegCpJ4_n_d5I8-YMnYQYLEJvKPlAGZEfM-04kwMRk05MklbxF2ix6zW75glaEMJkw6mkp-gs59-EENG1wyt0ykRPKeV8ge6v8G2C7QavNjYmW7zBy-SLTT4G7GLC17Z-Jh98-IXLxuLv86RtwtHh5TjnqmXsAwZ8DQXwT1su0EsHY7avD-85Wn2-WS2_NHc_br8ur-4a0xFeGicAhCTaMaqNJuCEYZ1d19q5upTsKAfL-rqWHlpHOAwdtIMm3GrXQ8_P0eV-7DbFP7PNRU0-GzuOEGycs6JSkIEwRoeKvjugs57sWm2TnyA9qGMIFXh_ACAbGF2CYHx-4lrZtZ0kFXu7x2o45km9-daKtu9Elfu9bFLMOVmnjC9Qao4lgR8VJWp3NfX_1aqV_WM9Tn_W9Glv8qHeaYK_MY1rVeBhjOm4An_G_wj0B6zo
CODEN MVBRAV
CitedBy_id crossref_primary_10_1016_j_foodqual_2013_12_004
crossref_primary_10_1038_s41467_020_15956_9
crossref_primary_10_1037_0021_843X_109_1_74
crossref_primary_10_3390_vaccines9121428
crossref_primary_10_1081_STA_120037266
crossref_primary_10_1080_24709360_2019_1615770
crossref_primary_10_1007_s00357_010_9069_1
crossref_primary_10_3389_fphar_2023_1136184
crossref_primary_10_1016_S0898_1221_99_00090_5
Cites_doi 10.1007/BF02294245
10.1007/BF02294065
10.1007/BF02294012
10.2307/2529943
10.1007/BF01890078
10.1037/h0029393
10.1037//0033-2909.83.6.1072
10.1177/001316448004000320
10.1177/014662168701100401
10.1207/s15327906mbr1603_7
10.1007/BF02296969
10.1007/BF01908064
10.1146/annurev.es.05.110174.000533
10.1090/S0002-9939-1956-0078686-7
10.2307/2344237
10.4135/9781412986359
10.1207/s15327906mbr2003_4
10.1002/9780470316641
10.1111/j.2044-8317.1974.tb00524.x
10.1080/01621459.1975.10480256
10.1207/s15327906mbr1004_7
ContentType Journal Article
Copyright Copyright Taylor & Francis Group, LLC 1992
1993 INIST-CNRS
Copyright_xml – notice: Copyright Taylor & Francis Group, LLC 1992
– notice: 1993 INIST-CNRS
DBID AAYXX
CITATION
7SW
BJH
BNH
BNI
BNJ
BNO
ERI
PET
REK
WWN
IQODW
NPM
7X8
DOI 10.1207/s15327906mbr2704_3
DatabaseName CrossRef
ERIC
ERIC (Ovid)
ERIC
ERIC
ERIC (Legacy Platform)
ERIC( SilverPlatter )
ERIC
ERIC PlusText (Legacy Platform)
Education Resources Information Center (ERIC)
ERIC
Pascal-Francis
PubMed
MEDLINE - Academic
DatabaseTitle CrossRef
ERIC
PubMed
MEDLINE - Academic
DatabaseTitleList PubMed
MEDLINE - Academic

ERIC
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: 7X8
  name: MEDLINE - Academic
  url: https://search.proquest.com/medline
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Psychology
EISSN 1532-7906
ERIC EJ464856
EndPage 565
ExternalDocumentID 26811133
4754570
EJ464856
10_1207_s15327906mbr2704_3
9677200
Genre Journal Article
GroupedDBID --Z
-~X
.7I
07M
0R~
123
4.4
53G
8VB
ABIVO
ABJNI
ABLJU
ABPPZ
ABVXC
ABWZE
ABZLS
ACGFS
ACHQT
ACIWK
ACNCT
ADCVX
ADIUE
ADLFI
ADXAZ
AECIN
AENEX
AEPSL
AETEA
AEYOC
AFFNX
AJWEG
ALEEW
ALMA_UNASSIGNED_HOLDINGS
AQTUD
C5A
CAG
CBZAQ
CKOZC
COF
CS3
C~T
DU5
EBS
EJD
EMOBN
F5P
FEDTE
FXNIP
H13
HVGLF
HZ~
H~9
JLMOS
L7Y
MS~
NA5
NEJ
NW-
O9-
OHT
P-O
P2P
PQQKQ
QWB
QZZOY
TDBHL
TFH
TFL
TFW
TN5
TNTFI
TWZ
UA1
UAP
WH7
XOL
YNT
YQT
ZCG
ZL0
.GJ
.QK
0BK
5VS
AAGDL
AAGZJ
AAHIA
AAMFJ
AAMIU
AANPH
AAPUL
AATTQ
AAYXX
AAZMC
ABBZI
ABCCY
ABFIM
ABLIJ
ABPEM
ABRLO
ABRYG
ABTAI
ABXUL
ABXYU
ACPKE
ACRBO
ACTIO
ACTOA
ADAHI
ADEWX
ADKVQ
AEFOU
AEISY
AEKEX
AEOZL
AEXSR
AEZRU
AFHDM
AFRVT
AGDLA
AGMYJ
AGRBW
AHDZW
AIJEM
AIXGP
AIYEW
AKBVH
ALLRG
ALQZU
AVBZW
AWYRJ
BEJHT
BLEHA
BMOTO
BOHLJ
CCCUG
CITATION
CQ1
DGFLZ
DGXZK
DKSSO
EFRLQ
EGDCR
E~B
E~C
G-F
GTTXZ
HF~
IPNFZ
J.O
KYCEM
LJTGL
LPU
M4Z
RBICI
RIG
RNANH
ROL
ROSJB
RSYQP
S-F
STATR
TASJS
TBQAZ
TEH
TRJHH
TUROJ
UT5
UT9
VAE
ZXP
~01
~S~
7SW
BJH
BNH
BNI
BNJ
BNO
ERI
PET
REK
WWN
08R
AAAVI
ABJVF
ABPTK
ABPTX
ABQHQ
ABSSG
ACLSK
AEGYZ
AFOLD
AFWLO
FUNRP
IQODW
KDLKA
V1K
NPM
7X8
ID FETCH-LOGICAL-c503t-f6aa670bf21bcb0af6c25edbcbff3277513ae28270b94f03a95a49b03ebf8a83
IEDL.DBID TFW
ISICitedReferencesCount 9
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=10_1207_s15327906mbr2704_3&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 0027-3171
IngestDate Wed Oct 01 14:12:39 EDT 2025
Thu Apr 03 07:01:24 EDT 2025
Sun Oct 29 17:07:48 EDT 2023
Tue Dec 02 16:32:33 EST 2025
Sat Nov 29 06:40:25 EST 2025
Tue Nov 18 21:15:15 EST 2025
Mon Oct 20 23:41:18 EDT 2025
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 4
Keywords Cluster analysis
Statistical method
Selection criterion
Stopping rule
Graph theory
Language English
License CC BY 4.0
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c503t-f6aa670bf21bcb0af6c25edbcbff3277513ae28270b94f03a95a49b03ebf8a83
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
OpenAccessLink http://orbilu.uni.lu/handle/10993/8214
PMID 26811133
PQID 1760902219
PQPubID 23479
PageCount 25
ParticipantIDs crossref_citationtrail_10_1207_s15327906mbr2704_3
eric_primary_EJ464856
proquest_miscellaneous_1760902219
informaworld_taylorfrancis_310_1207_s15327906mbr2704_3
pascalfrancis_primary_4754570
pubmed_primary_26811133
crossref_primary_10_1207_s15327906mbr2704_3
PublicationCentury 1900
PublicationDate 1992-10-01
PublicationDateYYYYMMDD 1992-10-01
PublicationDate_xml – month: 10
  year: 1992
  text: 1992-10-01
  day: 01
PublicationDecade 1990
PublicationPlace Fort Worth, TX
PublicationPlace_xml – name: Fort Worth, TX
– name: United States
PublicationTitle Multivariate behavioral research
PublicationTitleAlternate Multivariate Behav Res
PublicationYear 1992
Publisher Lawrence Erlbaum Associates, Inc
Society of Multivariate Experimental Psychology
Publisher_xml – name: Lawrence Erlbaum Associates, Inc
– name: Society of Multivariate Experimental Psychology
References p_4_125
p_31_152
p_20_141
p_15_136
Milligan G. W. (p_26_147) 1985; 50
Carroll J. D. (p_7_128) 1983; 48
p_30_151
p_22_143
Baker F. B. (p_3_124) 1975; 70
p_2_123
p_33_154
p_29_150
Corter J. E. (p_11_132) 1986; 51
Everitt B. (p_13_134) 1979; 35
p_9_130
p_1_122
Calinski T. (p_5_126) 1974; 3
p_34_155
p_23_144
Kruskal J. B. (p_24_145) 1956; 7
Milligan G. W. (p_27_148) 1987; 11
p_6_127
Milligan G. W. (p_28_149) 1980; 40
Goodman L. A. (p_16_137) 1954; 49
Dalrymple-Alford E. C. (p_12_133) 1970; 74
p_8_129
Hubert L. (p_19_140) 1974; 27
Cormack R. M. (p_10_131) 1971; 134
p_32_153
p_21_142
p_25_146
Gower J. C. (p_17_138) 1969; 18
p_18_139
p_14_135
References_xml – volume: 50
  start-page: 159
  year: 1985
  ident: p_26_147
  publication-title: Psychometrika
  doi: 10.1007/BF02294245
– ident: p_34_155
– ident: p_8_129
– volume: 51
  start-page: 429
  year: 1986
  ident: p_11_132
  publication-title: Psychometrika
  doi: 10.1007/BF02294065
– volume: 48
  start-page: 157
  year: 1983
  ident: p_7_128
  publication-title: Psychometrika
  doi: 10.1007/BF02294012
– ident: p_1_122
– volume: 35
  start-page: 169
  year: 1979
  ident: p_13_134
  publication-title: Biometrics
  doi: 10.2307/2529943
– ident: p_15_136
  doi: 10.1007/BF01890078
– ident: p_4_125
– volume: 74
  start-page: 32
  year: 1970
  ident: p_12_133
  publication-title: Psychological Bulletin
  doi: 10.1037/h0029393
– ident: p_20_141
  doi: 10.1037//0033-2909.83.6.1072
– volume: 40
  start-page: 755
  year: 1980
  ident: p_28_149
  publication-title: Educational and Psychological Measurement
  doi: 10.1177/001316448004000320
– volume: 11
  start-page: 329
  year: 1987
  ident: p_27_148
  publication-title: Applied Psychological Measurement
  doi: 10.1177/014662168701100401
– ident: p_25_146
  doi: 10.1207/s15327906mbr1603_7
– volume: 49
  start-page: 732
  year: 1954
  ident: p_16_137
  publication-title: Journal of the American Statistical Association
– ident: p_9_130
– ident: p_21_142
– ident: p_23_144
– ident: p_6_127
  doi: 10.1007/BF02296969
– ident: p_18_139
  doi: 10.1007/BF01908064
– ident: p_29_150
  doi: 10.1146/annurev.es.05.110174.000533
– volume: 7
  start-page: 48
  year: 1956
  ident: p_24_145
  publication-title: Proceedings of the American Statistical Society
  doi: 10.1090/S0002-9939-1956-0078686-7
– volume: 134
  start-page: 321
  year: 1971
  ident: p_10_131
  publication-title: Journal of theRoya 1 Statistical Society (Series A)
  doi: 10.2307/2344237
– ident: p_14_135
– ident: p_2_123
  doi: 10.4135/9781412986359
– ident: p_31_152
  doi: 10.1207/s15327906mbr2003_4
– ident: p_33_154
– volume: 18
  start-page: 54
  year: 1969
  ident: p_17_138
  publication-title: Journal of the Royal Statistical Society (Series C)
– ident: p_32_153
  doi: 10.1002/9780470316641
– volume: 3
  start-page: 1
  year: 1974
  ident: p_5_126
  publication-title: Communications in Statistics
– volume: 27
  start-page: 14
  year: 1974
  ident: p_19_140
  publication-title: British Journal ofMathematica 1 and Statistical Psychology
  doi: 10.1111/j.2044-8317.1974.tb00524.x
– ident: p_22_143
– volume: 70
  start-page: 1
  year: 1975
  ident: p_3_124
  publication-title: Journal of the American Statistical Association
  doi: 10.1080/01621459.1975.10480256
– ident: p_30_151
  doi: 10.1207/s15327906mbr1004_7
SSID ssj0006549
Score 1.4180921
Snippet This article is concerned with procedures for determining the number of clusters in a data set. Most of the procedures or stopping rules currently in use...
Procedures for determining the number of clusters in a data set are explored. A proposed stopping rule, the GRAPH criterion, is compared to four stopping rules...
SourceID proquest
pubmed
pascalfrancis
eric
crossref
informaworld
SourceType Aggregation Database
Index Database
Enrichment Source
Publisher
StartPage 541
SubjectTerms Biological and medical sciences
Cluster Analysis
Data Collection
Equations (Mathematics)
Evaluation Criteria
Fundamental and applied biological sciences. Psychology
Graph Theory
Hierarchical Cluster Analysis
Mathematical Models
Psychology. Psychoanalysis. Psychiatry
Psychology. Psychophysiology
Psychometrics. Statistics. Methodology
Statistics. Mathematics
Stopping Rules
Title A Graph Theoretic Criterion for Determining the Number of Clusters in a Data Set
URI https://www.tandfonline.com/doi/abs/10.1207/s15327906mbr2704_3
http://eric.ed.gov/ERICWebPortal/detail?accno=EJ464856
https://www.ncbi.nlm.nih.gov/pubmed/26811133
https://www.proquest.com/docview/1760902219
Volume 27
WOSCitedRecordID wos10_1207_s15327906mbr2704_3&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVAWR
  databaseName: Taylor and Francis Online Journals
  customDbUrl:
  eissn: 1532-7906
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0006549
  issn: 0027-3171
  databaseCode: TFW
  dateStart: 19660101
  isFulltext: true
  titleUrlDefault: https://www.tandfonline.com
  providerName: Taylor & Francis
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1La9wwEB7SkEMufaRJ67YJKvQWDLJk63EMSTYlhyXQhe7NSLYEgdRb1t5C_31HstZ0S5tDc_NBEpLmKfnTNwCfSq3algufszZc3ThX5LbVNq9sQdtG6YrZNhabkPO5Wi71Xbpw6xOsMpyh_UgUEX11MG5jxwokLD57qziTmopvds0kLetA9olpfUD0LWZfJ0csqpT9snAVJ4v0ZubvQ-zEpQR93iEvDahJ0-PG-bHixb9T0hiaZi-euqiX8DwlpeRi1KJXsOe6IzicfOPP13B3QW4CtTVZbB8-klAkARew6gjOnlwlWA1GQoI5JZnHSiNk5cnlwyaQMfTkviOGXJnBkC9uOIbF7Hpx-TlPxRjypqJ8yL0wRkhqPStsY6nxomGVa_Hbe5y4rApuHJ7fsIkuPeVGV6bUlnJnvTKKn8B-t-rc2wCmQrvnqtHoajCEGmtRkZRlTdGyUrkyg2IribpJROWhXsZDHQ4sLFIs_7lXGZxPfb6PNB2Ptj4OAp5aXt-WAtVHZCB-l3g9xEuTJO-aPzbg6Y5uTCOXEjNUSTP4uNWVGs03_JMxnVtt-rqQIiBjMW5k8GZUoqkzEwojEefv_nda7-Ewoosj9vAD7A_rjTuFg-bHcN-vz-CZXKqzaDK_AGd_FTs
linkProvider Taylor & Francis
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV3da9UwFD_oFNyLn3OrcxrBNym0-c7j2HadOi8DC-6tJG0Cg9k7bnsH_vcmaW7ZFd2DvvUhCUnO9-nJ-QG8p0q2LeEux21I3Vhb5qZVJmemLNpGKoZNG8EmxHwuLy7UecI57VNZZYih3dgoIurqINwhGT02Q4jv3hjBQhX8h1liUdCa3IcHAZouRF_V7PukijlL_i8OyThRplczf15jwzKl4ueN9qWhblL3_urciHnxd6c0GqfZk_8-1lN4nPxSdDgy0jO4Z7vnsD2px58v4PwQfQzdrVG1fvuIAk6CP8GiQ3776DhV1nhjiLxbieYRbAQtHDq6WoV-DD267JBGx3rQ6JsddqCanVRHp3nCY8gbVpAhd1xrLgrjcGkaU2jHG8xs67-d8xsXrCTa-hDOD1HUFUQrpqkyBbHGSS3JS9jqFp3dC_VUXvSJbJTXNt6KamM8L0mDm7LFVFqaQbkmRd2kXuUBMuOqDjELjl2Wf7-rDD5Mc67HTh13jt4JFJ5GnnymnErGM-C3SV4PMW-SCF6TuxY82GCOaWUqvJMqigzerZml9hIcfsvozi5WfV0KHopjvenIYHfkomky5tIbI0Je_eu23sKj0-rrWX32af5lH7ZjsXEsRXwNW8NyZQ_gYXMzXPbLN1FyfgE6chh2
linkToPdf http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV3Li9QwGA-6iuzF56pVVyN4k0Kbd47Lzo5PhgEH3FtJmgQW1s4y7Qj-935JM8UR3YPeekhCku-Z9Jffh9AbppVzVISSuHh1431dWqdtyW1duVZpTqxLxSbkYqHOz_UyX7j1GVYZz9BhJIpIvjoa95ULIxdCevbGKZG6Et_shsiKNfQmugV5M49avZp_nTyx4Dn9JfEuTtb50cyfx9gLTBn7vMdeGmGTpoedC2PJi7_npCk2ze_976ruo7s5K8Unoxo9QDd89xAdTs7xxyO0PMHvIrc1Xu1ePuJYJQEWsO4wzB7PMq4GQiGGpBIvUqkRvA749HIb2Rh6fNFhg2dmMPiLH47Qan62On1f5moMZcsrOpRBGCNkZQOpbWsrE0RLuHfwHQJMXPKaGg8HOGiiWaio0dwwbSvqbVBG0cfooFt3_mlEU4HhU9Vq8DUQQ421oEnKkrZ2hCnPClTvJNG0mak8Fsy4bOKJhSSO5d_3qkBvpz5XI0_Hta2PooCnlmcfmWCKiwKJXyXeDOnWJMu7odcNeLynG9PITEKKKqsCvd7pSgP2G3_KmM6vt31TSxGhsRA4CvRkVKKpMxEKQhGlz_51Wq_QneVs3nz-sPj0HB0mpHHCIb5AB8Nm64_R7fb7cNFvXia7-QniZRco
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=A+Graph+Theoretic+Criterion+for+Determining+the+Number+of+Clusters+in+a+Data+Set&rft.jtitle=Multivariate+behavioral+research&rft.au=Krolak-Schwedt%2C+S&rft.au=Eckes%2C+T&rft.date=1992-10-01&rft.issn=0027-3171&rft.volume=27&rft.issue=4&rft.spage=541&rft_id=info:doi/10.1207%2Fs15327906mbr2704_3&rft_id=info%3Apmid%2F26811133&rft.externalDocID=26811133
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0027-3171&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0027-3171&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0027-3171&client=summon