A Graph Theoretic Criterion for Determining the Number of Clusters in a Data Set
This article is concerned with procedures for determining the number of clusters in a data set. Most of the procedures or stopping rules currently in use involve finding internally coherent and externally isolated clusters, but do not derive from the formal structure of the respective clustering mod...
| Published in: | Multivariate behavioral research Vol. 27; no. 4; pp. 541-565 |
|---|---|
| Main authors: | Krolak-Schwedt, Sabine; Eckes, Thomas |
| Format: | Journal Article |
| Language: | English |
| Publication details: | Fort Worth, TX; Lawrence Erlbaum Associates, Inc; Society of Multivariate Experimental Psychology; 01.10.1992 |
| Subject: | |
| ISSN: | 0027-3171, 1532-7906 |
| Abstract | This article is concerned with procedures for determining the number of clusters in a data set. Most of the procedures or stopping rules currently in use involve finding internally coherent and externally isolated clusters, but do not derive from the formal structure of the respective clustering model. Based on the graph theoretic concepts of minimal spanning tree, maximal spanning tree, and homomorphic function, a new criterion is advanced that yields a well-defined clustering solution. Its performance in determining the number of clusters in several empirical data sets is evaluated by comparing it to four prominent stopping rules. It is shown that the proposed criterion not only possesses mathematically attractive properties but also may contribute to solving the number-of-clusters problem. |
|---|---|
| AbstractList | Procedures for determining the number of clusters in a data set are explored. A proposed stopping rule, the GRAPH criterion, is compared to four stopping rules currently in use. The GRAPH criterion's mathematically attractive properties and utility in solving the number-of-clusters problem are demonstrated. (SLD) |
| Author | Eckes, Thomas; Krolak-Schwedt, Sabine |
| Author_xml | – sequence: 1 givenname: Sabine surname: Krolak-Schwedt fullname: Krolak-Schwedt, Sabine – sequence: 2 givenname: Thomas surname: Eckes fullname: Eckes, Thomas |
| BackLink | http://eric.ed.gov/ERICWebPortal/detail?accno=EJ464856$$DView record in ERIC http://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=4754570$$DView record in Pascal Francis https://www.ncbi.nlm.nih.gov/pubmed/26811133$$D View this record in MEDLINE/PubMed |
| CODEN | MVBRAV |
| CitedBy_id | crossref_primary_10_1016_j_foodqual_2013_12_004 crossref_primary_10_1038_s41467_020_15956_9 crossref_primary_10_1037_0021_843X_109_1_74 crossref_primary_10_3390_vaccines9121428 crossref_primary_10_1081_STA_120037266 crossref_primary_10_1080_24709360_2019_1615770 crossref_primary_10_1007_s00357_010_9069_1 crossref_primary_10_3389_fphar_2023_1136184 crossref_primary_10_1016_S0898_1221_99_00090_5 |
| Cites_doi | 10.1007/BF02294245 10.1007/BF02294065 10.1007/BF02294012 10.2307/2529943 10.1007/BF01890078 10.1037/h0029393 10.1037//0033-2909.83.6.1072 10.1177/001316448004000320 10.1177/014662168701100401 10.1207/s15327906mbr1603_7 10.1007/BF02296969 10.1007/BF01908064 10.1146/annurev.es.05.110174.000533 10.1090/S0002-9939-1956-0078686-7 10.2307/2344237 10.4135/9781412986359 10.1207/s15327906mbr2003_4 10.1002/9780470316641 10.1111/j.2044-8317.1974.tb00524.x 10.1080/01621459.1975.10480256 10.1207/s15327906mbr1004_7 |
| ContentType | Journal Article |
| Copyright | Copyright Taylor & Francis Group, LLC 1992; 1993 INIST-CNRS |
| Copyright_xml | – notice: Copyright Taylor & Francis Group, LLC 1992 – notice: 1993 INIST-CNRS |
| DBID | AAYXX CITATION 7SW BJH BNH BNI BNJ BNO ERI PET REK WWN IQODW NPM 7X8 |
| DOI | 10.1207/s15327906mbr2704_3 |
| DatabaseName | CrossRef ERIC ERIC (Ovid) ERIC ERIC ERIC (Legacy Platform) ERIC( SilverPlatter ) ERIC ERIC PlusText (Legacy Platform) Education Resources Information Center (ERIC) ERIC Pascal-Francis PubMed MEDLINE - Academic |
| DatabaseTitle | CrossRef ERIC PubMed MEDLINE - Academic |
| DatabaseTitleList | PubMed MEDLINE - Academic ERIC |
| Database_xml | – sequence: 1 dbid: NPM name: PubMed url: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 2 dbid: 7X8 name: MEDLINE - Academic url: https://search.proquest.com/medline sourceTypes: Aggregation Database |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Psychology |
| EISSN | 1532-7906 |
| ERIC | EJ464856 |
| EndPage | 565 |
| ExternalDocumentID | 26811133 4754570 EJ464856 10_1207_s15327906mbr2704_3 9677200 |
| Genre | Journal Article |
| IEDL.DBID | TFW |
| ISICitedReferencesCount | 9 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=10_1207_s15327906mbr2704_3&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| ISSN | 0027-3171 |
| IngestDate | Wed Oct 01 14:12:39 EDT 2025 Thu Apr 03 07:01:24 EDT 2025 Sun Oct 29 17:07:48 EDT 2023 Tue Dec 02 16:32:33 EST 2025 Sat Nov 29 06:40:25 EST 2025 Tue Nov 18 21:15:15 EST 2025 Mon Oct 20 23:41:18 EDT 2025 |
| IsDoiOpenAccess | false |
| IsOpenAccess | true |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 4 |
| Keywords | Cluster analysis; Statistical method; Selection criterion; Stopping rule; Graph theory |
| Language | English |
| License | CC BY 4.0 |
| LinkModel | DirectLink |
| Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
| OpenAccessLink | http://orbilu.uni.lu/handle/10993/8214 |
| PMID | 26811133 |
| PQID | 1760902219 |
| PQPubID | 23479 |
| PageCount | 25 |
| ParticipantIDs | crossref_citationtrail_10_1207_s15327906mbr2704_3 eric_primary_EJ464856 proquest_miscellaneous_1760902219 informaworld_taylorfrancis_310_1207_s15327906mbr2704_3 pascalfrancis_primary_4754570 pubmed_primary_26811133 crossref_primary_10_1207_s15327906mbr2704_3 |
| PublicationCentury | 1900 |
| PublicationDate | 1992-10-01 |
| PublicationDateYYYYMMDD | 1992-10-01 |
| PublicationDate_xml | – month: 10 year: 1992 text: 1992-10-01 day: 01 |
| PublicationDecade | 1990 |
| PublicationPlace | Fort Worth, TX |
| PublicationPlace_xml | – name: Fort Worth, TX – name: United States |
| PublicationTitle | Multivariate behavioral research |
| PublicationTitleAlternate | Multivariate Behav Res |
| PublicationYear | 1992 |
| Publisher | Lawrence Erlbaum Associates, Inc; Society of Multivariate Experimental Psychology |
| Publisher_xml | – name: Lawrence Erlbaum Associates, Inc – name: Society of Multivariate Experimental Psychology |
| SSID | ssj0006549 |
| Score | 1.4180921 |
| SourceID | proquest pubmed pascalfrancis eric crossref informaworld |
| SourceType | Aggregation Database Index Database Enrichment Source Publisher |
| StartPage | 541 |
| SubjectTerms | Biological and medical sciences; Cluster Analysis; Data Collection; Equations (Mathematics); Evaluation Criteria; Fundamental and applied biological sciences. Psychology; Graph Theory; Hierarchical Cluster Analysis; Mathematical Models; Psychology. Psychoanalysis. Psychiatry; Psychology. Psychophysiology; Psychometrics. Statistics. Methodology; Statistics. Mathematics; Stopping Rules |
| Title | A Graph Theoretic Criterion for Determining the Number of Clusters in a Data Set |
| URI | https://www.tandfonline.com/doi/abs/10.1207/s15327906mbr2704_3 http://eric.ed.gov/ERICWebPortal/detail?accno=EJ464856 https://www.ncbi.nlm.nih.gov/pubmed/26811133 https://www.proquest.com/docview/1760902219 |
| Volume | 27 |