PERFORMANCE COMPARISON OF K-MEANS, PARALLEL K-MEANS AND K-MEANS++.
Uloženo v:
| Název: | PERFORMANCE COMPARISON OF K-MEANS, PARALLEL K-MEANS AND K-MEANS++. |
|---|---|
| Autoři: | Aliguliyev, Ramiz, Tahirzada, Shalala F. |
| Zdroj: | Reliability: Theory & Applications; 2025 Special Issue, Vol. 20, p169-176, 8p |
| Témata: | CLUSTERING algorithms, PATTERN recognition systems, DATA structures, PARALLEL programming, MULTICORE processors, K-means clustering |
| Abstrakt: | K-means clustering is a fundamental unsupervised machine learning technique widely applied in various domains such as data analysis, pattern recognition, and clustering-based tasks. However, its efficiency and scalability can be challenged, particularly when dealing with large-scale datasets and complex data structures. This thesis explores strategies to improve the performance of the K-means clustering algorithm through parallelism and iterative techniques. Parallelism leverages modern parallel computing architectures, including multi-core processors and distributed frameworks like Apache Spark, to enhance computational efficiency and scalability. On the other hand, an iterative approach involves refining clustering results through multiple iterations, adjusting cluster centroids, and optimizing convergence criteria. It delves into the design frameworks of these approaches, highlighting their respective advantages and limitations. Comparative analyses are conducted to evaluate the effectiveness of parallelism and iterative techniques in terms of execution time, scalability, clustering accuracy, and convergence speed. The findings contribute to advancing the understanding of how parallelism and iterative strategies can significantly improve K-means clustering performance, especially in the context of big data and complex datasets. By comparatively analyzing parallelism and iterative approaches, this paper aims to contribute to the development of more efficient and scalable clustering algorithms in the Big Data context. [ABSTRACT FROM AUTHOR] |
| Copyright of Reliability: Theory & Applications is the property of International Group on Reliability and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.) | |
| Databáze: | Complementary Index |
| FullText | Text: Availability: 0 CustomLinks: – Url: https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=EBSCO&SrcAuth=EBSCO&DestApp=WOS&ServiceName=TransferToWoS&DestLinkType=GeneralSearchSummary&Func=Links&author=Aliguliyev%20R Name: ISI Category: fullText Text: Nájsť tento článok vo Web of Science Icon: https://imagesrvr.epnet.com/ls/20docs.gif MouseOverText: Nájsť tento článok vo Web of Science |
|---|---|
| Header | DbId: edb DbLabel: Complementary Index An: 186169142 RelevancyScore: 1007 AccessLevel: 6 PubType: Academic Journal PubTypeId: academicJournal PreciseRelevancyScore: 1007.31671142578 |
| IllustrationInfo | |
| Items | – Name: Title Label: Title Group: Ti Data: PERFORMANCE COMPARISON OF K-MEANS, PARALLEL K-MEANS AND K-MEANS++. – Name: Author Label: Authors Group: Au Data: <searchLink fieldCode="AR" term="%22Aliguliyev%2C+Ramiz%22">Aliguliyev, Ramiz</searchLink><br /><searchLink fieldCode="AR" term="%22Tahirzada%2C+Shalala+F%2E%22">Tahirzada, Shalala F.</searchLink> – Name: TitleSource Label: Source Group: Src Data: Reliability: Theory & Applications; 2025 Special Issue, Vol. 20, p169-176, 8p – Name: Subject Label: Subject Terms Group: Su Data: <searchLink fieldCode="DE" term="%22CLUSTERING+algorithms%22">CLUSTERING algorithms</searchLink><br /><searchLink fieldCode="DE" term="%22PATTERN+recognition+systems%22">PATTERN recognition systems</searchLink><br /><searchLink fieldCode="DE" term="%22DATA+structures%22">DATA structures</searchLink><br /><searchLink fieldCode="DE" term="%22PARALLEL+programming%22">PARALLEL programming</searchLink><br /><searchLink fieldCode="DE" term="%22MULTICORE+processors%22">MULTICORE processors</searchLink><br /><searchLink fieldCode="DE" term="%22K-means+clustering%22">K-means clustering</searchLink> – Name: Abstract Label: Abstract Group: Ab Data: K-means clustering is a fundamental unsupervised machine learning technique widely applied in various domains such as data analysis, pattern recognition, and clustering-based tasks. However, its efficiency and scalability can be challenged, particularly when dealing with large-scale datasets and complex data structures. This thesis explores strategies to improve the performance of the K-means clustering algorithm through parallelism and iterative techniques. Parallelism leverages modern parallel computing architectures, including multi-core processors and distributed frameworks like Apache Spark, to enhance computational efficiency and scalability. On the other hand, an iterative approach involves refining clustering results through multiple iterations, adjusting cluster centroids, and optimizing convergence criteria. It delves into the design frameworks of these approaches, highlighting their respective advantages and limitations. Comparative analyses are conducted to evaluate the effectiveness of parallelism and iterative techniques in terms of execution time, scalability, clustering accuracy, and convergence speed. The findings contribute to advancing the understanding of how parallelism and iterative strategies can significantly improve K-means clustering performance, especially in the context of big data and complex datasets. By comparatively analyzing parallelism and iterative approaches, this paper aims to contribute to the development of more efficient and scalable clustering algorithms in the Big Data context. [ABSTRACT FROM AUTHOR] – Name: Abstract Label: Group: Ab Data: <i>Copyright of Reliability: Theory & Applications is the property of International Group on Reliability and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract.</i> (Copyright applies to all Abstracts.) |
| PLink | https://erproxy.cvtisr.sk/sfx/access?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edb&AN=186169142 |
| RecordInfo | BibRecord: BibEntity: Languages: – Code: eng Text: English PhysicalDescription: Pagination: PageCount: 8 StartPage: 169 Subjects: – SubjectFull: CLUSTERING algorithms Type: general – SubjectFull: PATTERN recognition systems Type: general – SubjectFull: DATA structures Type: general – SubjectFull: PARALLEL programming Type: general – SubjectFull: MULTICORE processors Type: general – SubjectFull: K-means clustering Type: general Titles: – TitleFull: PERFORMANCE COMPARISON OF K-MEANS, PARALLEL K-MEANS AND K-MEANS++. Type: main BibRelationships: HasContributorRelationships: – PersonEntity: Name: NameFull: Aliguliyev, Ramiz – PersonEntity: Name: NameFull: Tahirzada, Shalala F. IsPartOfRelationships: – BibEntity: Dates: – D: 02 M: 01 Text: 2025 Special Issue Type: published Y: 2025 Identifiers: – Type: issn-print Value: 19322321 Numbering: – Type: volume Value: 20 Titles: – TitleFull: Reliability: Theory & Applications Type: main |
| ResultId | 1 |
Nájsť tento článok vo Web of Science