PERFORMANCE COMPARISON OF K-MEANS, PARALLEL K-MEANS AND K-MEANS++.

Uloženo v:
Podrobná bibliografie
Název: PERFORMANCE COMPARISON OF K-MEANS, PARALLEL K-MEANS AND K-MEANS++.
Autoři: Aliguliyev, Ramiz, Tahirzada, Shalala F.
Zdroj: Reliability: Theory & Applications; 2025 Special Issue, Vol. 20, p169-176, 8p
Témata: CLUSTERING algorithms, PATTERN recognition systems, DATA structures, PARALLEL programming, MULTICORE processors, K-means clustering
Abstrakt: K-means clustering is a fundamental unsupervised machine learning technique widely applied in various domains such as data analysis, pattern recognition, and clustering-based tasks. However, its efficiency and scalability can be challenged, particularly when dealing with large-scale datasets and complex data structures. This thesis explores strategies to improve the performance of the K-means clustering algorithm through parallelism and iterative techniques. Parallelism leverages modern parallel computing architectures, including multi-core processors and distributed frameworks like Apache Spark, to enhance computational efficiency and scalability. On the other hand, an iterative approach involves refining clustering results through multiple iterations, adjusting cluster centroids, and optimizing convergence criteria. It delves into the design frameworks of these approaches, highlighting their respective advantages and limitations. Comparative analyses are conducted to evaluate the effectiveness of parallelism and iterative techniques in terms of execution time, scalability, clustering accuracy, and convergence speed. The findings contribute to advancing the understanding of how parallelism and iterative strategies can significantly improve K-means clustering performance, especially in the context of big data and complex datasets. By comparatively analyzing parallelism and iterative approaches, this paper aims to contribute to the development of more efficient and scalable clustering algorithms in the Big Data context. [ABSTRACT FROM AUTHOR]
Copyright of Reliability: Theory & Applications is the property of International Group on Reliability and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Databáze: Complementary Index
FullText Text:
  Availability: 0
CustomLinks:
  – Url: https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=EBSCO&SrcAuth=EBSCO&DestApp=WOS&ServiceName=TransferToWoS&DestLinkType=GeneralSearchSummary&Func=Links&author=Aliguliyev%20R
    Name: ISI
    Category: fullText
    Text: Nájsť tento článok vo Web of Science
    Icon: https://imagesrvr.epnet.com/ls/20docs.gif
    MouseOverText: Nájsť tento článok vo Web of Science
Header DbId: edb
DbLabel: Complementary Index
An: 186169142
RelevancyScore: 1007
AccessLevel: 6
PubType: Academic Journal
PubTypeId: academicJournal
PreciseRelevancyScore: 1007.31671142578
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: PERFORMANCE COMPARISON OF K-MEANS, PARALLEL K-MEANS AND K-MEANS++.
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Aliguliyev%2C+Ramiz%22">Aliguliyev, Ramiz</searchLink><br /><searchLink fieldCode="AR" term="%22Tahirzada%2C+Shalala+F%2E%22">Tahirzada, Shalala F.</searchLink>
– Name: TitleSource
  Label: Source
  Group: Src
  Data: Reliability: Theory & Applications; 2025 Special Issue, Vol. 20, p169-176, 8p
– Name: Subject
  Label: Subject Terms
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22CLUSTERING+algorithms%22">CLUSTERING algorithms</searchLink><br /><searchLink fieldCode="DE" term="%22PATTERN+recognition+systems%22">PATTERN recognition systems</searchLink><br /><searchLink fieldCode="DE" term="%22DATA+structures%22">DATA structures</searchLink><br /><searchLink fieldCode="DE" term="%22PARALLEL+programming%22">PARALLEL programming</searchLink><br /><searchLink fieldCode="DE" term="%22MULTICORE+processors%22">MULTICORE processors</searchLink><br /><searchLink fieldCode="DE" term="%22K-means+clustering%22">K-means clustering</searchLink>
– Name: Abstract
  Label: Abstract
  Group: Ab
  Data: K-means clustering is a fundamental unsupervised machine learning technique widely applied in various domains such as data analysis, pattern recognition, and clustering-based tasks. However, its efficiency and scalability can be challenged, particularly when dealing with large-scale datasets and complex data structures. This thesis explores strategies to improve the performance of the K-means clustering algorithm through parallelism and iterative techniques. Parallelism leverages modern parallel computing architectures, including multi-core processors and distributed frameworks like Apache Spark, to enhance computational efficiency and scalability. On the other hand, an iterative approach involves refining clustering results through multiple iterations, adjusting cluster centroids, and optimizing convergence criteria. It delves into the design frameworks of these approaches, highlighting their respective advantages and limitations. Comparative analyses are conducted to evaluate the effectiveness of parallelism and iterative techniques in terms of execution time, scalability, clustering accuracy, and convergence speed. The findings contribute to advancing the understanding of how parallelism and iterative strategies can significantly improve K-means clustering performance, especially in the context of big data and complex datasets. By comparatively analyzing parallelism and iterative approaches, this paper aims to contribute to the development of more efficient and scalable clustering algorithms in the Big Data context. [ABSTRACT FROM AUTHOR]
– Name: Abstract
  Label:
  Group: Ab
  Data: <i>Copyright of Reliability: Theory & Applications is the property of International Group on Reliability and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract.</i> (Copyright applies to all Abstracts.)
PLink https://erproxy.cvtisr.sk/sfx/access?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edb&AN=186169142
RecordInfo BibRecord:
  BibEntity:
    Languages:
      – Code: eng
        Text: English
    PhysicalDescription:
      Pagination:
        PageCount: 8
        StartPage: 169
    Subjects:
      – SubjectFull: CLUSTERING algorithms
        Type: general
      – SubjectFull: PATTERN recognition systems
        Type: general
      – SubjectFull: DATA structures
        Type: general
      – SubjectFull: PARALLEL programming
        Type: general
      – SubjectFull: MULTICORE processors
        Type: general
      – SubjectFull: K-means clustering
        Type: general
    Titles:
      – TitleFull: PERFORMANCE COMPARISON OF K-MEANS, PARALLEL K-MEANS AND K-MEANS++.
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Aliguliyev, Ramiz
      – PersonEntity:
          Name:
            NameFull: Tahirzada, Shalala F.
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 02
              M: 01
              Text: 2025 Special Issue
              Type: published
              Y: 2025
          Identifiers:
            – Type: issn-print
              Value: 19322321
          Numbering:
            – Type: volume
              Value: 20
          Titles:
            – TitleFull: Reliability: Theory & Applications
              Type: main
ResultId 1