Network Information Processing Analysis Based on Big Data Parallel Graph Partitioning Algorithm

This research proposes an efficient parallel graph partitioning algorithm for the big data environment, aiming to solve the bottlenecks of traditional clustering techniques in terms of processing speed and scalability. The algorithm adopts a multi-level graph partitioning framework, decomposing the...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Journal of computing and information technology Ročník 33; číslo 3; s. 139 - 155
Hlavní autoři: Guan, Keqing, Kong, Xianli
Médium: Journal Article Paper
Jazyk:angličtina
Vydáno: Sveuciliste U Zagrebu 01.09.2025
Sveučilište u Zagrebu Fakultet elektrotehnike i računarstva
University of Zagreb Faculty of Electrical Engineering and Computing
Témata:
ISSN:1330-1136, 1846-3908
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Abstract This research proposes an efficient parallel graph partitioning algorithm for the big data environment, aiming to solve the bottlenecks of traditional clustering techniques in terms of processing speed and scalability. The algorithm adopts a multi-level graph partitioning framework, decomposing the network information processing task into multiple levels, gradually simplifying the graph structure and backtracking refinement, thereby significantly reducing the computational complexity while ensuring the partitioning quality. The algorithm focuses on balancing the node cohesion within partitions and the edge cutting cost of inter-partition communication. By constructing a global objective function, it minimizes the number of edges across partitions and the workload differences among various sub-graphs, thereby achieving a more balanced partitioning result. The research results show that this algorithm achieves a resource utilization rate of 0.95. In the Hadoop cluster environment, 95% of the computing resources are effectively used for actual task processing, which is significantly higher than that of the competing algorithms. The energy efficiency ratio reaches 0.98, indicating that the number of tasks completed per unit of energy consumption is close to the optimal level, which is superior to the 0.78 to 0.67 range of existing methods, reflecting the advantages of this algorithm in green computing. The load imbalance rate is only 0.00395, and the point weight imbalance rate is 0.00141, which are much lower values than those of the comparison algorithm. This indicates that the algorithm achieves a high degree of balance in task allocation and node weight distribution, effectively avoiding resource waste and performance bottlenecks.
AbstractList This research proposes an efficient parallel graph partitioning algorithm for the big data environment, aiming to solve the bottlenecks of traditional clustering techniques in terms of processing speed and scalability. The algorithm adopts a multi-level graph partitioning framework, decomposing the network information processing task into multiple levels, gradually simplifying the graph structure and backtracking refinement, thereby significantly reducing the computational complexity while ensuring the partitioning quality. The algorithm focuses on balancing the node cohesion within partitions and the edge cutting cost of inter-partition communication. By constructing a global objective function, it minimizes the number of edges across partitions and the workload differences among various sub-graphs, thereby achieving a more balanced partitioning result. The research results show that this algorithm achieves a resource utilization rate of 0.95. In the Hadoop cluster environment, 95% of the computing resources are effectively used for actual task processing, which is significantly higher than that of the competing algorithms. The energy efficiency ratio reaches 0.98, indicating that the number of tasks completed per unit of energy consumption is close to the optimal level, which is superior to the 0.78 to 0.67 range of existing methods, reflecting the advantages of this algorithm in green computing. The load imbalance rate is only 0.00395, and the point weight imbalance rate is 0.00141, which are much lower values than those of the comparison algorithm. This indicates that the algorithm achieves a high degree of balance in task allocation and node weight distribution, effectively avoiding resource waste and performance bottlenecks.
This research proposes an efficient parallel graph partitioning algorithm for the big data environment, aiming to solve the bottlenecks of traditional clustering techniques in terms of processing speed and scalability. The algorithm adopts a multi-level graph partitioning framework, decomposing the network information processing task into multiple levels, gradually simplifying the graph structure and backtracking refinement, thereby significantly reducing the computational complexity while ensuring the partitioning quality. The algorithm focuses on balancing the node cohesion within partitions and the edge cutting cost of inter-partition communication. By constructing a global objective function, it minimizes the number of edges across partitions and the workload differences among various sub-graphs, thereby achieving a more balanced partitioning result. The research results show that this algorithm achieves a resource utilization rate of 0.95. In the Hadoop cluster environment, 95% of the computing resources are effectively used for actual task processing, which is significantly higher than that of the competing algorithms. The energy efficiency ratio reaches 0.98, indicating that the number of tasks completed per unit of energy consumption is close to the optimal level, which is superior to the 0.78 to 0.67 range of existing methods, reflecting the advantages of this algorithm in green computing. The load imbalance rate is only 0.00395, and the point weight imbalance rate is 0.00141, which are much lower values than those of the comparison algorithm. This indicates that the algorithm achieves a high degree of balance in task allocation and node weight distribution, effectively avoiding resource waste and performance bottlenecks. ACM CCS (2012) Classification: Information systems [right arrow] Data management systems [right arrow] Database design and models [right arrow] Graph-based database models [right arrow] Network data models Keywords: big data, parallel graph partitioning algorithm, network information processing, distributed, network split
Audience Academic
Author Guan, Keqing
Kong, Xianli
Author_xml – sequence: 1
  givenname: Keqing
  surname: Guan
  fullname: Guan, Keqing
  organization: Institute for Big Data Research, Liaoning University of International Business and Economcs, Dalian, China
– sequence: 2
  givenname: Xianli
  surname: Kong
  fullname: Kong, Xianli
  organization: School of Economics, Dongbei University of Finance & Economics, Dalian, China
BookMark eNptUk1P3DAUtCoqQSk_gFuknnrI1t-xjwttYSXUImjPluOPrCGJkW3U8u_xblCllWof_N54ZvRkzwdwNMfZAXCO4ApDRvAXE0qtMFshCJmE-B04QYLylkgojmpNCGwRIvwYnOX8AOsiknOKToD64cqfmB6bzexjmnQJcW5uUzQu5zAPzXrW40sOubnQ2dmmXl6Eofmqi25uddLj6MbmKumn7a4tYSffy8YhplC200fw3usxu7O38xT8_v7t1-V1e_PzanO5vmkN4Ri3yHLvPEYMW8o7R3rWEyNE1yGOjWS9ENwR1nFBHZHYSo88FtqKnknK-gqegs3ia6N-UE8pTDq9qKiD2gMxDWo3nxmd8lYSiqjwXddT7ry0WArhobeGek509WoXr20y-vHAbEFyMq6WihAOYVf5nxb-oKt9qA9ZkjZTyEatBZOCS0RYZa3-w6rbuimY-qE-VPxA8PlAUDnF_S2Dfs5Zbe7vDrlo4ZoUc07O_5saQbXPiKoZUbuMqLeMkFfcS67x
CODEN CJCTEM
ContentType Journal Article
Paper
Copyright COPYRIGHT 2025 Sveuciliste U Zagrebu
Copyright_xml – notice: COPYRIGHT 2025 Sveuciliste U Zagrebu
DBID AAYXX
CITATION
ISR
VP8
DOA
DOI 10.20532/cit.2025.1005902
DatabaseName CrossRef
Gale In Context: Science
Portal of Croatian Scientific and Professional Journals – HRČAK
DOAJ Directory of Open Access Journals
DatabaseTitle CrossRef
DatabaseTitleList
CrossRef




Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISSN 1846-3908
EndPage 155
ExternalDocumentID oai_doaj_org_article_fd934148f77b46ef9d2988f0fdc4f63a
oai_hrcak_srce_hr_336007
A859869135
10_20532_cit_2025_1005902
GeographicLocations China
GeographicLocations_xml – name: China
GroupedDBID .4S
.DC
29B
29K
2WC
5GY
5VS
77I
AAYXX
ADMLS
ALMA_UNASSIGNED_HOLDINGS
ARCSS
BAIFH
BBTPI
CITATION
CS3
D-I
DU5
E3Z
EBS
EDO
EJD
EN8
EOJEC
GROUPED_DOAJ
I-F
IAO
ICD
ISR
ITC
IVC
KQ8
KWQ
MK~
ML~
M~E
OBODZ
OK1
OVT
P2P
PV9
RZL
TR2
TUS
VP8
XH6
ID FETCH-LOGICAL-c3622-1d6fef2152d467e3b5b3c8877162c95b886e357684e392d9f1f28ad8b5945b4e3
IEDL.DBID DOA
ISSN 1330-1136
IngestDate Fri Oct 03 12:41:29 EDT 2025
Tue Sep 30 04:10:28 EDT 2025
Sat Nov 29 13:46:57 EST 2025
Sat Nov 29 10:29:19 EST 2025
Thu Nov 13 15:56:47 EST 2025
Sat Nov 29 07:26:24 EST 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 3
Language English
License cc-by-nd: openAccess
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c3622-1d6fef2152d467e3b5b3c8877162c95b886e357684e392d9f1f28ad8b5945b4e3
Notes 336007
OpenAccessLink https://doaj.org/article/fd934148f77b46ef9d2988f0fdc4f63a
PageCount 17
ParticipantIDs doaj_primary_oai_doaj_org_article_fd934148f77b46ef9d2988f0fdc4f63a
hrcak_primary_oai_hrcak_srce_hr_336007
gale_infotracmisc_A859869135
gale_infotracacademiconefile_A859869135
gale_incontextgauss_ISR_A859869135
crossref_primary_10_20532_cit_2025_1005902
PublicationCentury 2000
PublicationDate 20250901
PublicationDateYYYYMMDD 2025-09-01
PublicationDate_xml – month: 09
  year: 2025
  text: 20250901
  day: 01
PublicationDecade 2020
PublicationTitle Journal of computing and information technology
PublicationYear 2025
Publisher Sveuciliste U Zagrebu
Sveučilište u Zagrebu Fakultet elektrotehnike i računarstva
University of Zagreb Faculty of Electrical Engineering and Computing
Publisher_xml – name: Sveuciliste U Zagrebu
– name: Sveučilište u Zagrebu Fakultet elektrotehnike i računarstva
– name: University of Zagreb Faculty of Electrical Engineering and Computing
SSID ssj0000396641
Score 2.3158536
Snippet This research proposes an efficient parallel graph partitioning algorithm for the big data environment, aiming to solve the bottlenecks of traditional...
SourceID doaj
hrcak
gale
crossref
SourceType Open Website
Open Access Repository
Aggregation Database
Index Database
StartPage 139
SubjectTerms Algorithms
Big data
Database design
distributed, network split
Electronic data processing
Energy efficiency
Methods
network information processing
parallel graph partitioning algorithm
Social networks
Title Network Information Processing Analysis Based on Big Data Parallel Graph Partitioning Algorithm
URI https://hrcak.srce.hr/336007
https://doaj.org/article/fd934148f77b46ef9d2988f0fdc4f63a
Volume 33
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVAON
  databaseName: DOAJ Directory of Open Access Journals
  customDbUrl:
  eissn: 1846-3908
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0000396641
  issn: 1330-1136
  databaseCode: DOA
  dateStart: 20160101
  isFulltext: true
  titleUrlDefault: https://www.doaj.org/
  providerName: Directory of Open Access Journals
– providerCode: PRVHPJ
  databaseName: ROAD: Directory of Open Access Scholarly Resources
  customDbUrl:
  eissn: 1846-3908
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0000396641
  issn: 1330-1136
  databaseCode: M~E
  dateStart: 20000101
  isFulltext: true
  titleUrlDefault: https://road.issn.org
  providerName: ISSN International Centre
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1Lb9QwELZQ4cCFN2KhIAshkJCiJraT2McutMCBVcVD6s3yc7ui7KJsypHfzoztrTYnLlyixJ5I9jfOzFgZf0PIq9YLqyJTVROFqoTtJNpBV_nGM8GZApdZp2IT_WIhz8_V2V6pL8wJy_TAGbij6BUYWiFj31vRhag8U1LGOnonYsdTaFT3am8zlWwwhzBeNPk3JsPqB0duhamTrMW8AOQsmTiixNd_bZVvXgzO_NjzM6f3yJ0SINLjPLD75EZYPyB3d8UXaPkWHxK9yAnctBwoQoBpSfsHd0R3bCN0Dn7KU-icr5b0vRkNPTMDVlC5pB-QrRofM2NReu1yuRlW48XPR-T76cm3dx-rUi2hcuCEWNX4LoaIZWo9GL_AbWu5AxOCFFFOtVbKLnDcXYgAMZFXsYlMGi9tq0RrofExOVhv1uEJoag85UPHTB9FbZRVEvAORvA6eNuEGXm7g07_yqQYGjYTCWcNOGvEWRecZ2SO4F4LIp91agAt66Jl_S8tz8hLVI1Gxoo1psQszdV2qz99_aKPJVLMq4a3M_KmCMXNOBhnygkDmBSSXE0kDyeS8Em5SffrtAImY84t28EFuNWcI7H_0_8xt2fkNuKVE9cOycE4XIXn5Jb7Pa62w4u0ruH6-c_JX_Nd_Rc
linkProvider Directory of Open Access Journals
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Network+Information+Processing+Analysis+Based+on+Big+Data+Parallel+Graph+Partitioning+Algorithm&rft.jtitle=Journal+of+computing+and+information+technology&rft.au=Kong%2C+Keqing+Guan+and+Xianli&rft.date=2025-09-01&rft.pub=Sveuciliste+U+Zagrebu&rft.issn=1330-1136&rft.volume=33&rft.issue=3&rft.spage=139&rft_id=info:doi/10.20532%2Fcit.2025.1005902&rft.externalDocID=A859869135
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1330-1136&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1330-1136&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1330-1136&client=summon