Network Information Processing Analysis Based on Big Data Parallel Graph Partitioning Algorithm
This research proposes an efficient parallel graph partitioning algorithm for the big data environment, aiming to solve the bottlenecks of traditional clustering techniques in terms of processing speed and scalability. The algorithm adopts a multi-level graph partitioning framework, decomposing the...
Saved in:
| Published in: | Journal of computing and information technology Vol. 33; no. 3; pp. 139 - 155 |
|---|---|
| Main Authors: | , |
| Format: | Journal Article Paper |
| Language: | English |
| Published: |
Sveuciliste U Zagrebu
01.09.2025
Sveučilište u Zagrebu Fakultet elektrotehnike i računarstva University of Zagreb Faculty of Electrical Engineering and Computing |
| Subjects: | |
| ISSN: | 1330-1136, 1846-3908 |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Abstract | This research proposes an efficient parallel graph partitioning algorithm for the big data environment, aiming to solve the bottlenecks of traditional clustering techniques in terms of processing speed and scalability. The algorithm adopts a multi-level graph partitioning framework, decomposing the network information processing task into multiple levels, gradually simplifying the graph structure and backtracking refinement, thereby significantly reducing the computational complexity while ensuring the partitioning quality. The algorithm focuses on balancing the node cohesion within partitions and the edge cutting cost of inter-partition communication. By constructing a global objective function, it minimizes the number of edges across partitions and the workload differences among various sub-graphs, thereby achieving a more balanced partitioning result. The research results show that this algorithm achieves a resource utilization rate of 0.95. In the Hadoop cluster environment, 95% of the computing resources are effectively used for actual task processing, which is significantly higher than that of the competing algorithms. The energy efficiency ratio reaches 0.98, indicating that the number of tasks completed per unit of energy consumption is close to the optimal level, which is superior to the 0.78 to 0.67 range of existing methods, reflecting the advantages of this algorithm in green computing. The load imbalance rate is only 0.00395, and the point weight imbalance rate is 0.00141, which are much lower values than those of the comparison algorithm. This indicates that the algorithm achieves a high degree of balance in task allocation and node weight distribution, effectively avoiding resource waste and performance bottlenecks. |
|---|---|
| AbstractList | This research proposes an efficient parallel graph partitioning algorithm for the big data environment, aiming to solve the bottlenecks of traditional clustering techniques in terms of processing speed and scalability. The algorithm adopts a multi-level graph partitioning framework, decomposing the network information processing task into multiple levels, gradually simplifying the graph structure and backtracking refinement, thereby significantly reducing the computational complexity while ensuring the partitioning quality. The algorithm focuses on balancing the node cohesion within partitions and the edge cutting cost of inter-partition communication. By constructing a global objective function, it minimizes the number of edges across partitions and the workload differences among various sub-graphs, thereby achieving a more balanced partitioning result. The research results show that this algorithm achieves a resource utilization rate of 0.95. In the Hadoop cluster environment, 95% of the computing resources are effectively used for actual task processing, which is significantly higher than that of the competing algorithms. The energy efficiency ratio reaches 0.98, indicating that the number of tasks completed per unit of energy consumption is close to the optimal level, which is superior to the 0.78 to 0.67 range of existing methods, reflecting the advantages of this algorithm in green computing. The load imbalance rate is only 0.00395, and the point weight imbalance rate is 0.00141, which are much lower values than those of the comparison algorithm. This indicates that the algorithm achieves a high degree of balance in task allocation and node weight distribution, effectively avoiding resource waste and performance bottlenecks. This research proposes an efficient parallel graph partitioning algorithm for the big data environment, aiming to solve the bottlenecks of traditional clustering techniques in terms of processing speed and scalability. The algorithm adopts a multi-level graph partitioning framework, decomposing the network information processing task into multiple levels, gradually simplifying the graph structure and backtracking refinement, thereby significantly reducing the computational complexity while ensuring the partitioning quality. The algorithm focuses on balancing the node cohesion within partitions and the edge cutting cost of inter-partition communication. By constructing a global objective function, it minimizes the number of edges across partitions and the workload differences among various sub-graphs, thereby achieving a more balanced partitioning result. The research results show that this algorithm achieves a resource utilization rate of 0.95. In the Hadoop cluster environment, 95% of the computing resources are effectively used for actual task processing, which is significantly higher than that of the competing algorithms. The energy efficiency ratio reaches 0.98, indicating that the number of tasks completed per unit of energy consumption is close to the optimal level, which is superior to the 0.78 to 0.67 range of existing methods, reflecting the advantages of this algorithm in green computing. The load imbalance rate is only 0.00395, and the point weight imbalance rate is 0.00141, which are much lower values than those of the comparison algorithm. This indicates that the algorithm achieves a high degree of balance in task allocation and node weight distribution, effectively avoiding resource waste and performance bottlenecks. ACM CCS (2012) Classification: Information systems [right arrow] Data management systems [right arrow] Database design and models [right arrow] Graph-based database models [right arrow] Network data models Keywords: big data, parallel graph partitioning algorithm, network information processing, distributed, network split |
| Audience | Academic |
| Author | Guan, Keqing Kong, Xianli |
| Author_xml | – sequence: 1 givenname: Keqing surname: Guan fullname: Guan, Keqing organization: Institute for Big Data Research, Liaoning University of International Business and Economcs, Dalian, China – sequence: 2 givenname: Xianli surname: Kong fullname: Kong, Xianli organization: School of Economics, Dongbei University of Finance & Economics, Dalian, China |
| BookMark | eNptUk1P3DAUtCoqQSk_gFuknnrI1t-xjwttYSXUImjPluOPrCGJkW3U8u_xblCllWof_N54ZvRkzwdwNMfZAXCO4ApDRvAXE0qtMFshCJmE-B04QYLylkgojmpNCGwRIvwYnOX8AOsiknOKToD64cqfmB6bzexjmnQJcW5uUzQu5zAPzXrW40sOubnQ2dmmXl6Eofmqi25uddLj6MbmKumn7a4tYSffy8YhplC200fw3usxu7O38xT8_v7t1-V1e_PzanO5vmkN4Ri3yHLvPEYMW8o7R3rWEyNE1yGOjWS9ENwR1nFBHZHYSo88FtqKnknK-gqegs3ia6N-UE8pTDq9qKiD2gMxDWo3nxmd8lYSiqjwXddT7ry0WArhobeGek509WoXr20y-vHAbEFyMq6WihAOYVf5nxb-oKt9qA9ZkjZTyEatBZOCS0RYZa3-w6rbuimY-qE-VPxA8PlAUDnF_S2Dfs5Zbe7vDrlo4ZoUc07O_5saQbXPiKoZUbuMqLeMkFfcS67x |
| CODEN | CJCTEM |
| ContentType | Journal Article Paper |
| Copyright | COPYRIGHT 2025 Sveuciliste U Zagrebu |
| Copyright_xml | – notice: COPYRIGHT 2025 Sveuciliste U Zagrebu |
| DBID | AAYXX CITATION ISR VP8 DOA |
| DOI | 10.20532/cit.2025.1005902 |
| DatabaseName | CrossRef Gale In Context: Science Portal of Croatian Scientific and Professional Journals – HRČAK Directory of Open Access Journals |
| DatabaseTitle | CrossRef |
| DatabaseTitleList | CrossRef |
| Database_xml | – sequence: 1 dbid: DOA name: DOAJ (Directory of Open Access Journals) url: https://www.doaj.org/ sourceTypes: Open Website |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Computer Science |
| EISSN | 1846-3908 |
| EndPage | 155 |
| ExternalDocumentID | oai_doaj_org_article_fd934148f77b46ef9d2988f0fdc4f63a oai_hrcak_srce_hr_336007 A859869135 10_20532_cit_2025_1005902 |
| GeographicLocations | China |
| GeographicLocations_xml | – name: China |
| GroupedDBID | .4S .DC 29B 29K 2WC 5GY 5VS 77I AAYXX ADMLS ALMA_UNASSIGNED_HOLDINGS ARCSS BAIFH BBTPI CITATION CS3 D-I DU5 E3Z EBS EDO EJD EN8 EOJEC GROUPED_DOAJ I-F IAO ICD ISR ITC IVC KQ8 KWQ MK~ ML~ M~E OBODZ OK1 OVT P2P PV9 RZL TR2 TUS VP8 XH6 |
| ID | FETCH-LOGICAL-c3622-1d6fef2152d467e3b5b3c8877162c95b886e357684e392d9f1f28ad8b5945b4e3 |
| IEDL.DBID | DOA |
| ISSN | 1330-1136 |
| IngestDate | Fri Oct 03 12:41:29 EDT 2025 Tue Sep 30 04:10:28 EDT 2025 Sat Nov 29 13:46:57 EST 2025 Sat Nov 29 10:29:19 EST 2025 Thu Nov 13 15:56:47 EST 2025 Sat Nov 29 07:26:24 EST 2025 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 3 |
| Language | English |
| License | cc-by-nd: openAccess |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-c3622-1d6fef2152d467e3b5b3c8877162c95b886e357684e392d9f1f28ad8b5945b4e3 |
| Notes | 336007 |
| OpenAccessLink | https://doaj.org/article/fd934148f77b46ef9d2988f0fdc4f63a |
| PageCount | 17 |
| ParticipantIDs | doaj_primary_oai_doaj_org_article_fd934148f77b46ef9d2988f0fdc4f63a hrcak_primary_oai_hrcak_srce_hr_336007 gale_infotracmisc_A859869135 gale_infotracacademiconefile_A859869135 gale_incontextgauss_ISR_A859869135 crossref_primary_10_20532_cit_2025_1005902 |
| PublicationCentury | 2000 |
| PublicationDate | 20250901 |
| PublicationDateYYYYMMDD | 2025-09-01 |
| PublicationDate_xml | – month: 09 year: 2025 text: 20250901 day: 01 |
| PublicationDecade | 2020 |
| PublicationTitle | Journal of computing and information technology |
| PublicationYear | 2025 |
| Publisher | Sveuciliste U Zagrebu Sveučilište u Zagrebu Fakultet elektrotehnike i računarstva University of Zagreb Faculty of Electrical Engineering and Computing |
| Publisher_xml | – name: Sveuciliste U Zagrebu – name: Sveučilište u Zagrebu Fakultet elektrotehnike i računarstva – name: University of Zagreb Faculty of Electrical Engineering and Computing |
| SSID | ssj0000396641 |
| Score | 2.3157592 |
| Snippet | This research proposes an efficient parallel graph partitioning algorithm for the big data environment, aiming to solve the bottlenecks of traditional... |
| SourceID | doaj hrcak gale crossref |
| SourceType | Open Website Open Access Repository Aggregation Database Index Database |
| StartPage | 139 |
| SubjectTerms | Algorithms Big data Database design distributed, network split Electronic data processing Energy efficiency Methods network information processing parallel graph partitioning algorithm Social networks |
| Title | Network Information Processing Analysis Based on Big Data Parallel Graph Partitioning Algorithm |
| URI | https://hrcak.srce.hr/336007 https://doaj.org/article/fd934148f77b46ef9d2988f0fdc4f63a |
| Volume | 33 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVAON databaseName: DOAJ (Directory of Open Access Journals) customDbUrl: eissn: 1846-3908 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0000396641 issn: 1330-1136 databaseCode: DOA dateStart: 20160101 isFulltext: true titleUrlDefault: https://www.doaj.org/ providerName: Directory of Open Access Journals – providerCode: PRVHPJ databaseName: ROAD: Directory of Open Access Scholarly Resources customDbUrl: eissn: 1846-3908 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0000396641 issn: 1330-1136 databaseCode: M~E dateStart: 20000101 isFulltext: true titleUrlDefault: https://road.issn.org providerName: ISSN International Centre |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1Lb9QwELZQ4cCFN2KhIAshkJCixo8k9rELLXBgVfGQerMcP7Yryi7Kphz57czY3mpz4sIlSmxHsr9x5qGMvyHklQgOzHRTV5JZUUkleaV976tQR-a5tRAy-FRsolss1Pm5Ptsr9YU5YZkeOAN3FL0GRStV7LpetiFqz7VSsY7eydiK5BrVnd4LppIOFuDGS5Z_Y3KsfnDkVpg6yRvMC0DOkokhSnz911r55sXg7I89O3N6j9wpDiI9zhO7T26E9QNyd1d8gZZv8SExi5zATcuBIgSYlrR_MEd0xzZC52CnPIXO-WpJ39vR0jM7YAWVS_oB2arxMTMWpdcul5thNV78fES-n558e_exKtUSKgdGiFfMtzFELFPrQfkF0Te9cKBCkCLK6aZXqg0CowsZwCfyOrLIlfWqb7Rsemh8TA7Wm3V4Qqhsed06q33HouyktL7D2jRdyyILPDYz8nYHnfmVSTEMBBMJZwM4G8TZFJxnZI7gXg9EPuvUAFI2RcrmX1KekZcoGoOMFWtMiVnaq-3WfPr6xRwrpJjXTMC03pRBcTMO1tlywgAWhSRXk5GHk5HwSblJ9-u0AyZzzi3bwQW4NUIgsf_T_7G2Z-Q24pUT1w7JwThchefklvs9rrbDi7Sv4fr5z8lfQYH8mg |
| linkProvider | Directory of Open Access Journals |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Network+Information+Processing+Analysis+Based+on+Big+Data+Parallel+Graph+Partitioning+Algorithm&rft.jtitle=Journal+of+computing+and+information+technology&rft.au=Kong%2C+Keqing+Guan+and+Xianli&rft.date=2025-09-01&rft.pub=Sveuciliste+U+Zagrebu&rft.issn=1330-1136&rft.volume=33&rft.issue=3&rft.spage=139&rft_id=info:doi/10.20532%2Fcit.2025.1005902&rft.externalDocID=A859869135 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1330-1136&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1330-1136&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1330-1136&client=summon |