Operator parallel optimization strategy for distributed databases

Gespeichert in:
Bibliographische Detailangaben
Titel: Operator parallel optimization strategy for distributed databases
Autoren: LIU Wenjie, LYU Jingchao
Quelle: Xibei Gongye Daxue Xuebao, Vol 42, Iss 3, Pp 453-459 (2024)
Verlagsinformationen: EDP Sciences, 2024.
Publikationsjahr: 2024
Schlagwörter: data partitioning, distributed database, load balancing, parallel query, TL1-4050, query optimization, Motor vehicles. Aeronautics. Astronautics
Beschreibung: With the continuous development of network technology, the scale of data has shown explosive growth, which leads gradually to replacing traditional single machine databases with distributed databases. Distributed databases solve large-scale data storage problems through collaborative work among nodes, but due to increased communication costs between nodes, its query efficiency is not as good as a standalone database. In a distributed architecture, the data of storage nodes is only used as redundancy for multiple backups, providing data recovery in case of system failure, and it is not utilized to improve query efficiency. In response to the above issues, this article proposes an operator parallel optimization strategy for distributed databases. By splitting key physical operators, the split sub requests are evenly distributed to multiple nodes in the storage layer, which are processed in parallel by multiple nodes, thereby reducing query response time. The above strategy has been applied on distributed database CBase, and experiments have shown that the parallel optimization strategy proposed in this paper can significantly shorten SQL request query time and improve system resource utilization.
Publikationsart: Article
ISSN: 2609-7125
1000-2758
DOI: 10.1051/jnwpu/20244230453
Zugangs-URL: https://doaj.org/article/92c918954ed949dc851824610f875183
Rights: CC BY
Dokumentencode: edsair.doi.dedup.....745dfe13efba767126f6a3c482561f0c
Datenbank: OpenAIRE
Beschreibung
Abstract:With the continuous development of network technology, the scale of data has shown explosive growth, which leads gradually to replacing traditional single machine databases with distributed databases. Distributed databases solve large-scale data storage problems through collaborative work among nodes, but due to increased communication costs between nodes, its query efficiency is not as good as a standalone database. In a distributed architecture, the data of storage nodes is only used as redundancy for multiple backups, providing data recovery in case of system failure, and it is not utilized to improve query efficiency. In response to the above issues, this article proposes an operator parallel optimization strategy for distributed databases. By splitting key physical operators, the split sub requests are evenly distributed to multiple nodes in the storage layer, which are processed in parallel by multiple nodes, thereby reducing query response time. The above strategy has been applied on distributed database CBase, and experiments have shown that the parallel optimization strategy proposed in this paper can significantly shorten SQL request query time and improve system resource utilization.
ISSN:26097125
10002758
DOI:10.1051/jnwpu/20244230453