Static Analysis Techniques for Fixing Software Defects in MPI-Based Parallel Programs.

Detailed bibliography
Title: Static Analysis Techniques for Fixing Software Defects in MPI-Based Parallel Programs.
Authors: Al-Johany, Norah Abdullah; Sharaf, Sanaa Abdullah; Eassa, Fathy Elbouraey; Alnanih, Reem Abdulaziz
Source: Computers, Materials & Continua; 2024, Vol. 79 Issue 2, p3139-3173, 35p
Subjects: SOFTWARE engineering, PARALLEL programming, DISTRIBUTED computing, EVIDENCE gaps, COMPUTER systems
Abstract: The Message Passing Interface (MPI) is a widely accepted standard for parallel computing on distributed memory systems. However, MPI implementations can contain defects that impact the reliability and performance of parallel applications. Detecting and correcting these defects is crucial, yet there is a lack of published models specifically designed for correcting MPI defects. To address this, we propose a model for detecting and correcting MPI defects (DC_MPI), which aims to detect and correct defects in various types of MPI communication, including blocking point-to-point (BPTP), nonblocking point-to-point (NBPTP), and collective communication (CC). The defects addressed by the DC_MPI model include illegal MPI calls, deadlocks (DL), race conditions (RC), and message mismatches (MM). To assess the effectiveness of the DC_MPI model, we performed experiments on a dataset consisting of 40 MPI codes. The results indicate that the model achieved a detection rate of 37 out of 40 codes, resulting in an overall detection accuracy of 92.5%. Additionally, the execution duration of the DC_MPI model ranged from 0.81 to 1.36 s. These findings show that the DC_MPI model is useful in detecting and correcting defects in MPI implementations, thereby enhancing the reliability and performance of parallel applications. The DC_MPI model fills an important research gap and provides a valuable tool for improving the quality of MPI-based parallel computing systems. [ABSTRACT FROM AUTHOR]
Copyright of Computers, Materials & Continua is the property of Tech Science Press and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Complementary Index
ISSN: 1546-2218
DOI: 10.32604/cmc.2024.047392