RLD-Net: an enhanced deep learning model for accurate pavement distress detection using UAV captured images
Accurate detection of pavement distresses, particularly cracks, is crucial for road maintenance and safety. However, traditional pavement damage detection methods are often difficult to effectively capture small-scale or subtle damage in complex and crowded environment, which limits the detection ac...
Saved in:
| Published in: | Multiscale and Multidisciplinary Modeling, Experiments and Design Vol. 8; no. 8; p. 353 |
|---|---|
| Main Authors: | , , , , |
| Format: | Journal Article |
| Language: | English |
| Published: |
Cham
Springer International Publishing
01.08.2025
Springer Nature B.V |
| Subjects: | |
| ISSN: | 2520-8160, 2520-8179 |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | Accurate detection of pavement distresses, particularly cracks, is crucial for road maintenance and safety. However, traditional pavement damage detection methods are often difficult to effectively capture small-scale or subtle damage in complex and crowded environment, which limits the detection accuracy and robustness. This study proposes RLD-Net, an enhanced detection framework based on the You Only Look Once Extended (YOLOX) model, aimed at improving the detection of pavement distresses, particularly road cracks. The RLD-Net incorporates three novel modules: the Regional Dynamic Perception (RDP) module for improving feature extraction, the Location Enhanced Attention Module (LEAM) for reducing background noise, and the Dynamic Context-Detail Fusion Module (DCFM) for multi-scale feature fusion. These modules work together to significantly improve the accuracy and efficiency of pavement damage detection. And compared with other models, it has better recognition and detection ability. The proposed framework was evaluated using unmanned aerial vehicle (UAV)-captured images under various environmental conditions. Experimental results demonstrate that RLD-Net significantly improves detection accuracy, achieving higher mean Average Precision at 0.5 Intersection over Union (mAP@0.5) scores and better Average Recall (AR) compared to the baseline YOLOX model. The fully integrated RLD-Net achieved an mAP@0.5 of 84.7% and an AR of 68.6%, marking a significant improvement in both precision and recall. Despite a slight increase in computational complexity, the RLD-Net maintains excellent performance with high speed. |
|---|---|
| Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
| ISSN: | 2520-8160 2520-8179 |
| DOI: | 10.1007/s41939-025-00950-9 |