DCW-YOLO: Road Object Detection Algorithms for Autonomous Driving
Aiming at the problems of multiple parameters and poor detection accuracy of object detection network in automatic driving scenarios, an object detection algorithm based on improved YOLOv8 is proposed. First, a dynamic head framework is used to unify the object detection head and the attention mecha...
Uložené v:
| Vydané v: | IEEE access Ročník 13; s. 125676 - 125688 |
|---|---|
| Hlavní autori: | , , |
| Médium: | Journal Article |
| Jazyk: | English |
| Vydavateľské údaje: |
Piscataway
IEEE
2025
The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
| Predmet: | |
| ISSN: | 2169-3536, 2169-3536 |
| On-line prístup: | Získať plný text |
| Tagy: |
Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
|
| Shrnutí: | Aiming at the problems of multiple parameters and poor detection accuracy of object detection network in automatic driving scenarios, an object detection algorithm based on improved YOLOv8 is proposed. First, a dynamic head framework is used to unify the object detection head and the attention mechanism, and the attention mechanism is used for scale-awareness, spatial-awareness, and task-awareness, respectively, which significantly improves the representation capability of the object detection head without increasing the computational overhead. Second, the Coordinate Attention mechanism is embedded in the SPPF layer, which embeds the target's location information into the channel attention to offer more precise localization for the model, suppress irrelevant aspects, and enable greater integration of local and global characteristics. Finally, the deleterious gradients generated by low-quality examples are reduced using the Wise-IoU v3 bounding box loss function in conjunction with a dynamic non-monotonic focusing mechanism utilizing an anchor box gradient gain assignment strategy. On the challenging public dataset KITTI, the accuracy is improved by 2.1% compared to the benchmark algorithm. In addition, the excellent performance on CCTSDB2021 and VOC highlights the generalization performance of the improved model. |
|---|---|
| Bibliografia: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
| ISSN: | 2169-3536 2169-3536 |
| DOI: | 10.1109/ACCESS.2024.3364681 |