Lightweight visual localization algorithm for UAVs

Detailed bibliography
Published in: Scientific Reports, Vol. 15, No. 1, Article 6069 (14 pages)
Main authors: Wang, Yuhang; Feng, Xuefeng; Li, Feng; Xian, Qinglong; Jia, Zhen-Hong; Du, Zongdong; Liu, Chang
Format: Journal Article
Language: English
Publication details: London: Nature Publishing Group UK (Nature Portfolio), 19 February 2025
ISSN: 2045-2322
Description
Summary: The Lightv8nPnP lightweight visual positioning algorithm is introduced to make deep learning-based drone visual positioning more lightweight. The core objective of this research is an efficient visual positioning model that achieves accurate 3D positioning for drones. Three optimizations are proposed to enhance model performance. First, GhostConv is introduced into the detection head to reduce its complexity, yielding the GDetect detection head module. Second, to address the imbalanced sample difficulty and uneven pixel quality in the custom dataset, which degrade detection performance, Wise-IoU is adopted as the model's bounding-box regression loss function. Finally, based on the characteristics of the drone aerial dataset, the YOLOv8n network structure is modified to reduce redundant feature maps, producing the TrimYOLO network structure. Experimental results show that, compared with benchmark algorithms, Lightv8nPnP reduces the number of parameters and the computational load, achieves a detection rate of 186 frames per second, and keeps the positioning error below 5.5 centimeters along the X, Y, and Z axes in three-dimensional space.
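
As an illustration of the detection-then-PnP positioning stage described in the abstract, the following is a minimal Python sketch. It assumes a planar landmark with known 3D corner coordinates, a calibrated camera, and corner detections supplied by the object detector; all names, numeric values, and the OpenCV solver choice are illustrative and are not taken from the paper.

# Minimal sketch (not the paper's code): recover the camera/drone position from
# detected 2D corner points of a landmark with known 3D geometry via PnP.
# object_points, camera_matrix, and the pixel coordinates below are assumed
# placeholder values for illustration only.
import numpy as np
import cv2

# Known 3D corner coordinates of a planar landmark in the world frame (metres).
object_points = np.array([[0.0, 0.0, 0.0],
                          [0.2, 0.0, 0.0],
                          [0.2, 0.2, 0.0],
                          [0.0, 0.2, 0.0]], dtype=np.float64)

# Camera intrinsics from a prior calibration (placeholder focal lengths / principal point).
camera_matrix = np.array([[800.0,   0.0, 320.0],
                          [  0.0, 800.0, 240.0],
                          [  0.0,   0.0,   1.0]], dtype=np.float64)
dist_coeffs = np.zeros(5)  # assume negligible lens distortion for this sketch

def estimate_position(image_points):
    """Return the camera (drone) position in the world frame given the 4x2 array
    of corner pixels reported by the object detector."""
    ok, rvec, tvec = cv2.solvePnP(object_points, image_points, camera_matrix, dist_coeffs)
    if not ok:
        raise RuntimeError("PnP solution failed")
    rotation, _ = cv2.Rodrigues(rvec)        # world-to-camera rotation matrix
    return (-rotation.T @ tvec).ravel()      # camera centre expressed in the world frame

# Example call with hypothetical detector output (pixel coordinates).
corners = np.array([[310.0, 242.0],
                    [402.0, 240.0],
                    [404.0, 330.0],
                    [312.0, 333.0]], dtype=np.float64)
print(estimate_position(corners))

In the pipeline described by the abstract, the 2D points would come from the lightweight TrimYOLO/GDetect detector and the intrinsics from the drone camera's calibration; the sketch only shows the geometric step that turns those detections into a 3D position estimate.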
DOI: 10.1038/s41598-025-88089-y