A method for improving the robustness of neural network for aerial image matching
Uloženo v:
| Název: | A method for improving the robustness of neural network for aerial image matching |
|---|---|
| Autoři: | Artem Korobov, Yuriy Moskalenko, Maksym Vynohradov, Vladyslav Babych |
| Zdroj: | Авіаційно-космічна техніка та технологія, Vol 0, Iss 5, Pp 76-87 (2025) |
| Informace o vydavateli: | National Aerospace University «Kharkiv Aviation Institute», 2025. |
| Rok vydání: | 2025 |
| Sbírka: | LCC:Motor vehicles. Aeronautics. Astronautics |
| Témata: | співставлення зображень, робастність, нейронні мережі, змагальні атаки, змагальне навчання, Motor vehicles. Aeronautics. Astronautics, TL1-4050 |
| Popis: | The subject of study in this article is neural network–based methods for aerial image matching, which are widely used in navigation, localization, and mapping tasks. A key challenge lies in the sensitivity of such methods to visual disturbances and scene novelty caused by shadows, illumination changes, and terrain variability, which limits their robustness in real-world conditions—particularly under constrained computational resources. This paper investigates an approach to enhancing the robustness and cross-domain generalization of computationally efficient aerial image matching models by combining adversarial procedural noise with a modified activation function. The goal is to develop a training methodology that simultaneously increases the resilience of models to perturbations and improves their transferability across different observation domains. The research objectives are as follows: (1) to analyze existing methods for improving the robustness of neural networks and assess their applicability to aerial image matching tasks; (2) to develop a training approach incorporating the synthesis of adversarial procedural noises (Perlin, Gabor, Worley) and the replacement of the standard ReLU with a hybrid activation function, LeakyReLU6, which constrains activation amplitudes and reduces sensitivity to local disturbances; (3) to conduct a comprehensive experimental evaluation of detector-based architectures (SuperPoint + LightGlue) and detector-free models (EfficientLoFTR) using the Aerial Image Matching Benchmark dataset; (4) to verify cross-domain generalization on the HPatches dataset; and (5) to perform an ablation study to isolate the contribution of each component. Results. The proposed methodology achieved over a 4.2% absolute improvement in AUC@1px matching accuracy on noisy test data for both classes of models. The ablation study revealed a synergistic effect from combining procedural noise with LeakyReLU6 — in particular, for the SuperPoint + LightGlue combination, improvements reached +3.0% AUC@1px and +2.7% AUC@3px, while for EfficientLoFTR, gains of +2.2% and +2.6% were observed, respectively. Additionally, testing on HPatches showed a 0.83% smaller performance drop compared to baseline training, confirming a higher level of cross-domain generalization. Conclusions. The proposed approach enhances the noise robustness and cross-domain generalization of feature-matching models and can be easily extended to various neural network architectures. Future work will focus on investigating the influence of procedural noise hyperparameters, applying meta-learning on corrupted data, and introducing architectural improvements to further strengthen resilience and robustness. Scientific novelty. The novelty of this work lies in the first integration of adversarial learning with procedural noise and a bounded activation function (LeakyReLU6, using the Straight-Through Estimator (STE) in the backward pass), which produced a synergistic effect that improved the robustness and generalization of aerial image matching models without a significant increase in computational cost. |
| Druh dokumentu: | article |
| Popis souboru: | electronic resource |
| Jazyk: | English Ukrainian |
| ISSN: | 1727-7337 2663-2217 |
| Relation: | http://nti.khai.edu/ojs/index.php/aktt/article/view/3147; https://doaj.org/toc/1727-7337; https://doaj.org/toc/2663-2217 |
| DOI: | 10.32620/aktt.2025.5.07 |
| Přístupová URL adresa: | https://doaj.org/article/1a4236d71f5e4a3c9b7fcfa432fe2087 |
| Přístupové číslo: | edsdoj.1a4236d71f5e4a3c9b7fcfa432fe2087 |
| Databáze: | Directory of Open Access Journals |
| Abstrakt: | The subject of study in this article is neural network–based methods for aerial image matching, which are widely used in navigation, localization, and mapping tasks. A key challenge lies in the sensitivity of such methods to visual disturbances and scene novelty caused by shadows, illumination changes, and terrain variability, which limits their robustness in real-world conditions—particularly under constrained computational resources. This paper investigates an approach to enhancing the robustness and cross-domain generalization of computationally efficient aerial image matching models by combining adversarial procedural noise with a modified activation function. The goal is to develop a training methodology that simultaneously increases the resilience of models to perturbations and improves their transferability across different observation domains. The research objectives are as follows: (1) to analyze existing methods for improving the robustness of neural networks and assess their applicability to aerial image matching tasks; (2) to develop a training approach incorporating the synthesis of adversarial procedural noises (Perlin, Gabor, Worley) and the replacement of the standard ReLU with a hybrid activation function, LeakyReLU6, which constrains activation amplitudes and reduces sensitivity to local disturbances; (3) to conduct a comprehensive experimental evaluation of detector-based architectures (SuperPoint + LightGlue) and detector-free models (EfficientLoFTR) using the Aerial Image Matching Benchmark dataset; (4) to verify cross-domain generalization on the HPatches dataset; and (5) to perform an ablation study to isolate the contribution of each component. Results. The proposed methodology achieved over a 4.2% absolute improvement in AUC@1px matching accuracy on noisy test data for both classes of models. The ablation study revealed a synergistic effect from combining procedural noise with LeakyReLU6 — in particular, for the SuperPoint + LightGlue combination, improvements reached +3.0% AUC@1px and +2.7% AUC@3px, while for EfficientLoFTR, gains of +2.2% and +2.6% were observed, respectively. Additionally, testing on HPatches showed a 0.83% smaller performance drop compared to baseline training, confirming a higher level of cross-domain generalization. Conclusions. The proposed approach enhances the noise robustness and cross-domain generalization of feature-matching models and can be easily extended to various neural network architectures. Future work will focus on investigating the influence of procedural noise hyperparameters, applying meta-learning on corrupted data, and introducing architectural improvements to further strengthen resilience and robustness. Scientific novelty. The novelty of this work lies in the first integration of adversarial learning with procedural noise and a bounded activation function (LeakyReLU6, using the Straight-Through Estimator (STE) in the backward pass), which produced a synergistic effect that improved the robustness and generalization of aerial image matching models without a significant increase in computational cost. |
|---|---|
| ISSN: | 17277337 26632217 |
| DOI: | 10.32620/aktt.2025.5.07 |
Nájsť tento článok vo Web of Science