A Comprehensive Survey of Image Augmentation Techniques for Deep Learning

•We examine challenges and vicinity distribution to demonstrate the necessity of image augmentation for deep learning.•We present a comprehensive survey on image augmentation with a novel informative taxonomy that encompasses a wider range of algorithms.•We discuss the current situation and future d...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:Pattern recognition Ročník 137; s. 109347
Hlavní autori: Xu, Mingle, Yoon, Sook, Fuentes, Alvaro, Park, Dong Sun
Médium: Journal Article
Jazyk:English
Vydavateľské údaje: Elsevier Ltd 01.05.2023
Predmet:
ISSN:0031-3203, 1873-5142
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Popis
Shrnutí:•We examine challenges and vicinity distribution to demonstrate the necessity of image augmentation for deep learning.•We present a comprehensive survey on image augmentation with a novel informative taxonomy that encompasses a wider range of algorithms.•We discuss the current situation and future direction for image augmentation, along with three relevant topics: understanding image augmentation, new strategy to leverage image augmentation, and feature augmentation. Although deep learning has achieved satisfactory performance in computer vision, a large volume of images is required. However, collecting images is often expensive and challenging. Many image augmentation algorithms have been proposed to alleviate this issue. Understanding existing algorithms is, therefore, essential for finding suitable and developing novel methods for a given task. In this study, we perform a comprehensive survey of image augmentation for deep learning using a novel informative taxonomy. To examine the basic objective of image augmentation, we introduce challenges in computer vision tasks and vicinity distribution. The algorithms are then classified among three categories: model-free, model-based, and optimizing policy-based. The model-free category employs the methods from image processing, whereas the model-based approach leverages image generation models to synthesize images. In contrast, the optimizing policy-based approach aims to find an optimal combination of operations. Based on this analysis, we believe that our survey enhances the understanding necessary for choosing suitable methods and designing novel algorithms.
ISSN:0031-3203
1873-5142
DOI:10.1016/j.patcog.2023.109347