Boosting the Generalization Capability in Cross-Domain Few-shot Learning via Noise-enhanced Supervised Autoencoder

State of the art (SOTA) few-shot learning (FSL) methods suffer significant performance drop in the presence of domain differences between source and target datasets. The strong discrimination ability on the source dataset does not necessarily translate to high classification accuracy on the target d...

Full description

Saved in:

Bibliographic Details
Published in:	Proceedings / IEEE International Conference on Computer Vision pp. 9404 - 9414
Main Authors:	Liang, Hanwen, Zhang, Qiong, Dai, Peng, Lu, Juwei
Format:	Conference Proceeding
Language:	English
Published:	IEEE 01.10.2021
Subjects:	Boosting Computer vision Correlation Feature extraction Optimization and learning methods Predictive models Recognition and classification Representation learning Transfer learning Transfer/Low-shot/Semi/Unsupervised Learning
ISSN:	2380-7504
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	State of the art (SOTA) few-shot learning (FSL) methods suffer significant performance drop in the presence of domain differences between source and target datasets. The strong discrimination ability on the source dataset does not necessarily translate to high classification accuracy on the target dataset. In this work, we address this cross-domain few-shot learning (CDFSL) problem by boosting the generalization capability of the model. Specifically, we teach the model to capture broader variations of the feature distributions with a novel noise-enhanced supervised autoencoder (NSAE). NSAE trains the model by jointly reconstructing inputs and predicting the labels of inputs as well as their reconstructed pairs. Theoretical analysis based on intra-class correlation (ICC) shows that the feature embeddings learned from NSAE have stronger discrimination and generalization abilities in the target domain. We also take advantage of NSAE structure and propose a two-step fine-tuning procedure that achieves better adaption and improves classification performance in the target domain. Extensive experiments and ablation studies are conducted to demonstrate the effectiveness of the proposed method. Experimental results show that our proposed method consistently outperforms SOTA methods under various conditions.
ISSN:	2380-7504
DOI:	10.1109/ICCV48922.2021.00929