Traffic sign classification with deformable convolution based on denoising residual convolutional autoencoder region localization

Traffic sign classification is particularly important in the field of autonomous driving. Image classification methods based on convolution have been widely studied. Traditional convolution, due to its fixed sampling and operation mode, is difficult to capture information in specific irregular key a...

Full description

Saved in:
Bibliographic Details
Published in:Engineering Research Express Vol. 7; no. 4; pp. 45257 - 45276
Main Authors: Pan, Hao, Guo, Qianlu, Yuan, Decheng, Pan, Duotao, Li, Dong, Yu, Qianxin
Format: Journal Article
Language:English
Published: IOP Publishing 31.12.2025
Subjects:
ISSN:2631-8695, 2631-8695
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Traffic sign classification is particularly important in the field of autonomous driving. Image classification methods based on convolution have been widely studied. Traditional convolution, due to its fixed sampling and operation mode, is difficult to capture information in specific irregular key areas. Although deformable convolution solves the fixed sampling problem of standard convolution, the images captured in reality are easily disturbed by noise, leading to incorrect offset learning and thus destroying the spatial consistency of features. To address these issues, this paper proposes an image classification method based on denoising residual convolutional autoencoder for important region localization and deformable convolution (DRCAE-DCN). The denoising residual convolutional autoencoder is used to extract feature information from the images with added noise and restore them to clean images without noise. The importance of the extracted features is calculated through a multi-head attention mechanism, and the features with high importance are selected to locate the regions in the original image. Deformable convolution is then applied to these regions. By screening important features and localizing regions in the image, it provides a basis for the offset learning of deformable convolution. The important features are used as auxiliary information to fuse with the features extracted by deformable convolution, compensating for the deficiency of deformable convolution in global information extraction, and then image classification is performed. The classification accuracy on the GTSRB, CCTSDB and BelgiumTS datasets has been significantly improved, demonstrating the effectiveness of this method.
Bibliography:ERX-110424.R3
ISSN:2631-8695
2631-8695
DOI:10.1088/2631-8695/ae1886