Table structure recognition based on dual-branch encoder-decoder segmentation

Gespeichert in:
Bibliographische Detailangaben
Titel: Table structure recognition based on dual-branch encoder-decoder segmentation
Autoren: Dajun Xiao, Xialing Xu, Yue Zhang, Tao Liu, Xin Li, Yongtian Qiao
Quelle: Journal of Computational Methods in Sciences and Engineering.
Verlagsinformationen: SAGE Publications, 2025.
Publikationsjahr: 2025
Beschreibung: Digitization of paper documents is crucial for the management of modern power grid enterprise. Table structure recognition, which identifies table cells, presents challenges due to diverse table formats. This paper introduces a novel table structure recognition method based on dual-branch encoder-decoder segmentation. The proposed approach converts table structure extraction into row and column segmentation sub-problems, which utilizes a single encoder for feature extraction and two independent decoder branches for segment prediction. In this framework, a Conv-Res-CBAM unit is proposed to enhance feature extraction and transmission. Additionally, the Tesseract OCR engine is incorporated for character recognition. Extensive experiments on two public datasets and a self-collected dataset demonstrate the superiority of our method.
Publikationsart: Article
Sprache: English
ISSN: 1875-8983
1472-7978
DOI: 10.1177/14727978251361868
Rights: URL: https://journals.sagepub.com/page/policies/text-and-data-mining-license
Dokumentencode: edsair.doi...........b1227fea5aad47095f86f532e672d91d
Datenbank: OpenAIRE
Beschreibung
Abstract:Digitization of paper documents is crucial for the management of modern power grid enterprise. Table structure recognition, which identifies table cells, presents challenges due to diverse table formats. This paper introduces a novel table structure recognition method based on dual-branch encoder-decoder segmentation. The proposed approach converts table structure extraction into row and column segmentation sub-problems, which utilizes a single encoder for feature extraction and two independent decoder branches for segment prediction. In this framework, a Conv-Res-CBAM unit is proposed to enhance feature extraction and transmission. Additionally, the Tesseract OCR engine is incorporated for character recognition. Extensive experiments on two public datasets and a self-collected dataset demonstrate the superiority of our method.
ISSN:18758983
14727978
DOI:10.1177/14727978251361868