An optimized CNN system to recognize handwritten characters in ancient documents in Grantha script
An optical character recognition (OCR) system plays an important role in the digitization of ancient handwritten text document. Various adversaries of ancient documents such as ink stains, faded portion of text, humidity spots, and similar-shaped characters make the task of character recognition cha...
Uloženo v:
| Vydáno v: | International journal of information technology (Singapore. Online) Ročník 15; číslo 4; s. 1975 - 1983 |
|---|---|
| Hlavní autoři: | , |
| Médium: | Journal Article |
| Jazyk: | angličtina |
| Vydáno: |
Singapore
Springer Nature Singapore
01.04.2023
Springer Nature B.V |
| Témata: | |
| ISSN: | 2511-2104, 2511-2112 |
| On-line přístup: | Získat plný text |
| Tagy: |
Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
|
| Shrnutí: | An optical character recognition (OCR) system plays an important role in the digitization of ancient handwritten text document. Various adversaries of ancient documents such as ink stains, faded portion of text, humidity spots, and similar-shaped characters make the task of character recognition challenging and tedious. This research study proposes an optimized convolution neural network (CNN) based OCR system to recognize each and every character present in the ancient document handwritten in Grantha script. A set of convolutional layers present in the proposed system extracts the deep hierarchical feature vectors from the input character image. Two fully connected neural network (FCNN) layers have classified these feature vectors into its correct class. The values of the hyper-parameters of CNN architecture such as the number of filters in each convolution layer, size of the filter in each convolution layer, number of FCNN layers, and neurons in each FCNN layer have been optimized using the
Bayesian optimization
technique. The major contribution of this work is the proposal of an optimized CNN architecture to perform OCR in ancient documents in Grantha script. A character recognition accuracy of 99.30% has been obtained from the proposed OCR system on the ancient handwritten documents in Grantha script. The experimental results demonstrate that the proposed OCR method outperforms the existing state-of-the-art methods in this regard. |
|---|---|
| Bibliografie: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
| ISSN: | 2511-2104 2511-2112 |
| DOI: | 10.1007/s41870-023-01247-1 |