ROBDD-TrOCRBERTa: a novel robust-optimized blurred document text deblurring and completion with DCGAN-TrOCR and DistilRoBERTa
| Published in: | International journal of information technology (Singapore. Online), Vol. 16, No. 7, pp. 4611-4619 |
|---|---|
| Main authors: | , |
| Format: | Journal Article |
| Language: | English |
| Published: | Singapore; Springer Nature Singapore; 01.10.2024; Springer Nature B.V |
| Keywords: | |
| ISSN: | 2511-2104, 2511-2112 |
| Online access: | Full text |
| Abstract: | Blurred text documents such as historical documents, handwritten manuscripts and notes, old newspapers and books, and damp invoices or legal agreements often present readability challenges because the quality of the text has deteriorated over time. The proposed Robust Optimized Blurred Document Text Deblurring and Text Recognition (ROBDD-TrOCRBERTa) method aims to improve the readability of such deteriorated documents. The approach has two phases. First, a DCGAN deblurs the document image, reducing noise and blurriness. Second, TrOCR integrated with DistilRoBERTa performs text recognition and completion. Experimental results show that the method is effective in various real-world scenarios, making it a promising solution for automated document analysis and digitization in challenging conditions. |
|---|---|
| DOI: | 10.1007/s41870-024-02073-9 |
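
The two-phase flow described in the abstract (deblur the image, then recognize and complete the text) can be sketched as a composition of three interchangeable stages. This is a minimal illustration under stated assumptions, not the authors' implementation: the names `restore_document`, `RestorationResult`, and `MASK`, and the convention that unreadable words arrive as `<mask>` placeholders, are hypothetical and do not come from the record. In practice `deblur` might wrap a trained DCGAN generator, `recognize` a TrOCR model, and `fill_mask` a DistilRoBERTa masked-language model.

```python
from dataclasses import dataclass
from typing import Any, Callable

# Assumption: unreadable words are marked with a RoBERTa-style mask token.
MASK = "<mask>"


@dataclass
class RestorationResult:
    raw_text: str        # recognizer output, may still contain MASK placeholders
    completed_text: str  # text after the language model has filled the gaps


def restore_document(
    image: Any,
    deblur: Callable[[Any], Any],
    recognize: Callable[[Any], str],
    fill_mask: Callable[[str], str],
) -> RestorationResult:
    """Run the hypothetical deblur -> recognize -> complete pipeline.

    `fill_mask` receives the current text and must return a replacement
    for the FIRST remaining MASK in it.
    """
    # Phase 1: image enhancement (e.g. a DCGAN generator).
    sharp_image = deblur(image)

    # Phase 2a: text recognition (e.g. TrOCR).
    raw_text = recognize(sharp_image)

    # Phase 2b: text completion, one masked word at a time, left to right.
    # The loop is bounded by the initial mask count so it always terminates.
    text = raw_text
    for _ in range(raw_text.count(MASK)):
        if MASK not in text:
            break
        text = text.replace(MASK, fill_mask(text), 1)

    return RestorationResult(raw_text=raw_text, completed_text=text)
```

Passing the stages in as callables keeps the sketch runnable without any model weights and makes each phase easy to swap or test in isolation.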