Enhanced Named Entity Recognition algorithm for financial document verification

Many enterprise systems are document-intensive and require extensive manual verification. The verification process has challenge in terms of time and remaining bugs. A general automatic or semi-automatic document verification system would be useful. However, as the nature of the natural language, th...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:The Journal of supercomputing Ročník 79; číslo 17; s. 19431 - 19451
Hlavní autoři: Toprak, Ahmet, Turan, Metin
Médium: Journal Article
Jazyk:angličtina
Vydáno: New York Springer US 01.11.2023
Springer Nature B.V
Témata:
ISSN:0920-8542, 1573-0484
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Abstract Many enterprise systems are document-intensive and require extensive manual verification. The verification process has challenge in terms of time and remaining bugs. A general automatic or semi-automatic document verification system would be useful. However, as the nature of the natural language, the context is an important factor. In this research, the target context is selected to be the financial documents, which have been highly interested recently. An automatic document verification model based on only entities (mostly faced within financial documents) was experimented. The summary report was verified with original documents, such that entities in the summary were searched for matching in the original documents. Verification process success was evaluated by comparison of the named entity algorithms in the literature. The special Kaggle data set ready for this purpose was used for entity matching from the summary within the original documents. The average document verification accuracy of named entity finding algorithms for only financial type documents was 85.36%, where the proposed entity recognition algorithm reached 88.80%. On the other hand, the average document verification time of the experimented algorithms and the developed algorithm is 2.43 and 2.48 s respectively. As a conclusion, when both the BERT-base-cased classification model and rule-based approaches are applied specific to the context, it enhances the entity verification process with an insignificant time cost. Consequently, even we used limited data and rules, it is seen that there exists opportunity to automatize the document verification process with the support of both the BERT-base-cased classification model and rule-based approaches.
AbstractList Many enterprise systems are document-intensive and require extensive manual verification. The verification process has challenge in terms of time and remaining bugs. A general automatic or semi-automatic document verification system would be useful. However, as the nature of the natural language, the context is an important factor. In this research, the target context is selected to be the financial documents, which have been highly interested recently. An automatic document verification model based on only entities (mostly faced within financial documents) was experimented. The summary report was verified with original documents, such that entities in the summary were searched for matching in the original documents. Verification process success was evaluated by comparison of the named entity algorithms in the literature. The special Kaggle data set ready for this purpose was used for entity matching from the summary within the original documents. The average document verification accuracy of named entity finding algorithms for only financial type documents was 85.36%, where the proposed entity recognition algorithm reached 88.80%. On the other hand, the average document verification time of the experimented algorithms and the developed algorithm is 2.43 and 2.48 s respectively. As a conclusion, when both the BERT-base-cased classification model and rule-based approaches are applied specific to the context, it enhances the entity verification process with an insignificant time cost. Consequently, even we used limited data and rules, it is seen that there exists opportunity to automatize the document verification process with the support of both the BERT-base-cased classification model and rule-based approaches.
Many enterprise systems are document-intensive and require extensive manual verification. The verification process has challenge in terms of time and remaining bugs. A general automatic or semi-automatic document verification system would be useful. However, as the nature of the natural language, the context is an important factor. In this research, the target context is selected to be the financial documents, which have been highly interested recently. An automatic document verification model based on only entities (mostly faced within financial documents) was experimented. The summary report was verified with original documents, such that entities in the summary were searched for matching in the original documents. Verification process success was evaluated by comparison of the named entity algorithms in the literature. The special Kaggle data set ready for this purpose was used for entity matching from the summary within the original documents. The average document verification accuracy of named entity finding algorithms for only financial type documents was 85.36%, where the proposed entity recognition algorithm reached 88.80%. On the other hand, the average document verification time of the experimented algorithms and the developed algorithm is 2.43 and 2.48 s respectively. As a conclusion, when both the BERT-base-cased classification model and rule-based approaches are applied specific to the context, it enhances the entity verification process with an insignificant time cost. Consequently, even we used limited data and rules, it is seen that there exists opportunity to automatize the document verification process with the support of both the BERT-base-cased classification model and rule-based approaches.
Author Turan, Metin
Toprak, Ahmet
Author_xml – sequence: 1
  givenname: Ahmet
  surname: Toprak
  fullname: Toprak, Ahmet
  email: ce.ahmet.toprak@gmail.com
  organization: Department of Computer Engineering, Istanbul Ticaret University
– sequence: 2
  givenname: Metin
  surname: Turan
  fullname: Turan, Metin
  organization: Department of Computer Engineering, Istanbul Ticaret University
BookMark eNp9kM1KAzEURoNUsK2-gKsB16P5G5MspVQrFAui65DJZNqUmaQmqdC3N-0Igotucgmcc-_HNwEj550B4BbBewQhe4gIYcxKiEkJK8JQSS_AGFUsfymnIzCGAsOSVxRfgUmMWwghJYyMwWruNspp0xRvqs_v3CWbDsW70X7tbLLeFapb-2DTpi9aH4rWusxb1RWN1_veuFR8m2Bbq9WRvgaXreqiufmdU_D5PP-YLcrl6uV19rQsNUEilQg3gpuGImEqqqHBghFFBWW8gQ2qOeJGwNpgzJWiiuR8dR7UEKx5TRpGpuBu2LsL_mtvYpJbvw8un5QEV48VryAWmeIDpYOPMZhWaptOOVNQtpMIymN9cqhP5vrkqT5Js4r_qbtgexUO5yUySDHDbm3CX6oz1g-dm4S3
CitedBy_id crossref_primary_10_1007_s44163_025_00347_0
crossref_primary_10_1177_10944281251334777
crossref_primary_10_1038_s41598_025_06789_x
crossref_primary_10_37394_23202_2025_24_50
crossref_primary_10_1109_ACCESS_2024_3477270
Cites_doi 10.1016/j.artint.2005.03.001
10.1109/ICME.2000.871093
10.1162/tacl_a_00088
10.1109/WI-IAT.2011.258
10.18653/v1/P19-1139
10.1007/978-3-642-22546-8_21
10.1109/TENCON.2015.7372818
10.1145/1571941.1571989
10.1109/ACCESS.2015.2431493
10.3115/1699510.1699512
10.1109/ICDAR.2011.66
10.1109/DAS.2018.74
10.1109/EISIC.2015.21
10.1109/ACCESS.2021.3129786
10.3115/1609822.1609823
10.1109/ICVGIP.2008.67
10.1109/DAS.2016.75
10.1109/SITIS.2015.70
10.1109/EDOC.2016.7579376
10.1109/HICSS.2004.1265265
10.1145/1321440.1321542
10.1016/j.patrec.2005.03.024
10.1075/li.30.1.03nad
ContentType Journal Article
Copyright The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023.
Copyright_xml – notice: The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
– notice: The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023.
DBID AAYXX
CITATION
8FE
8FG
ABJCF
AFKRA
ARAPS
AZQEC
BENPR
BGLVJ
CCPQU
DWQXO
GNUQQ
HCIFZ
JQ2
K7-
L6V
M7S
P5Z
P62
PHGZM
PHGZT
PKEHL
PQEST
PQGLB
PQQKQ
PQUKI
PRINS
PTHSS
DOI 10.1007/s11227-023-05371-4
DatabaseName CrossRef
ProQuest SciTech Collection
ProQuest Technology Collection
Materials Science & Engineering Collection
ProQuest Central UK/Ireland
Advanced Technologies & Computer Science Collection
ProQuest Central Essentials
ProQuest Central
Technology collection
ProQuest One Community College
ProQuest Central Korea
ProQuest Central Student
SciTech Premium Collection
ProQuest Computer Science Collection
Computer Science Database
ProQuest Engineering Collection
Engineering Database
Advanced Technologies & Aerospace Database
ProQuest Advanced Technologies & Aerospace Collection
ProQuest Central Premium
ProQuest One Academic (New)
ProQuest One Academic Middle East (New)
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Applied & Life Sciences
ProQuest One Academic (retired)
ProQuest One Academic UKI Edition
ProQuest Central China
Engineering Collection
DatabaseTitle CrossRef
Computer Science Database
ProQuest Central Student
Technology Collection
ProQuest One Academic Middle East (New)
ProQuest Advanced Technologies & Aerospace Collection
ProQuest Central Essentials
ProQuest Computer Science Collection
SciTech Premium Collection
ProQuest One Community College
ProQuest Central China
ProQuest Central
ProQuest One Applied & Life Sciences
ProQuest Engineering Collection
ProQuest Central Korea
ProQuest Central (New)
Engineering Collection
Advanced Technologies & Aerospace Collection
Engineering Database
ProQuest One Academic Eastern Edition
ProQuest Technology Collection
ProQuest SciTech Collection
Advanced Technologies & Aerospace Database
ProQuest One Academic UKI Edition
Materials Science & Engineering Collection
ProQuest One Academic
ProQuest One Academic (New)
DatabaseTitleList
Computer Science Database
Database_xml – sequence: 1
  dbid: BENPR
  name: ProQuest Central
  url: https://www.proquest.com/central
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISSN 1573-0484
EndPage 19451
ExternalDocumentID 10_1007_s11227_023_05371_4
GroupedDBID -4Z
-59
-5G
-BR
-EM
-Y2
-~C
.4S
.86
.DC
.VR
06D
0R~
0VY
123
199
1N0
1SB
2.D
203
28-
29L
2J2
2JN
2JY
2KG
2KM
2LR
2P1
2VQ
2~H
30V
4.4
406
408
409
40D
40E
5QI
5VS
67Z
6NX
78A
8TC
8UJ
95-
95.
95~
96X
AAAVM
AABHQ
AACDK
AAHNG
AAIAL
AAJBT
AAJKR
AANZL
AAOBN
AARHV
AARTL
AASML
AATNV
AATVU
AAUYE
AAWCG
AAYIU
AAYOK
AAYQN
AAYTO
AAYZH
ABAKF
ABBBX
ABBXA
ABDBF
ABDPE
ABDZT
ABECU
ABFTD
ABFTV
ABHLI
ABHQN
ABJNI
ABJOX
ABKCH
ABKTR
ABMNI
ABMQK
ABNWP
ABQBU
ABQSL
ABSXP
ABTEG
ABTHY
ABTKH
ABTMW
ABULA
ABWNU
ABXPI
ACAOD
ACBXY
ACDTI
ACGFS
ACHSB
ACHXU
ACKNC
ACMDZ
ACMLO
ACOKC
ACOMO
ACPIV
ACUHS
ACZOJ
ADHHG
ADHIR
ADIMF
ADINQ
ADKNI
ADKPE
ADMLS
ADQRH
ADRFC
ADTPH
ADURQ
ADYFF
ADZKW
AEBTG
AEFIE
AEFQL
AEGAL
AEGNC
AEJHL
AEJRE
AEKMD
AEMSY
AENEX
AEOHA
AEPYU
AESKC
AETLH
AEVLU
AEXYK
AFBBN
AFEXP
AFGCZ
AFLOW
AFQWF
AFWTZ
AFZKB
AGAYW
AGDGC
AGGDS
AGJBK
AGMZJ
AGQEE
AGQMX
AGRTI
AGWIL
AGWZB
AGYKE
AHAVH
AHBYD
AHSBF
AHYZX
AI.
AIAKS
AIGIU
AIIXL
AILAN
AITGF
AJBLW
AJRNO
AJZVZ
ALMA_UNASSIGNED_HOLDINGS
ALWAN
AMKLP
AMXSW
AMYLF
AMYQR
AOCGG
ARCSS
ARMRJ
ASPBG
AVWKF
AXYYD
AYJHY
AZFZN
B-.
B0M
BA0
BBWZM
BDATZ
BGNMA
BSONS
CAG
COF
CS3
CSCUP
DDRTE
DL5
DNIVK
DPUIP
DU5
EAD
EAP
EAS
EBD
EBLON
EBS
EDO
EIOEI
EJD
EMK
EPL
ESBYG
ESX
F5P
FEDTE
FERAY
FFXSO
FIGPU
FINBP
FNLPD
FRRFC
FSGXE
FWDCC
GGCAI
GGRSB
GJIRD
GNWQR
GQ6
GQ7
GQ8
GXS
H13
HF~
HG5
HG6
HMJXF
HQYDN
HRMNR
HVGLF
HZ~
H~9
I-F
I09
IHE
IJ-
IKXTQ
ITM
IWAJR
IXC
IZIGR
IZQ
I~X
I~Z
J-C
J0Z
JBSCW
JCJTX
JZLTJ
KDC
KOV
KOW
LAK
LLZTM
M4Y
MA-
N2Q
N9A
NB0
NDZJH
NPVJJ
NQJWS
NU0
O9-
O93
O9G
O9I
O9J
OAM
OVD
P19
P2P
P9O
PF0
PT4
PT5
QOK
QOS
R4E
R89
R9I
RHV
RNI
ROL
RPX
RSV
RZC
RZE
RZK
S16
S1Z
S26
S27
S28
S3B
SAP
SCJ
SCLPG
SCO
SDH
SDM
SHX
SISQX
SJYHP
SNE
SNPRN
SNX
SOHCF
SOJ
SPISZ
SRMVM
SSLCW
STPWE
SZN
T13
T16
TEORI
TSG
TSK
TSV
TUC
TUS
U2A
UG4
UOJIU
UTJUX
UZXMN
VC2
VFIZW
VH1
W23
W48
WH7
WK8
YLTOR
Z45
Z7R
Z7X
Z7Z
Z83
Z88
Z8M
Z8N
Z8R
Z8T
Z8W
Z92
ZMTXR
~8M
~EX
AAPKM
AAYXX
ABBRH
ABDBE
ABFSG
ABJCF
ABRTQ
ACSTC
ADHKG
ADKFA
AEZWR
AFDZB
AFFHD
AFHIU
AFKRA
AFOHR
AGQPQ
AHPBZ
AHWEU
AIXLP
ARAPS
ATHPR
AYFIA
BENPR
BGLVJ
CCPQU
CITATION
HCIFZ
K7-
M7S
PHGZM
PHGZT
PQGLB
PTHSS
8FE
8FG
AZQEC
DWQXO
GNUQQ
JQ2
L6V
P62
PKEHL
PQEST
PQQKQ
PQUKI
PRINS
ID FETCH-LOGICAL-c319t-12d98ed419e54c0e2973a49478d0d1b818e90be228aa4a3cedb4a34e32c8b3d73
IEDL.DBID M7S
ISICitedReferencesCount 3
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000994797400001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 0920-8542
IngestDate Sun Nov 30 04:07:35 EST 2025
Sat Nov 29 04:27:45 EST 2025
Tue Nov 18 22:25:36 EST 2025
Fri Feb 21 02:41:32 EST 2025
IsPeerReviewed true
IsScholarly true
Issue 17
Keywords Named Entity Recognition
Natural language processing
Spell-checker
Document summarization
Automatic document verification
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c319t-12d98ed419e54c0e2973a49478d0d1b818e90be228aa4a3cedb4a34e32c8b3d73
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
PQID 3256585029
PQPubID 2043774
PageCount 21
ParticipantIDs proquest_journals_3256585029
crossref_citationtrail_10_1007_s11227_023_05371_4
crossref_primary_10_1007_s11227_023_05371_4
springer_journals_10_1007_s11227_023_05371_4
PublicationCentury 2000
PublicationDate 20231100
2023-11-00
20231101
PublicationDateYYYYMMDD 2023-11-01
PublicationDate_xml – month: 11
  year: 2023
  text: 20231100
PublicationDecade 2020
PublicationPlace New York
PublicationPlace_xml – name: New York
PublicationSubtitle An International Journal of High-Performance Computer Design, Analysis, and Use
PublicationTitle The Journal of supercomputing
PublicationTitleAbbrev J Supercomput
PublicationYear 2023
Publisher Springer US
Springer Nature B.V
Publisher_xml – name: Springer US
– name: Springer Nature B.V
References Naman J (2022) NER dataset. https://www.kaggle.com/namanj27/ner-dataset, Accessed 28 Dec 2022
Beusekom JV, Shafait F (2011) Distortion measurement for automatic document verification. In: 2011 International Conference on Document Analysis and Recognition, pp 289–293
Poon H, Domingos P (2009) Unsupervised semantic parsing. In: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Singapore, pp 1–10. https://aclanthology.org/D09-1001
Petkova D, Croft WB (2007) Proximity-Based document representation for named entity retrieval. In: Proceedings of the Sixteenth ACM Conference on Conference on Information and Knowledge Management, CIKM ’07, Association for Computing Machinery, New York, pp 731–740
SangEMeulderFIntroduction to the CoNLL-2003 shared task: language-independent named entity recognitionProc Seventh Conf Nat Lang Learn HLT-NAACL20032003142147
Garain U, Halder B (2008) On automatic authenticity verification of printed security documents. In: 2008 Sixth Indian Conference on Computer Vision, Graphics and Image Processing, pp 706–713
Mollá D, van Zaanen M, Smith D (2006) Named Entity Recognition for question answering. In: Proceedings of the Australasian Language Technology Workshop 2006, Proceedings of the 2006 Australasian Language Technology Workshop (ALTW2006), Sydney, Australia, pp 51–58. https://aclanthology.org/U06-1009
Wang J-H (2011) Web-based verification on the representativeness of terms extracted from single short documents. In: 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology, vol. 3, pp 114–117
ReddySTäckströmOCollinsMTransforming dependency structures to logical forms for semantic parsingTrans Assoc Comput Linguist2016412714010.1162/tacl_a_00088
Wu C-H, Huang C-L, Hsu C-S, et al (2007) Speech retrieval using spoken keyword extraction and semantic verification. TENCON 2007–2007 IEEE Region 10 Conference, pp 1–4
Zhang Z, Han X, Liu Z, et al (2019) ERNIE: enhanced language representation with informative entities. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, Florence, pp 1441–1451
BassilYA trainable summarizer with knowledge acquired from robust NLP techniquesInt J Res Rev Comput Sci (IJRRCS)20123120792557
BensefiaAPaquetTHeutteLA writer identification and verification systemPattern Recognit Lett200526132080209210.1016/j.patrec.2005.03.0241087.68677
MridhaMFLimaAANurKA survey of automatic text summarization: Progress. Process and challengesIEEE Access2021915604315607010.1109/ACCESS.2021.3129786
Roychoudhury S, Bellarykar N, Kulkarni V (2016) A NLP based framework to support document verification-as-a-service. In: 2016 IEEE 20th International Enterprise Distributed Object Computing Conference (EDOC), pp 1–10
Takata Y, Nakamura T, Seki H (2004) Accessibility verification of WWW documents by an automatic guideline verification tool. In: 37th Annual Hawaii International Conference on System Sciences, 2004. Proceedings of the, p 10
Pariza S (2022) BBC news summary., 2022. https://www.kaggle.com/pariza/bbc-news-summary, Accessed 28 Dec
Hamad F, Zraqou J, Maaita A, et al (2015) A secure authentication system for ePassport detection and verification. In: 2015 European Intelligence and Security Informatics Conference, pp 173–176
ChengPErkKAttending to entities for better text understandingProc AAAI Conf Artif Intell202034575547561
Elkasrawi S, Dengel A, Abdelsamad A, et al (2016) What you see is what you get? Automatic image verification for online news content. In: 2016 12th IAPR Workshop on Document Analysis Systems (DAS), pp 114–119
EtzioniOCafarellaMDowneyDUnsupervised named-entity extraction from the Web: an experimental studyArtif Intell200516519113410.1016/j.artint.2005.03.001
HassanpourSO’ConnorMJDasAKBassiliadesNGovernatoriGPaschkeAA framework for the automatic extraction of rules from online textRule-based reasoning, programming, and applications2011Berlin, HeidelbergSpringer26628010.1007/978-3-642-22546-8_21
Techopedia (2022) What does spell checker mean., 2017. https://www.techopedia.com/definition/12396/spell-checker, Accessed 28 Dec
Ghanmi N, Awal AM (2018) A new descriptor for pattern matching: application to identity document verification. In: 2018 13th IAPR International Workshop on Document Analysis Systems (DAS), pp 375–380
Itcib (2022) Financial Documents Verification (2022). https://itcib.com/financial-documents-verification.html Accessed 28 Dec
Sampaio P, Santos C, Courtias J (2000) About the semantic verification of SMIL documents. In: 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532), vol. 3, pp 1675–1678
Guo J, Xu G, Cheng X, et al (2009) Named Entity Recognition in query. In: Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR ’09, Association for Computing Machinery, New York, pp 267–274
Ando T, Yatsu H, Hisazumi K, et al (2015) Reference model of specifications toward independent verification and validation. In: TENCON 2015–2015 IEEE Region 10 Conference, pp 1–3
NadeauDSekineSA survey of named entity recognition and classificationLingvist Investig200730132610.1075/li.30.1.03nad
Babych B, Hartley A (2003) Improving machine translation quality with automatic Named Entity Recognition. In: Proceedings of the 7th International EAMT Workshop on MT and Other Language Technology Tools, Improving MT Through Other Language Technology Tools, Resource and Tools for Building MT at EACL 2003. Budapest https://aclanthology.org/W03-2201
Hnoohom N, Chumuang N, Ketcham M (2015) Thai Handwritten verification system on documents for the investigation. In: 2015 11th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS), pp 617–622
TolosanaRVera-RodriguezROrtega-GarciaJPreprocessing and feature selection for improved sensor interoperability in online biometric signature verificationIEEE Access2015347848910.1109/ACCESS.2015.2431493
5371_CR10
5371_CR32
5371_CR31
5371_CR12
5371_CR11
S Hassanpour (5371_CR13) 2011
5371_CR30
5371_CR19
5371_CR14
5371_CR16
5371_CR15
P Cheng (5371_CR6) 2020; 34
5371_CR7
MF Mridha (5371_CR17) 2021; 9
5371_CR9
O Etzioni (5371_CR8) 2005; 165
Y Bassil (5371_CR3) 2012; 3
S Reddy (5371_CR23) 2016; 4
5371_CR21
5371_CR20
A Bensefia (5371_CR4) 2005; 26
D Nadeau (5371_CR18) 2007; 30
5371_CR22
5371_CR2
5371_CR28
5371_CR5
5371_CR25
5371_CR24
5371_CR1
5371_CR27
E Sang (5371_CR26) 2003; 2003
R Tolosana (5371_CR29) 2015; 3
References_xml – reference: Ghanmi N, Awal AM (2018) A new descriptor for pattern matching: application to identity document verification. In: 2018 13th IAPR International Workshop on Document Analysis Systems (DAS), pp 375–380
– reference: NadeauDSekineSA survey of named entity recognition and classificationLingvist Investig200730132610.1075/li.30.1.03nad
– reference: Beusekom JV, Shafait F (2011) Distortion measurement for automatic document verification. In: 2011 International Conference on Document Analysis and Recognition, pp 289–293
– reference: Pariza S (2022) BBC news summary., 2022. https://www.kaggle.com/pariza/bbc-news-summary, Accessed 28 Dec
– reference: HassanpourSO’ConnorMJDasAKBassiliadesNGovernatoriGPaschkeAA framework for the automatic extraction of rules from online textRule-based reasoning, programming, and applications2011Berlin, HeidelbergSpringer26628010.1007/978-3-642-22546-8_21
– reference: EtzioniOCafarellaMDowneyDUnsupervised named-entity extraction from the Web: an experimental studyArtif Intell200516519113410.1016/j.artint.2005.03.001
– reference: Elkasrawi S, Dengel A, Abdelsamad A, et al (2016) What you see is what you get? Automatic image verification for online news content. In: 2016 12th IAPR Workshop on Document Analysis Systems (DAS), pp 114–119
– reference: Babych B, Hartley A (2003) Improving machine translation quality with automatic Named Entity Recognition. In: Proceedings of the 7th International EAMT Workshop on MT and Other Language Technology Tools, Improving MT Through Other Language Technology Tools, Resource and Tools for Building MT at EACL 2003. Budapest https://aclanthology.org/W03-2201
– reference: SangEMeulderFIntroduction to the CoNLL-2003 shared task: language-independent named entity recognitionProc Seventh Conf Nat Lang Learn HLT-NAACL20032003142147
– reference: Ando T, Yatsu H, Hisazumi K, et al (2015) Reference model of specifications toward independent verification and validation. In: TENCON 2015–2015 IEEE Region 10 Conference, pp 1–3
– reference: ReddySTäckströmOCollinsMTransforming dependency structures to logical forms for semantic parsingTrans Assoc Comput Linguist2016412714010.1162/tacl_a_00088
– reference: Zhang Z, Han X, Liu Z, et al (2019) ERNIE: enhanced language representation with informative entities. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, Florence, pp 1441–1451
– reference: Poon H, Domingos P (2009) Unsupervised semantic parsing. In: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Singapore, pp 1–10. https://aclanthology.org/D09-1001
– reference: Petkova D, Croft WB (2007) Proximity-Based document representation for named entity retrieval. In: Proceedings of the Sixteenth ACM Conference on Conference on Information and Knowledge Management, CIKM ’07, Association for Computing Machinery, New York, pp 731–740
– reference: BassilYA trainable summarizer with knowledge acquired from robust NLP techniquesInt J Res Rev Comput Sci (IJRRCS)20123120792557
– reference: Wang J-H (2011) Web-based verification on the representativeness of terms extracted from single short documents. In: 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology, vol. 3, pp 114–117
– reference: Hnoohom N, Chumuang N, Ketcham M (2015) Thai Handwritten verification system on documents for the investigation. In: 2015 11th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS), pp 617–622
– reference: Takata Y, Nakamura T, Seki H (2004) Accessibility verification of WWW documents by an automatic guideline verification tool. In: 37th Annual Hawaii International Conference on System Sciences, 2004. Proceedings of the, p 10
– reference: BensefiaAPaquetTHeutteLA writer identification and verification systemPattern Recognit Lett200526132080209210.1016/j.patrec.2005.03.0241087.68677
– reference: Roychoudhury S, Bellarykar N, Kulkarni V (2016) A NLP based framework to support document verification-as-a-service. In: 2016 IEEE 20th International Enterprise Distributed Object Computing Conference (EDOC), pp 1–10
– reference: Guo J, Xu G, Cheng X, et al (2009) Named Entity Recognition in query. In: Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR ’09, Association for Computing Machinery, New York, pp 267–274
– reference: Mollá D, van Zaanen M, Smith D (2006) Named Entity Recognition for question answering. In: Proceedings of the Australasian Language Technology Workshop 2006, Proceedings of the 2006 Australasian Language Technology Workshop (ALTW2006), Sydney, Australia, pp 51–58. https://aclanthology.org/U06-1009
– reference: Naman J (2022) NER dataset. https://www.kaggle.com/namanj27/ner-dataset, Accessed 28 Dec 2022
– reference: Garain U, Halder B (2008) On automatic authenticity verification of printed security documents. In: 2008 Sixth Indian Conference on Computer Vision, Graphics and Image Processing, pp 706–713
– reference: Techopedia (2022) What does spell checker mean., 2017. https://www.techopedia.com/definition/12396/spell-checker, Accessed 28 Dec
– reference: MridhaMFLimaAANurKA survey of automatic text summarization: Progress. Process and challengesIEEE Access2021915604315607010.1109/ACCESS.2021.3129786
– reference: TolosanaRVera-RodriguezROrtega-GarciaJPreprocessing and feature selection for improved sensor interoperability in online biometric signature verificationIEEE Access2015347848910.1109/ACCESS.2015.2431493
– reference: Hamad F, Zraqou J, Maaita A, et al (2015) A secure authentication system for ePassport detection and verification. In: 2015 European Intelligence and Security Informatics Conference, pp 173–176
– reference: Wu C-H, Huang C-L, Hsu C-S, et al (2007) Speech retrieval using spoken keyword extraction and semantic verification. TENCON 2007–2007 IEEE Region 10 Conference, pp 1–4
– reference: Itcib (2022) Financial Documents Verification (2022). https://itcib.com/financial-documents-verification.html Accessed 28 Dec
– reference: ChengPErkKAttending to entities for better text understandingProc AAAI Conf Artif Intell202034575547561
– reference: Sampaio P, Santos C, Courtias J (2000) About the semantic verification of SMIL documents. In: 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532), vol. 3, pp 1675–1678
– volume: 2003
  start-page: 142
  year: 2003
  ident: 5371_CR26
  publication-title: Proc Seventh Conf Nat Lang Learn HLT-NAACL
– volume: 165
  start-page: 91
  issue: 1
  year: 2005
  ident: 5371_CR8
  publication-title: Artif Intell
  doi: 10.1016/j.artint.2005.03.001
– ident: 5371_CR25
  doi: 10.1109/ICME.2000.871093
– volume: 4
  start-page: 127
  year: 2016
  ident: 5371_CR23
  publication-title: Trans Assoc Comput Linguist
  doi: 10.1162/tacl_a_00088
– ident: 5371_CR28
– ident: 5371_CR30
  doi: 10.1109/WI-IAT.2011.258
– volume: 34
  start-page: 7554
  issue: 5
  year: 2020
  ident: 5371_CR6
  publication-title: Proc AAAI Conf Artif Intell
– ident: 5371_CR32
  doi: 10.18653/v1/P19-1139
– start-page: 266
  volume-title: Rule-based reasoning, programming, and applications
  year: 2011
  ident: 5371_CR13
  doi: 10.1007/978-3-642-22546-8_21
– ident: 5371_CR1
  doi: 10.1109/TENCON.2015.7372818
– ident: 5371_CR11
  doi: 10.1145/1571941.1571989
– ident: 5371_CR16
– volume: 3
  start-page: 478
  year: 2015
  ident: 5371_CR29
  publication-title: IEEE Access
  doi: 10.1109/ACCESS.2015.2431493
– ident: 5371_CR22
  doi: 10.3115/1699510.1699512
– ident: 5371_CR5
  doi: 10.1109/ICDAR.2011.66
– ident: 5371_CR10
  doi: 10.1109/DAS.2018.74
– ident: 5371_CR12
  doi: 10.1109/EISIC.2015.21
– ident: 5371_CR31
– volume: 3
  start-page: 2079
  issue: 1
  year: 2012
  ident: 5371_CR3
  publication-title: Int J Res Rev Comput Sci (IJRRCS)
– volume: 9
  start-page: 156043
  year: 2021
  ident: 5371_CR17
  publication-title: IEEE Access
  doi: 10.1109/ACCESS.2021.3129786
– ident: 5371_CR2
  doi: 10.3115/1609822.1609823
– ident: 5371_CR9
  doi: 10.1109/ICVGIP.2008.67
– ident: 5371_CR19
– ident: 5371_CR20
– ident: 5371_CR7
  doi: 10.1109/DAS.2016.75
– ident: 5371_CR14
  doi: 10.1109/SITIS.2015.70
– ident: 5371_CR24
  doi: 10.1109/EDOC.2016.7579376
– ident: 5371_CR27
  doi: 10.1109/HICSS.2004.1265265
– ident: 5371_CR21
  doi: 10.1145/1321440.1321542
– volume: 26
  start-page: 2080
  issue: 13
  year: 2005
  ident: 5371_CR4
  publication-title: Pattern Recognit Lett
  doi: 10.1016/j.patrec.2005.03.024
– ident: 5371_CR15
– volume: 30
  start-page: 3
  issue: 1
  year: 2007
  ident: 5371_CR18
  publication-title: Lingvist Investig
  doi: 10.1075/li.30.1.03nad
SSID ssj0004373
Score 2.343961
Snippet Many enterprise systems are document-intensive and require extensive manual verification. The verification process has challenge in terms of time and remaining...
SourceID proquest
crossref
springer
SourceType Aggregation Database
Enrichment Source
Index Database
Publisher
StartPage 19431
SubjectTerms Accuracy
Algorithms
Balance sheets
Classification
Compilers
Computer Science
Credit reports
Currency transactions
Documents
International finance
Interpreters
Matching
Processor Architectures
Programming Languages
Recognition
Verification
SummonAdditionalLinks – databaseName: SpringerLink Contemporary (1997 - Present)
  dbid: RSV
  link: http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV07T8MwELagMLBQnqJQkAc2sBQ_gu0RoVZMBZWHukVO7NJKJUFtisS_55wHEQiQYMoQ27LufPnucr77EDrVgeAiMJZcSB0TobQiRlJKdAj-vbQmdCIpyCbkYKBGI31bFYUt6tvudUqy-FI3xW6UMUkAY4jvQQKRzypaA7hTnrBhePfYVEPyMq-sITBSoWBVqcz3a3yGo8bH_JIWLdCm3_7fPrfQZuVd4svyOGyjFZfuoHbN3IArQ95FN710UqT-8cAAHOKeL9Z9w8P6NlGWYjN7yubTfPKMwa3F47oxB7ZZsvR_FDHYgL9mVGh2Dz30e_dX16SiViAJ2FxOKLNaOSuodqFIAucZrIzQQiobWBoDijsdxI4xZYwwHPYTw0M4zhIVcyv5PmqlWeoOEBYwdRz7QEsJiC2loUY65fvIcSdCozqI1hKOkqrvuKe_mEVNx2QvsQgkFhUSi0QHnX3MeSm7bvw6ulsrLqoscBFx8OUgFAqY7qDzWlHN659XO_zb8CO04Rnoy_LELmrl86U7RuvJaz5dzE-Kk_kO5gTafg
  priority: 102
  providerName: Springer Nature
Title Enhanced Named Entity Recognition algorithm for financial document verification
URI https://link.springer.com/article/10.1007/s11227-023-05371-4
https://www.proquest.com/docview/3256585029
Volume 79
WOSCitedRecordID wos000994797400001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVPQU
  databaseName: Advanced Technologies & Aerospace Database
  customDbUrl:
  eissn: 1573-0484
  dateEnd: 20241212
  omitProxy: false
  ssIdentifier: ssj0004373
  issn: 0920-8542
  databaseCode: P5Z
  dateStart: 20230101
  isFulltext: true
  titleUrlDefault: https://search.proquest.com/hightechjournals
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: Computer Science Database
  customDbUrl:
  eissn: 1573-0484
  dateEnd: 20241212
  omitProxy: false
  ssIdentifier: ssj0004373
  issn: 0920-8542
  databaseCode: K7-
  dateStart: 20230101
  isFulltext: true
  titleUrlDefault: http://search.proquest.com/compscijour
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: Engineering Database
  customDbUrl:
  eissn: 1573-0484
  dateEnd: 20241212
  omitProxy: false
  ssIdentifier: ssj0004373
  issn: 0920-8542
  databaseCode: M7S
  dateStart: 20230101
  isFulltext: true
  titleUrlDefault: http://search.proquest.com
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: ProQuest Central
  customDbUrl:
  eissn: 1573-0484
  dateEnd: 20241212
  omitProxy: false
  ssIdentifier: ssj0004373
  issn: 0920-8542
  databaseCode: BENPR
  dateStart: 20230101
  isFulltext: true
  titleUrlDefault: https://www.proquest.com/central
  providerName: ProQuest
– providerCode: PRVAVX
  databaseName: SpringerLINK Contemporary 1997-Present
  customDbUrl:
  eissn: 1573-0484
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0004373
  issn: 0920-8542
  databaseCode: RSV
  dateStart: 19970101
  isFulltext: true
  titleUrlDefault: https://link.springer.com/search?facet-content-type=%22Journal%22
  providerName: Springer Nature
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1LS8QwEB509eDFt7i6Ljl402DTpiY5icqKINRlfbB4KWmbdQXt6j4E_72TNrUo6MVLe2iThn6ZzEwyMx_AvvJ4wD2d0WOhEsqlklQLxqgK0b4XmQ4NTwuyCRFFst9XXbfhNnFhldWaWCzU2Si1e-RHAepmNG09X528vlHLGmVPVx2Fxjws2CoJrAjdu6nzIoPyhFmhiyRD7rukmTJ1jvm-oKixqK1ogn7Ud8VUW5s_DkgLvXOx8t8Rr8KyszjJaTlF1mDO5OuwUrE5ECfcG3DdyYdFOACJNKpI0rEJvB-kV0UYjXKinx_xA9PhC0FTlwyqYh0ExzKzu4wE5cKGHhVob8LdRef2_JI6ugWaohxOKfMzJU3GmTIhTz1jWa00V1zIzMtYgprdKC8xvi-15jrA8SR44ybwU5kEmQi2oJGPcrMNhGPTQWKdL8nR3xSaaWGkrS0XGB5q2QRW_es4dbXILSXGc1xXUbb4xIhPXOAT8yYcfLV5LStx_Pl2qwIldlI5iWtEmnBYwVo__r23nb9724Uly0Jfpii2oDEdz8weLKbv06fJuA0LZ52o22vD_JWg7WKG4rUbPuC1d3P_CRJM6Sk
linkProvider ProQuest
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMw1V1NT9wwEB3BFgku0FIQS6H4UE5gkdgOtg8IIboItHRbVSBxS53YC0iQXdgFxJ_qb2ScxERFKjcOnHJIbMnxmy97Zh7ANx0JLiJj6bbUGRVKK2pkHFOdoH8vrUmcyEuyCdnrqbMz_WsC_oZaGJ9WGXRiqajtIPdn5FscbTO6thHTu8Mb6lmj_O1qoNCoYNF1jw8Yso12jr7j_q4zdtA52T-kNasAzRFuYxozq5WzItYuEXnkPHmTEVpIZSMbZ2jAnI4yx5gyRhieO5vhQzjOcpVxKznOOwkfBFfSy1VX0qYOk1c32hpDMpUIVhfpVKV6MWOSooWkvoMKxm3_GsLGu31xIVvauYO59_aHPsJs7VGTvUoEPsGEK-ZhLrBVkFp5fYafneKiTHcgPYMuAOn4AuVH8jtkUA0KYq7OcUHji2uCrjzph2YkBNd-509RCcq9T60q0bwAp2-yrEVoFYPCLQEROLSf-eBSCYynpYmNdMr3zuNOJEa1IQ57m-Z1r3VP-XGVNl2iPR5SxENa4iEVbdh4HjOsOo28-vVKAEFaa51R2iCgDZsBRs3r_8-2_PpsazB9ePLjOD0-6nW_wAzzKC7LMVegNb69c6swld-PL0e3X0t5IPDnreH1BG09QZE
linkToPdf http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1LSwMxEB60inixPrFaNQdvGrqPrEmOoi2KshZf9LZkN6kt1N3SbgX_vck-XBUVxNMe8iDMZJiZzXzfABxyi7jEEhKfUB5iwjjDgto25p6O76kUniJR1myC-j7r9Xj3A4o_q3YvnyRzTINhaYrT1lj2WxXwzXYcirW_wYaPRGdB87BATCG9ydfvHitkpJu_MXOdJDGPOAVs5vs9PrumKt788kSaeZ5O_f9nXoWVIupEp_k1WYM5Fa9DvezogAoD34CbdjzISgKQL7SbRG0D4n1Ft2WVURIjMXpKJsN08Ix0uIv6JWEHkkk0M38akbYNU36UaXwTHjrt-7MLXLRcwJG2xRTbjuRMSWJz5ZHIUqazlSCcUCYtaYfauytuhcpxmBBEuPo8of4Q5ToRC11J3S2oxUmstgERvbQfmgSMEZ1zUmELqpjhl3MV8QRrgF1KO4gKPnLTFmMUVEzKRmKBlliQSSwgDTh6XzPO2Th-nd0slRgUljkNXB3j6RTJcngDjkulVcM_77bzt-kHsNQ97wTXl_7VLiybJvU5grEJtXQyU3uwGL2kw-lkP7uwb38J5kY
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Enhanced+Named+Entity+Recognition+algorithm+for+financial+document+verification&rft.jtitle=The+Journal+of+supercomputing&rft.au=Toprak%2C+Ahmet&rft.au=Turan%2C+Metin&rft.date=2023-11-01&rft.pub=Springer+Nature+B.V&rft.issn=0920-8542&rft.eissn=1573-0484&rft.volume=79&rft.issue=17&rft.spage=19431&rft.epage=19451&rft_id=info:doi/10.1007%2Fs11227-023-05371-4
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0920-8542&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0920-8542&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0920-8542&client=summon