MapMarker: Extraction of Postal Addresses and Associated Information for General Web Pages

Address information is essential for people's daily life. People often need to query addresses of unfamiliar location through Web and then use map services to mark down the location for direction purpose. Although both address information and map services are available online, they are not well...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology Ročník 1; s. 105 - 111
Hlavní autoři: Chang, Chia-Hui, Li, Shu-Ying
Médium: Konferenční příspěvek
Jazyk:angličtina
Vydáno: IEEE 01.08.2010
Témata:
ISBN:9781424484829, 1424484820
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Abstract Address information is essential for people's daily life. People often need to query addresses of unfamiliar location through Web and then use map services to mark down the location for direction purpose. Although both address information and map services are available online, they are not well combined. Users usually need to copy individual address from a Web site and paste it to another Web site with map services to locate its direction. Such copy and paste operations have to be repeated if multiple addresses are listed on a single page such as public school list or apartment list. Furthermore, associated information with individual address has to be copied and included on each marker for better comprehension. Our research is devoted to automate the above process and make the combination an easier task for users. The main techniques applied here include postal address extraction and associated information extraction. We apply sequence labeling algorithm based on Conditional Random Fields (CRFs) to train models for address extraction. Meanwhile, using the extracted addresses as landmarks, we apply pattern mining to identify the boundaries of address blocks and extract associated information with each individual address. The experimental result shows high F-score at 91% for postal address extraction and 87% accuracy for associated information extraction.
AbstractList Address information is essential for people's daily life. People often need to query addresses of unfamiliar location through Web and then use map services to mark down the location for direction purpose. Although both address information and map services are available online, they are not well combined. Users usually need to copy individual address from a Web site and paste it to another Web site with map services to locate its direction. Such copy and paste operations have to be repeated if multiple addresses are listed on a single page such as public school list or apartment list. Furthermore, associated information with individual address has to be copied and included on each marker for better comprehension. Our research is devoted to automate the above process and make the combination an easier task for users. The main techniques applied here include postal address extraction and associated information extraction. We apply sequence labeling algorithm based on Conditional Random Fields (CRFs) to train models for address extraction. Meanwhile, using the extracted addresses as landmarks, we apply pattern mining to identify the boundaries of address blocks and extract associated information with each individual address. The experimental result shows high F-score at 91% for postal address extraction and 87% accuracy for associated information extraction.
Author Chang, Chia-Hui
Li, Shu-Ying
Author_xml – sequence: 1
  givenname: Chia-Hui
  surname: Chang
  fullname: Chang, Chia-Hui
  email: chia@csie.ncu.edu.tw
  organization: Dept. of Comput. Sci. & Inf. Eng., Nat. Central Univ., Jhongli, Taiwan
– sequence: 2
  givenname: Shu-Ying
  surname: Li
  fullname: Li, Shu-Ying
  email: 965202104@cc.ncu.edu.tw
  organization: Dept. of Comput. Sci. & Inf. Eng., Nat. Central Univ., Jhongli, Taiwan
BookMark eNotTj1PwzAUNAIkoGRlYfEfSLGdZ8dmi6pSIrWiQ1ElluolfkGBNqnsDPDvGxWm-9Dd6e7YVdd3xNiDFFMphXvalmlZbKZKjIaBC5a43IrcOA3SSbg8awkKwIJV7oYlMX4JIaRUArS9ZR8rPK4wfFN45vOfIWA9tH3H-4av-zjgnhfeB4qRIsfO8yLGvm5xIM_LrunDAc_xkfEFdRTGwpYqvsZPivfsusF9pOQfJ-z9Zb6ZvabLt0U5K5YpSiOGFDCnWhsNQONx2whFKiPQzvtae6OpsRXVLgdpHSHkonJVI9GAN0oj2WzCHv92WyLaHUN7wPC700YaJVx2AvUyVXE
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/WI-IAT.2010.64
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Xplore
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISBN 9780769541914
0769541917
EndPage 111
ExternalDocumentID 5616209
Genre orig-research
GroupedDBID 6IE
6IL
ACM
ALMA_UNASSIGNED_HOLDINGS
APO
CBEJK
GUFHI
LHSKQ
RIB
RIC
RIE
RIL
ID FETCH-LOGICAL-a160t-4a7ec56544e0768f02e23e459ddc5d65ef8bec974189ea470b9bf1a64d625ae83
IEDL.DBID RIE
ISBN 9781424484829
1424484820
IngestDate Wed Sep 03 07:11:07 EDT 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-a160t-4a7ec56544e0768f02e23e459ddc5d65ef8bec974189ea470b9bf1a64d625ae83
PageCount 7
ParticipantIDs ieee_primary_5616209
PublicationCentury 2000
PublicationDate 2010-Aug.
PublicationDateYYYYMMDD 2010-08-01
PublicationDate_xml – month: 08
  year: 2010
  text: 2010-Aug.
PublicationDecade 2010
PublicationTitle 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology
PublicationTitleAbbrev wi-iat
PublicationYear 2010
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0001120458
ssj0000452489
Score 1.5417923
Snippet Address information is essential for people's daily life. People often need to query addresses of unfamiliar location through Web and then use map services to...
SourceID ieee
SourceType Publisher
StartPage 105
SubjectTerms address extraction
associated information extraction
Conditional random fields
Data mining
Feature extraction
Hidden Markov models
Information retrieval
Labeling
Random variables
Urban areas
Web pages
Web sites
Title MapMarker: Extraction of Postal Addresses and Associated Information for General Web Pages
URI https://ieeexplore.ieee.org/document/5616209
Volume 1
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlZ09T8MwEIatUjEwFWgR3_LAiKmTOrHNVqFWdKDqUNSKpXLis4SE0ipNET8f23EKAwubE2WIfEre5O7e5xC6SzJDM64GRKRSEgacEilUTrgj4ioreUZoP2yCT6diuZSzFrrfe2EAwDefwYNb-lq-Xuc7lyrrW61PY-fWO-Cc116tfT7FocFZIKf7_ErkQOui8XIJZqWuQTyFYxkgjhGV_cWETIbzutXL0Qd-jVrxSjPu_O8ej1Hvx7KHZ3sxOkEtKE5Rp5nZgMMj3EVvL2rj_DlQPuLRV1XWvga8NtiN7VUfeKi154lvsSo0bqIHGgfjkr_crnAAVuMFZHhm30rbHnodj-ZPzyTMVyAqSmlFmOKQ2w86xsDV4wyNIR4AS6TWeaLTBIywEZaObyNBMU4zmZlIpUzbnyYFYnCG2sW6gHOEhVHGaKpSyA0TKlI0Z5IqLbMUqJHxBeq6rVptaoTGKuzS5d-nr9BRXaR3fXbXqF2VO7hBh_ln9b4tb33cvwHhfKnP
linkProvider IEEE
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlZ3PT8IwFMcboiZ6QgXjb3vwaKUb3dZ6IwYCEQgHDMQL6dbXxMQMMobxz7ftCnrw4q1bdljabm97730_X4Tuo1TTNJFtwmMhCIOEEsFlRhJLxJUm5GmunNlEMh7z-VxMauhhp4UBANd8Bo926Gr5apltbKqsZWJ9HFq13n7EWBhUaq1dRsXCwZlnp7sMS2BR63yr5uLMBLst5MkfC49xDKhozQZk0JlWzV6WP_DLbMXFml79f3d5jJo_oj082YWjE1SD_BTVt64N2D_EDfQ2kiur0IHiCXe_yqJSNuClxta4V37gjlKOKL7GMld4u36gsJcuucvNCHtkNZ5BiifmvbRuotded_rcJ95hgcggpiVhMoHMfNIxBrYip2kIYRtYJJTKIhVHoLlZY2EJNwIkS2gqUh3ImCnz2ySBt8_QXr7M4RxhrqXWisoYMs24DCTNmKBSiTQGqkV4gRp2qharCqKx8LN0-ffpO3TYn46Gi-Fg_HKFjqqSve26u0Z7ZbGBG3SQfZbv6-LW7YFv6JOtFg
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2010+IEEE%2FWIC%2FACM+International+Conference+on+Web+Intelligence+and+Intelligent+Agent+Technology&rft.atitle=MapMarker%3A+Extraction+of+Postal+Addresses+and+Associated+Information+for+General+Web+Pages&rft.au=Chang%2C+Chia-Hui&rft.au=Li%2C+Shu-Ying&rft.date=2010-08-01&rft.pub=IEEE&rft.isbn=9781424484829&rft.volume=1&rft.spage=105&rft.epage=111&rft_id=info:doi/10.1109%2FWI-IAT.2010.64&rft.externalDocID=5616209
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781424484829/lc.gif&client=summon&freeimage=true
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781424484829/mc.gif&client=summon&freeimage=true
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781424484829/sc.gif&client=summon&freeimage=true