MapMarker: Extraction of Postal Addresses and Associated Information for General Web Pages
Address information is essential for people's daily life. People often need to query addresses of unfamiliar location through Web and then use map services to mark down the location for direction purpose. Although both address information and map services are available online, they are not well...
Uloženo v:
| Vydáno v: | 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology Ročník 1; s. 105 - 111 |
|---|---|
| Hlavní autoři: | , |
| Médium: | Konferenční příspěvek |
| Jazyk: | angličtina |
| Vydáno: |
IEEE
01.08.2010
|
| Témata: | |
| ISBN: | 9781424484829, 1424484820 |
| On-line přístup: | Získat plný text |
| Tagy: |
Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
|
| Abstract | Address information is essential for people's daily life. People often need to query addresses of unfamiliar location through Web and then use map services to mark down the location for direction purpose. Although both address information and map services are available online, they are not well combined. Users usually need to copy individual address from a Web site and paste it to another Web site with map services to locate its direction. Such copy and paste operations have to be repeated if multiple addresses are listed on a single page such as public school list or apartment list. Furthermore, associated information with individual address has to be copied and included on each marker for better comprehension. Our research is devoted to automate the above process and make the combination an easier task for users. The main techniques applied here include postal address extraction and associated information extraction. We apply sequence labeling algorithm based on Conditional Random Fields (CRFs) to train models for address extraction. Meanwhile, using the extracted addresses as landmarks, we apply pattern mining to identify the boundaries of address blocks and extract associated information with each individual address. The experimental result shows high F-score at 91% for postal address extraction and 87% accuracy for associated information extraction. |
|---|---|
| AbstractList | Address information is essential for people's daily life. People often need to query addresses of unfamiliar location through Web and then use map services to mark down the location for direction purpose. Although both address information and map services are available online, they are not well combined. Users usually need to copy individual address from a Web site and paste it to another Web site with map services to locate its direction. Such copy and paste operations have to be repeated if multiple addresses are listed on a single page such as public school list or apartment list. Furthermore, associated information with individual address has to be copied and included on each marker for better comprehension. Our research is devoted to automate the above process and make the combination an easier task for users. The main techniques applied here include postal address extraction and associated information extraction. We apply sequence labeling algorithm based on Conditional Random Fields (CRFs) to train models for address extraction. Meanwhile, using the extracted addresses as landmarks, we apply pattern mining to identify the boundaries of address blocks and extract associated information with each individual address. The experimental result shows high F-score at 91% for postal address extraction and 87% accuracy for associated information extraction. |
| Author | Chang, Chia-Hui Li, Shu-Ying |
| Author_xml | – sequence: 1 givenname: Chia-Hui surname: Chang fullname: Chang, Chia-Hui email: chia@csie.ncu.edu.tw organization: Dept. of Comput. Sci. & Inf. Eng., Nat. Central Univ., Jhongli, Taiwan – sequence: 2 givenname: Shu-Ying surname: Li fullname: Li, Shu-Ying email: 965202104@cc.ncu.edu.tw organization: Dept. of Comput. Sci. & Inf. Eng., Nat. Central Univ., Jhongli, Taiwan |
| BookMark | eNotTj1PwzAUNAIkoGRlYfEfSLGdZ8dmi6pSIrWiQ1ElluolfkGBNqnsDPDvGxWm-9Dd6e7YVdd3xNiDFFMphXvalmlZbKZKjIaBC5a43IrcOA3SSbg8awkKwIJV7oYlMX4JIaRUArS9ZR8rPK4wfFN45vOfIWA9tH3H-4av-zjgnhfeB4qRIsfO8yLGvm5xIM_LrunDAc_xkfEFdRTGwpYqvsZPivfsusF9pOQfJ-z9Zb6ZvabLt0U5K5YpSiOGFDCnWhsNQONx2whFKiPQzvtae6OpsRXVLgdpHSHkonJVI9GAN0oj2WzCHv92WyLaHUN7wPC700YaJVx2AvUyVXE |
| ContentType | Conference Proceeding |
| DBID | 6IE 6IL CBEJK RIE RIL |
| DOI | 10.1109/WI-IAT.2010.64 |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Xplore IEEE Proceedings Order Plans (POP All) 1998-Present |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Computer Science |
| EISBN | 9780769541914 0769541917 |
| EndPage | 111 |
| ExternalDocumentID | 5616209 |
| Genre | orig-research |
| GroupedDBID | 6IE 6IL ACM ALMA_UNASSIGNED_HOLDINGS APO CBEJK GUFHI LHSKQ RIB RIC RIE RIL |
| ID | FETCH-LOGICAL-a160t-4a7ec56544e0768f02e23e459ddc5d65ef8bec974189ea470b9bf1a64d625ae83 |
| IEDL.DBID | RIE |
| ISBN | 9781424484829 1424484820 |
| IngestDate | Wed Sep 03 07:11:07 EDT 2025 |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-a160t-4a7ec56544e0768f02e23e459ddc5d65ef8bec974189ea470b9bf1a64d625ae83 |
| PageCount | 7 |
| ParticipantIDs | ieee_primary_5616209 |
| PublicationCentury | 2000 |
| PublicationDate | 2010-Aug. |
| PublicationDateYYYYMMDD | 2010-08-01 |
| PublicationDate_xml | – month: 08 year: 2010 text: 2010-Aug. |
| PublicationDecade | 2010 |
| PublicationTitle | 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology |
| PublicationTitleAbbrev | wi-iat |
| PublicationYear | 2010 |
| Publisher | IEEE |
| Publisher_xml | – name: IEEE |
| SSID | ssj0001120458 ssj0000452489 |
| Score | 1.5417923 |
| Snippet | Address information is essential for people's daily life. People often need to query addresses of unfamiliar location through Web and then use map services to... |
| SourceID | ieee |
| SourceType | Publisher |
| StartPage | 105 |
| SubjectTerms | address extraction associated information extraction Conditional random fields Data mining Feature extraction Hidden Markov models Information retrieval Labeling Random variables Urban areas Web pages Web sites |
| Title | MapMarker: Extraction of Postal Addresses and Associated Information for General Web Pages |
| URI | https://ieeexplore.ieee.org/document/5616209 |
| Volume | 1 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlZ09T8MwEIatUjEwFWgR3_LAiKmTOrHNVqFWdKDqUNSKpXLis4SE0ipNET8f23EKAwubE2WIfEre5O7e5xC6SzJDM64GRKRSEgacEilUTrgj4ioreUZoP2yCT6diuZSzFrrfe2EAwDefwYNb-lq-Xuc7lyrrW61PY-fWO-Cc116tfT7FocFZIKf7_ErkQOui8XIJZqWuQTyFYxkgjhGV_cWETIbzutXL0Qd-jVrxSjPu_O8ej1Hvx7KHZ3sxOkEtKE5Rp5nZgMMj3EVvL2rj_DlQPuLRV1XWvga8NtiN7VUfeKi154lvsSo0bqIHGgfjkr_crnAAVuMFZHhm30rbHnodj-ZPzyTMVyAqSmlFmOKQ2w86xsDV4wyNIR4AS6TWeaLTBIywEZaObyNBMU4zmZlIpUzbnyYFYnCG2sW6gHOEhVHGaKpSyA0TKlI0Z5IqLbMUqJHxBeq6rVptaoTGKuzS5d-nr9BRXaR3fXbXqF2VO7hBh_ln9b4tb33cvwHhfKnP |
| linkProvider | IEEE |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlZ3PT8IwFMcboiZ6QgXjb3vwaKUb3dZ6IwYCEQgHDMQL6dbXxMQMMobxz7ftCnrw4q1bdljabm97730_X4Tuo1TTNJFtwmMhCIOEEsFlRhJLxJUm5GmunNlEMh7z-VxMauhhp4UBANd8Bo926Gr5apltbKqsZWJ9HFq13n7EWBhUaq1dRsXCwZlnp7sMS2BR63yr5uLMBLst5MkfC49xDKhozQZk0JlWzV6WP_DLbMXFml79f3d5jJo_oj082YWjE1SD_BTVt64N2D_EDfQ2kiur0IHiCXe_yqJSNuClxta4V37gjlKOKL7GMld4u36gsJcuucvNCHtkNZ5BiifmvbRuotded_rcJ95hgcggpiVhMoHMfNIxBrYip2kIYRtYJJTKIhVHoLlZY2EJNwIkS2gqUh3ImCnz2ySBt8_QXr7M4RxhrqXWisoYMs24DCTNmKBSiTQGqkV4gRp2qharCqKx8LN0-ffpO3TYn46Gi-Fg_HKFjqqSve26u0Z7ZbGBG3SQfZbv6-LW7YFv6JOtFg |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2010+IEEE%2FWIC%2FACM+International+Conference+on+Web+Intelligence+and+Intelligent+Agent+Technology&rft.atitle=MapMarker%3A+Extraction+of+Postal+Addresses+and+Associated+Information+for+General+Web+Pages&rft.au=Chang%2C+Chia-Hui&rft.au=Li%2C+Shu-Ying&rft.date=2010-08-01&rft.pub=IEEE&rft.isbn=9781424484829&rft.volume=1&rft.spage=105&rft.epage=111&rft_id=info:doi/10.1109%2FWI-IAT.2010.64&rft.externalDocID=5616209 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781424484829/lc.gif&client=summon&freeimage=true |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781424484829/mc.gif&client=summon&freeimage=true |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781424484829/sc.gif&client=summon&freeimage=true |

