3SEPIAS: A Semi-Structured Search Engine for Personal Information in dAtaspace System
| Published in: | Information sciences Vol. 218; pp. 31 - 50 |
|---|---|
| Main Authors: | Zhong, Ming; Liu, Mengchi; He, Yanxiang |
| Format: | Journal Article |
| Language: | English |
| Published: | Elsevier Inc, 01.01.2013 |
| Subjects: | |
| ISSN: | 0020-0255, 1872-6291 |
| Abstract | Personal information is increasingly scattered across heterogeneous sources, which makes it hard to manage and retrieve. To address this problem, this paper presents the blueprint of a novel Personal Information Management (PIM) system named 3SEPIAS (short for Semi-Structured Search Engine for Personal Information in dAtaspace System).
3SEPIAS has three main features: data integration without upfront semantic reconciliation, a flexible query model for data with sparse and evolving schemas, and an efficient best-effort proximity search approach on graphs. We first propose a semi-structured graph data model called the Interpreted Object Model (IOM), which uniformly represents a user's heterogeneous personal information and loosely integrates it into a dataspace in a schema-later way. A Semi-Structured Search Engine (3SE) is then used to search over the personal dataspace. We propose an intuitive 3SE Query Language (3SQL) that lets users apply varying degrees of structural constraint according to their knowledge of the underlying schemas. Moreover, a best-effort top-k proximity search optimization strategy and corresponding graph index structures are proposed to improve the efficiency of query processing.
We perform comprehensive experiments to test both the effectiveness and the efficiency of our proximity search approach. The results show that 3SE outperforms previous proximity search systems by a large margin with little or no loss of result quality, especially on large graphs. |
|---|---|
| Author | He, Yanxiang; Liu, Mengchi; Zhong, Ming |
| Author_xml | – sequence: 1 givenname: Ming surname: Zhong fullname: Zhong, Ming email: mike.clark.whu@gmail.com organization: State Key Laboratory of Software Engineering, Wuhan University, Luojiashan, Wuhan 430072, China – sequence: 2 givenname: Mengchi surname: Liu fullname: Liu, Mengchi email: mengchi@scs.carleton.ca organization: School of Computer Science, Carleton University, 1125 Colonel By Drive, Ottawa, Canada K1S 5B6 – sequence: 3 givenname: Yanxiang surname: He fullname: He, Yanxiang email: yxhe@whu.edu.cn organization: State Key Laboratory of Software Engineering, Wuhan University, Luojiashan, Wuhan 430072, China |
| CitedBy_id | crossref_primary_10_1016_j_jksuci_2014_03_017 crossref_primary_10_1145_3003665_3003672 crossref_primary_10_1007_s41870_023_01518_x crossref_primary_10_1109_TKDE_2014_2310207 |
| Cites_doi | 10.1145/1142351.1142352 10.1016/B978-155860869-6/50065-2 10.1145/641043.641053 10.1109/ICDE.2006.67 10.1145/320719.322582 10.1145/383952.383985 10.1016/j.ins.2009.06.025 10.1145/1247480.1247487 10.1145/1376616.1376701 10.1109/IRI.2008.4583065 10.1016/B978-012722442-8/50013-6 10.1145/1645953.1646131 10.1145/348751.348758 10.1145/371920.372057 10.1016/j.ins.2011.12.011 10.1109/IDEAS.2000.880562 10.1145/1107499.1107502 10.1145/1620432.1620453 10.1145/1066157.1066217 10.1145/872760.872761 10.1145/275487.275488 10.1145/971699.318923 10.1016/B978-012722442-8/50080-X 10.1145/1007568.1007656 10.1016/B978-012088469-8.50010-3 10.1109/DEXA.2009.23 10.1145/860450.860451 10.1145/381854.381893 10.1145/1099554.1099559 10.1145/1247480.1247516 10.1145/1066157.1066168 10.1145/988672.988751 10.1145/1376616.1376702 10.1145/121133.121138 10.1109/ICDE.2002.994756 10.1145/564691.564782 10.1145/872760.872762 |
| ContentType | Journal Article |
| Copyright | 2012 Elsevier Inc. |
| Copyright_xml | – notice: 2012 Elsevier Inc. |
| DBID | AAYXX CITATION |
| DOI | 10.1016/j.ins.2012.06.013 |
| DatabaseName | CrossRef |
| DatabaseTitle | CrossRef |
| DatabaseTitleList | |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Engineering; Library & Information Science |
| EISSN | 1872-6291 |
| EndPage | 50 |
| ExternalDocumentID | 10_1016_j_ins_2012_06_013 S0020025512004082 |
| ID | FETCH-LOGICAL-c297t-508e1bb27d095a964abf4f0425b1ff537a9c33107a89a17c59c15d9f58fcc6ab3 |
| ISICitedReferencesCount | 3 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000311194900003&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| ISSN | 0020-0255 |
| IngestDate | Sat Nov 29 08:03:26 EST 2025 Tue Nov 18 21:55:37 EST 2025 Fri Feb 23 02:23:14 EST 2024 |
| IsPeerReviewed | true |
| IsScholarly | true |
| Keywords | Query processing and optimization; Personal information management; Graph data model; Dataspace; Semi-structured query |
| Language | English |
| License | https://www.elsevier.com/tdm/userlicense/1.0 |
| LinkModel | OpenURL |
| MergedId | FETCHMERGED-LOGICAL-c297t-508e1bb27d095a964abf4f0425b1ff537a9c33107a89a17c59c15d9f58fcc6ab3 |
| PageCount | 20 |
| ParticipantIDs | crossref_primary_10_1016_j_ins_2012_06_013 crossref_citationtrail_10_1016_j_ins_2012_06_013 elsevier_sciencedirect_doi_10_1016_j_ins_2012_06_013 |
| PublicationCentury | 2000 |
| PublicationDate | 2013-01-01 2013-1-00 |
| PublicationDateYYYYMMDD | 2013-01-01 |
| PublicationDate_xml | – month: 01 year: 2013 text: 2013-01-01 day: 01 |
| PublicationDecade | 2010 |
| PublicationTitle | Information sciences |
| PublicationYear | 2013 |
| Publisher | Elsevier Inc |
| Publisher_xml | – name: Elsevier Inc |
| StartPage | 31 |
| SubjectTerms | Dataspace; Graph data model; Personal information management; Query processing and optimization; Semi-structured query |
| Title | 3SEPIAS: A Semi-Structured Search Engine for Personal Information in dAtaspace System |
| URI | https://dx.doi.org/10.1016/j.ins.2012.06.013 |
| Volume | 218 |
| Authors | Ming Zhong; Mengchi Liu; Yanxiang He |