3SEPIAS: A Semi-Structured Search Engine for Personal Information in dAtaspace System

Bibliographic details
Published in: Information Sciences, Volume 218, pp. 31-50
Main authors: Zhong, Ming; Liu, Mengchi; He, Yanxiang
Format: Journal Article
Language: English
Published: Elsevier Inc, 01.01.2013
ISSN: 0020-0255, 1872-6291
Abstract Personal information is increasingly distributed across heterogeneous sources, which is a major obstacle to managing and retrieving it. To address this problem, this paper presents the blueprint of a novel Personal Information Management (PIM) system named 3SEPIAS (short for Semi-Structured Search Engine for Personal Information in dAtaspace System). 3SEPIAS has three main features: data integration without upfront semantic reconciliation, a flexible query model for data with sparse and evolving schemas, and an efficient best-effort proximity search approach on graphs. To this end, we first propose a semi-structured graph data model called the Interpreted Object Model (IOM), which uniformly represents a user’s heterogeneous personal information and loosely integrates it into a dataspace in a schema-later way. A Semi-Structured Search Engine (3SE) can then be used to search over personal dataspaces. We propose an intuitive 3SE Query Language (3SQL) that lets users query with varying degrees of structural constraint, according to their knowledge of the underlying schemas. Moreover, a best-effort top-k proximity search optimization strategy and corresponding graph index structures are proposed to improve the efficiency of query processing. We perform comprehensive experiments to test both the effectiveness and the efficiency of our proximity search approach. The results show that 3SE outperforms previous proximity search systems by a large margin with little or no loss of result quality, especially on large graphs.
Author He, Yanxiang
Liu, Mengchi
Zhong, Ming
Author_xml – sequence: 1
  givenname: Ming
  surname: Zhong
  fullname: Zhong, Ming
  email: mike.clark.whu@gmail.com
  organization: State Key Laboratory of Software Engineering, Wuhan University, Luojiashan, Wuhan 430072, China
– sequence: 2
  givenname: Mengchi
  surname: Liu
  fullname: Liu, Mengchi
  email: mengchi@scs.carleton.ca
  organization: School of Computer Science, Carleton University, 1125 Colonel By Drive, Ottawa, Canada K1S 5B6
– sequence: 3
  givenname: Yanxiang
  surname: He
  fullname: He, Yanxiang
  email: yxhe@whu.edu.cn
  organization: State Key Laboratory of Software Engineering, Wuhan University, Luojiashan, Wuhan 430072, China
ContentType Journal Article
Copyright 2012 Elsevier Inc.
DOI 10.1016/j.ins.2012.06.013
Discipline Engineering
Library & Information Science
EISSN 1872-6291
EndPage 50
ExternalDocumentID 10_1016_j_ins_2012_06_013
S0020025512004082
ISICitedReferencesCount 3
ISSN 0020-0255
IsPeerReviewed true
IsScholarly true
Keywords Query processing and optimization
Personal information management
Graph data model
Dataspace
Semi-structured query
Language English
License https://www.elsevier.com/tdm/userlicense/1.0
PageCount 20
PublicationCentury 2000
PublicationDate 2013-01-01
2013-1-00
PublicationDateYYYYMMDD 2013-01-01
PublicationDate_xml – month: 01
  year: 2013
  text: 2013-01-01
  day: 01
PublicationDecade 2010
PublicationTitle Information sciences
PublicationYear 2013
Publisher Elsevier Inc
Publisher_xml – name: Elsevier Inc
  article-title: Lifestreams: a storage model for personal data
  publication-title: SIGMOD Record
  doi: 10.1145/381854.381893
– ident: 10.1016/j.ins.2012.06.013_b0175
  doi: 10.1145/1099554.1099559
– ident: 10.1016/j.ins.2012.06.013_b0005
– ident: 10.1016/j.ins.2012.06.013_b0145
  doi: 10.1145/1247480.1247516
– ident: 10.1016/j.ins.2012.06.013_b0060
– ident: 10.1016/j.ins.2012.06.013_b0080
  doi: 10.1145/1066157.1066168
– ident: 10.1016/j.ins.2012.06.013_b0085
– ident: 10.1016/j.ins.2012.06.013_b0020
  doi: 10.1145/988672.988751
– ident: 10.1016/j.ins.2012.06.013_b0220
– ident: 10.1016/j.ins.2012.06.013_b0225
  doi: 10.1145/1376616.1376702
– ident: 10.1016/j.ins.2012.06.013_b0125
  doi: 10.1145/121133.121138
– ident: 10.1016/j.ins.2012.06.013_b0035
  doi: 10.1109/ICDE.2002.994756
– ident: 10.1016/j.ins.2012.06.013_b0010
  doi: 10.1145/564691.564782
– ident: 10.1016/j.ins.2012.06.013_b0135
  doi: 10.1145/872760.872762
StartPage 31
SubjectTerms Dataspace; Graph data model; Personal information management; Query processing and optimization; Semi-structured query
Title 3SEPIAS: A Semi-Structured Search Engine for Personal Information in dAtaspace System
URI https://dx.doi.org/10.1016/j.ins.2012.06.013
Volume 218