Applying genetic algorithms to query optimization in document retrieval

This paper proposes a novel approach to automatically retrieve keywords and then uses genetic algorithms to adapt the keyword weights. One of the contributions of the paper is to combine the Bigram (Chen, A., He, J., Xu, L., Gey, F. C., & Meggs, J. 1997. Chinese text retrieval without using a di...

Full description

Saved in:
Bibliographic Details
Published in:Information processing & management Vol. 36; no. 5; pp. 737 - 759
Main Authors: Horng, Jorng-Tzong, Yeh, Ching-Chang
Format: Journal Article
Language:English
Published: Oxford Elsevier Ltd 01.09.2000
Elsevier Science
Elsevier Science Ltd
Subjects:
ISSN:0306-4573, 1873-5371
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Abstract This paper proposes a novel approach to automatically retrieve keywords and then uses genetic algorithms to adapt the keyword weights. One of the contributions of the paper is to combine the Bigram (Chen, A., He, J., Xu, L., Gey, F. C., & Meggs, J. 1997. Chinese text retrieval without using a dictionary, ACM SIGIR’97, Philadelphia, PA, USA, pp. 42–49; Yang, Y.-Y., Chang, J.-S., & Chen, K.-J. 1993), Document automatic classification and ranking, Master thesis, Department of Computer Science, National Tsing Hua University) model and PAT-tree structure (Chien, L.-F., Huang, T.-I., & Chien, M.-C. 1997 Pat-tree-based keyword extraction for Chinese information retrieval, ACM SIGIR’97, Philadelphia, PA, US, pp. 50–59) to retrieve keywords. The approach extracts bigrams from documents and uses the bigrams to construct a PAT-tree to retrieve keywords. The proposed approach can retrieve any type of keywords such as technical keywords and a person’s name. Effectiveness of the proposed approach is demonstrated by comparing how effective are the keywords found by both this approach and the PAT-tree based approach. This comparison reveals that our keyword retrieval approach is as accurate as the PAT-tree based approach, yet our approach is faster and uses less memory. The study then applies genetic algorithms to tune the weight of retrieved keywords. Moreover, several documents obtained from web sites are tested and experimental results are compared with those of other approaches, indicating that the proposed approach is highly promising for applications.
AbstractList Proposes a novel approach to automatically retrieve keywords and then uses genetic algorithms to adapt the keyword weights. Discusses Chinese text retrieval, term frequency rating formulas, vector space models, bigrams, the PAT-tree structure for information retrieval, query vectors, and relevance feedback. (Author/LRW)
This paper proposes a novel approach to automatically retrieve keywords and then uses genetic algorithms to adapt the keyword weights. One of the contributions of the paper is to combine the Bigram (Chen, A., He, J., Xu, L., Gey, F. C., & Meggs, J. 1997. Chinese text retrieval without using a dictionary, ACM SIGIR’97, Philadelphia, PA, USA, pp. 42–49; Yang, Y.-Y., Chang, J.-S., & Chen, K.-J. 1993), Document automatic classification and ranking, Master thesis, Department of Computer Science, National Tsing Hua University) model and PAT-tree structure (Chien, L.-F., Huang, T.-I., & Chien, M.-C. 1997 Pat-tree-based keyword extraction for Chinese information retrieval, ACM SIGIR’97, Philadelphia, PA, US, pp. 50–59) to retrieve keywords. The approach extracts bigrams from documents and uses the bigrams to construct a PAT-tree to retrieve keywords. The proposed approach can retrieve any type of keywords such as technical keywords and a person’s name. Effectiveness of the proposed approach is demonstrated by comparing how effective are the keywords found by both this approach and the PAT-tree based approach. This comparison reveals that our keyword retrieval approach is as accurate as the PAT-tree based approach, yet our approach is faster and uses less memory. The study then applies genetic algorithms to tune the weight of retrieved keywords. Moreover, several documents obtained from web sites are tested and experimental results are compared with those of other approaches, indicating that the proposed approach is highly promising for applications.
This paper proposes a novel approach to automatically retrieve keywords and then uses genetic algorithms to adapt the keyword weights.
Proposes a novel approach to automatically retrieve keywords and uses genetic algorithms to adapt the keyword weights. Combines the bigram model and PAT-tree structure to retrieve keywords. The approach extracts bigrams from documents and uses the bigrams to construct a PAT-tree to retrieve keywords. The proposed approach can retrieve any type of keywords such as technical keywords and a person's name. Effectiveness of the proposed approach is demonstrated by comparing how effective are the keywords found by both this approach and the PAT-tree based approach, but this approach is faster and uses less memory. Applies genetic algorithms to tune the weight of retrieved keywords. Several documents obtained from web sites are tested and experimental results are compared with those of other approaches, indicating that the proposed approach is promising for applications. (Original abstract - amended)
Author Yeh, Ching-Chang
Horng, Jorng-Tzong
Author_xml – sequence: 1
  givenname: Jorng-Tzong
  surname: Horng
  fullname: Horng, Jorng-Tzong
  email: horng@db.csie.ncu.edu.tw
– sequence: 2
  givenname: Ching-Chang
  surname: Yeh
  fullname: Yeh, Ching-Chang
BackLink http://eric.ed.gov/ERICWebPortal/detail?accno=EJ606818$$DView record in ERIC
http://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=1459573$$DView record in Pascal Francis
BookMark eNqFkc1rFTEUxYNU8LX6HygMImIXo8nkawYXUkpblYILu-gu5GXuPG-ZScYkr_D868374C26aTZZ3N-593DOKTnxwQMh7xj9zChTX35TTlUtpOafKD2n5bX1_QuyYK3mteSanZDFEXlFTlN6KIyQrFmQm4t5HjfoV9UKPGR0lR1XIWL-M6Uqh-rvGuKmCnPGCf_ZjMFX6Ks-uPUEPlcRckR4tONr8nKwY4I3h_-M3F1f3V1-r29_3fy4vLitnWAy170d2mXLVQ-97kGIVtrlMIDulFi6peUdVZR1jZSSKyGFstAMurHt0FHaOMvPyMf92jmGYi1lM2FyMI7WQ1gnI7XQraZtAd8_AR_COvpizbBOdJR3WhTowwGyydlxiNY7TGaOONm4MUzIriRWsLd7DCK64_Tqp6KqZdtTX_djF0NKEQbjMO-iytHiaBg125rMriaz7cBQanY1mfuilk_Ux_PP6L4dTJW0HxGiSQ7BO-gxgsumD_jMhv9-dqtR
CODEN IPMADK
CitedBy_id crossref_primary_10_1016_j_ijar_2003_07_010
crossref_primary_10_1023_A_1023293820057
crossref_primary_10_1002_asi_10179
crossref_primary_10_1016_j_amc_2006_07_044
crossref_primary_10_1016_S0305_0548_03_00194_1
crossref_primary_10_1007_s10791_006_1682_6
crossref_primary_10_1016_j_eswa_2008_06_024
crossref_primary_10_1016_S0306_4573_01_00061_9
crossref_primary_10_1186_s13634_016_0324_4
crossref_primary_10_1007_s10710_006_7008_z
crossref_primary_10_1016_j_ipm_2008_09_002
crossref_primary_10_1016_S0306_4573_02_00044_4
crossref_primary_10_1016_S0306_4573_02_00048_1
crossref_primary_10_1002_asi_10119
crossref_primary_10_1109_TEVC_2004_842093
crossref_primary_10_1108_02635570810847608
crossref_primary_10_1109_TCSVT_2021_3070129
crossref_primary_10_1007_s10844_012_0197_4
crossref_primary_10_1016_j_artint_2012_06_006
crossref_primary_10_1016_j_neucom_2013_07_045
crossref_primary_10_3233_IDT_220007
crossref_primary_10_1016_j_datak_2009_10_010
crossref_primary_10_1016_j_ipm_2005_02_006
crossref_primary_10_1016_j_is_2004_04_002
crossref_primary_10_1177_0165551514533771
crossref_primary_10_1007_s40747_021_00450_6
crossref_primary_10_1016_j_ipm_2003_10_003
crossref_primary_10_1108_00220411011052939
crossref_primary_10_1007_s12652_019_01247_9
crossref_primary_10_1177_01655515211018401
crossref_primary_10_1023_A_1011262119636
crossref_primary_10_1109_TEVC_2005_863130
crossref_primary_10_1007_s10462_005_9001_y
crossref_primary_10_1016_j_asoc_2016_04_042
crossref_primary_10_1007_s11042_020_09172_2
crossref_primary_10_1007_s00530_011_0231_3
crossref_primary_10_1002_asi_20009
crossref_primary_10_1504_IJCAT_2009_026605
crossref_primary_10_1016_j_lcats_2012_05_001
Cites_doi 10.1145/258525.258532
10.1145/183422.183425
10.1145/183422.183424
10.1108/eb026526
10.1016/0020-0190(91)90032-D
10.1145/258525.258534
10.1016/0306-4573(88)90021-0
10.1016/0305-0548(93)E0020-T
10.1145/243199.243277
10.1145/258525.258531
10.1145/243199.243270
10.1145/63039.63044
ContentType Journal Article
Copyright 2000 Elsevier Science Ltd
2000 INIST-CNRS
Copyright Pergamon Press Inc. Sep 2000
Copyright_xml – notice: 2000 Elsevier Science Ltd
– notice: 2000 INIST-CNRS
– notice: Copyright Pergamon Press Inc. Sep 2000
DBID AAYXX
CITATION
7SW
BJH
BNH
BNI
BNJ
BNO
ERI
PET
REK
WWN
IQODW
E3H
F2A
DOI 10.1016/S0306-4573(00)00008-X
DatabaseName CrossRef
ERIC
ERIC (Ovid)
ERIC
ERIC
ERIC (Legacy Platform)
ERIC( SilverPlatter )
ERIC
ERIC PlusText (Legacy Platform)
Education Resources Information Center (ERIC)
ERIC
Pascal-Francis
Library & Information Sciences Abstracts (LISA)
Library & Information Science Abstracts (LISA)
DatabaseTitle CrossRef
ERIC
Library and Information Science Abstracts (LISA)
DatabaseTitleList ERIC

Library and Information Science Abstracts (LISA)
Library and Information Science Abstracts (LISA)
DeliveryMethod fulltext_linktorsrc
Discipline Library & Information Science
EISSN 1873-5371
ERIC EJ606818
EndPage 759
ExternalDocumentID 69078992
1459573
EJ606818
10_1016_S0306_4573_00_00008_X
S030645730000008X
Genre Feature
GroupedDBID --K
--M
-~X
.DC
.~1
0B8
0R~
1B1
1RT
1~.
1~5
29I
4.4
41~
457
4G.
5GY
5VS
7-5
71M
77K
8P~
9JN
9JO
AABNK
AACTN
AAEDT
AAEDW
AAFJI
AAIAV
AAIKJ
AAKOC
AALRI
AAOAW
AAQFI
AAQXK
AAXUO
AAYFN
AAYOK
ABBOA
ABFNM
ABFRF
ABJNI
ABMAC
ABMMH
ABPPZ
ABXDB
ABYKQ
ACDAQ
ACGFS
ACHQT
ACNNM
ACRLP
ACZNC
ADBBV
ADEZE
ADJOM
ADMUD
AEBSH
AEFWE
AEKER
AENEX
AFKWA
AFTJW
AGHFR
AGUBO
AGYEJ
AHHHB
AHZHX
AIALX
AIEXJ
AIKHN
AITUG
AJBFU
AJOXV
AKYCK
ALMA_UNASSIGNED_HOLDINGS
AMFUW
AMRAJ
AOMHK
AOUOD
ASPBG
AVARZ
AVWKF
AXJTR
AZFZN
BKOJK
BLXMC
CS3
DU5
EBS
EFJIC
EFLBG
EJD
EO8
EO9
EP2
EP3
FDB
FEDTE
FGOYB
FIRID
FNPLU
FYGXN
G-2
G-Q
GBLVA
GBOLZ
HLZ
HMY
HVGLF
HZ~
H~9
IHE
J1W
KOM
LG9
LPU
LY1
M3Y
M41
MO0
MS~
MVM
N9A
O-L
O9-
OAUVE
OHT
OZT
P-8
P-9
P2P
PC.
PQQKQ
PRBVW
Q38
R2-
RIG
ROL
RPZ
SBC
SDF
SDG
SDP
SDS
SES
SEW
SPC
SPCBC
SSB
SSO
SSS
SSV
SSZ
T5K
TN5
U5U
UHB
UHS
UNMZH
WUQ
XFK
ZMT
~G-
77I
9DU
AATTM
AAXKI
AAYWO
AAYXX
ABWVN
ACLOT
ACRPL
ACVFH
ADCNI
ADMHG
ADNMO
AEIPS
AEUPX
AFJKZ
AFPUW
AGQPQ
AIGII
AIIUN
AKBMS
AKRWK
AKYEP
ANKPU
APXCP
CITATION
EFKBS
~HD
7SW
BJH
BNH
BNI
BNJ
BNO
ERI
PET
REK
WWN
08R
ABPIF
IQODW
E3H
F2A
ID FETCH-LOGICAL-c415t-daf8b836ded7de4485abffe7964bcba39060192555364546ae2f72a8f9002ca3
ISICitedReferencesCount 70
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000087256800003&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 0306-4573
IngestDate Sun Sep 28 04:29:10 EDT 2025
Wed Nov 19 00:35:30 EST 2025
Sun Oct 22 16:06:59 EDT 2023
Tue Dec 02 16:51:09 EST 2025
Tue Nov 18 21:41:07 EST 2025
Sat Nov 29 01:48:31 EST 2025
Fri Feb 23 02:20:09 EST 2024
IsDoiOpenAccess false
IsOpenAccess false
IsPeerReviewed true
IsScholarly true
Issue 5
Keywords Keyword
Information retrieval
PAT-tree
Performance evaluation
Automatic classification
Weighting
Genetic algorithm
Automatic indexing
Feedback regulation
Cluster
Method
Comparative study
Optimization
Language English
License https://www.elsevier.com/tdm/userlicense/1.0
CC BY 4.0
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c415t-daf8b836ded7de4485abffe7964bcba39060192555364546ae2f72a8f9002ca3
Notes SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 14
ObjectType-Article-1
ObjectType-Feature-2
content type line 23
PQID 194903974
PQPubID 46166
PageCount 23
ParticipantIDs proquest_miscellaneous_57478708
proquest_journals_194903974
pascalfrancis_primary_1459573
eric_primary_EJ606818
crossref_citationtrail_10_1016_S0306_4573_00_00008_X
crossref_primary_10_1016_S0306_4573_00_00008_X
elsevier_sciencedirect_doi_10_1016_S0306_4573_00_00008_X
PublicationCentury 2000
PublicationDate 2000-09-01
PublicationDateYYYYMMDD 2000-09-01
PublicationDate_xml – month: 09
  year: 2000
  text: 2000-09-01
  day: 01
PublicationDecade 2000
PublicationPlace Oxford
PublicationPlace_xml – name: Oxford
PublicationTitle Information processing & management
PublicationYear 2000
Publisher Elsevier Ltd
Elsevier Science
Elsevier Science Ltd
Publisher_xml – name: Elsevier Ltd
– name: Elsevier Science
– name: Elsevier Science Ltd
References Yang, Chute (BIB34) 1994; 12
Fung, P., & Wu, D. (1994). Statistical augmentation of a Chinese machine-readable dictionary.
ACM SIGIR’96, Zurich, Switzerland, pp. 298–306
Buckley, Salton (BIB4) 1995
Taipei, Taiwan
Chang, C.-H., & Hsu, C.-C. (1999).
Lochbaum, Streeter (BIB24) 1989; 6
Harman (BIB15) 1992
Buckley, Salton, Allan, Singhal (BIB3) 1994
ACM SIGIR’97, Philadelphia, PA, USA, pp. 42–49
Ph.D. thesis, Department of CSIE, National Taiwan University
Lundquist, Grossman, Frieder (BIB25) 1997
Nie, J.-Y., Brisebois, M., & Ren, X. (1996).
Rocchio (BIB29) 1971
Yang, Y.-Y., Chang, J.-S., & Chen, K.-J. (1993).
Chen, A., He, J., Xu, L., Gey, F. C., & Meggs, J. (1997).
Yang, Korfhage (BIB35) 1993
Zhai, Tong, Milic-Frayling, Evans (BIB37) 1996
Baeza-Yates (BIB2) 1992
ACM SIGIR’96, Zurich, Switzerland, pp. 225–233
Jones (BIB18) 1972; 28
Manber, Baeza-Yates (BIB26) 1991; 37
ACM SIGIR’97, Philadelphia, PA, USA pp. 50–59
Buckley, Singhal, Mitra (BIB5) 1995
Lewis, D. D., Schapire, R. E., Callan, J. P., & Papka, R. (1996).
Gonnet, Baeza-Yates, Snider (BIB13) 1992
Chien, L.-F., Huang, T.-I., & Chien, M.-C. (1997).
Liu (BIB23) 1987; 8
Allan (BIB1) 1996
pp. 33–56
Tate, Smith (BIB33) 1995; 22
Salton, MacGill (BIB31) 1983
ACM SIGIR’97, Philadelphia, PA, USA, pp. 34–41
Kwok, K. L. (1997).
Chang, C.-H., & Hsu, C.-C. (1997). Information searching and exploring agent applying clustering and genetic algorithm.
He, Xu, Chen, Meggs, Gey (BIB17) 1996
Goldberg (BIB12) 1989
Salton, Buckley (BIB32) 1988; 24
Liddy, Paik, Yu (BIB22) 1994; 12
Master thesis, Department of Computer Science, National Tsing Hua University
Gordon (BIB14) 1988; 31
Huang (BIB16) 1997; 59
Allan (10.1016/S0306-4573(00)00008-X_BIB1) 1996
Buckley (10.1016/S0306-4573(00)00008-X_BIB5) 1995
Rocchio (10.1016/S0306-4573(00)00008-X_BIB29) 1971
Salton (10.1016/S0306-4573(00)00008-X_BIB32) 1988; 24
Gordon (10.1016/S0306-4573(00)00008-X_BIB14) 1988; 31
Buckley (10.1016/S0306-4573(00)00008-X_BIB4) 1995
He (10.1016/S0306-4573(00)00008-X_BIB17) 1996
Liu (10.1016/S0306-4573(00)00008-X_BIB23) 1987; 8
10.1016/S0306-4573(00)00008-X_BIB27
Huang (10.1016/S0306-4573(00)00008-X_BIB16) 1997; 59
10.1016/S0306-4573(00)00008-X_BIB21
Harman (10.1016/S0306-4573(00)00008-X_BIB15) 1992
Salton (10.1016/S0306-4573(00)00008-X_BIB31) 1983
Manber (10.1016/S0306-4573(00)00008-X_BIB26) 1991; 37
Lundquist (10.1016/S0306-4573(00)00008-X_BIB25) 1997
Baeza-Yates (10.1016/S0306-4573(00)00008-X_BIB2) 1992
Lochbaum (10.1016/S0306-4573(00)00008-X_BIB24) 1989; 6
Liddy (10.1016/S0306-4573(00)00008-X_BIB22) 1994; 12
10.1016/S0306-4573(00)00008-X_BIB19
Buckley (10.1016/S0306-4573(00)00008-X_BIB3) 1994
Yang (10.1016/S0306-4573(00)00008-X_BIB34) 1994; 12
Gonnet (10.1016/S0306-4573(00)00008-X_BIB13) 1992
Yang (10.1016/S0306-4573(00)00008-X_BIB35) 1993
Goldberg (10.1016/S0306-4573(00)00008-X_BIB12) 1989
10.1016/S0306-4573(00)00008-X_BIB8
10.1016/S0306-4573(00)00008-X_BIB11
Tate (10.1016/S0306-4573(00)00008-X_BIB33) 1995; 22
10.1016/S0306-4573(00)00008-X_BIB36
Zhai (10.1016/S0306-4573(00)00008-X_BIB37) 1996
10.1016/S0306-4573(00)00008-X_BIB7
10.1016/S0306-4573(00)00008-X_BIB10
Jones (10.1016/S0306-4573(00)00008-X_BIB18) 1972; 28
10.1016/S0306-4573(00)00008-X_BIB6
References_xml – start-page: 465
  year: 1992
  end-page: 476
  ident: BIB2
  article-title: Text retrieval: theory and practice
  publication-title: International Federation for Information Processing Congress, Vol. 1, Madrid, Spain
– reference: . ACM SIGIR’96, Zurich, Switzerland, pp. 225–233
– reference: , Taipei, Taiwan
– start-page: 16
  year: 1997
  end-page: 23
  ident: BIB25
  article-title: Improving relevance feedback in the vector space model
  publication-title: Proceedings of the Sixth International Conference on Information and Knowledge Management, Las Vegas, Nevada
– volume: 8
  start-page: 64
  year: 1987
  end-page: 70
  ident: BIB23
  article-title: New advances in computers and natural language processing in China
  publication-title: Information Science
– start-page: 351
  year: 1995
  end-page: 357
  ident: BIB4
  article-title: Optimization of relevance feedback weights
  publication-title: Proceedings of the Eighteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Seattle, Washington, USA
– reference: . Master thesis, Department of Computer Science, National Tsing Hua University
– start-page: 69
  year: 1994
  end-page: 80
  ident: BIB3
  article-title: Automatic query expansion using SMART: TREC 3
  publication-title: Proceedings of the Third Text REtrieval Conference, Gaithersburg, Maryland
– volume: 6
  start-page: 25
  year: 1989
  end-page: 33
  ident: BIB24
  article-title: Comparing and combining the effectiveness of latent semantic indexing and the ordinary vector space model for information retrieval
  publication-title: Journal of Information Science
– reference: . ACM SIGIR’96, Zurich, Switzerland, pp. 298–306
– reference: Lewis, D. D., Schapire, R. E., Callan, J. P., & Papka, R. (1996).
– start-page: 66
  year: 1992
  end-page: 82
  ident: BIB13
  article-title: New indices for text PAT trees and PAT arrays
  publication-title: Information retrieval data structures and algorithms
– volume: 12
  start-page: 278
  year: 1994
  end-page: 295
  ident: BIB22
  article-title: Text categorization for multiple users based on semantic features from a machine-readable dictionary
  publication-title: ACM Transaction on Information Systems
– start-page: 1
  year: 1992
  end-page: 10
  ident: BIB15
  article-title: Relevance feedback revisited
  publication-title: Proceedings of the Fifteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Copenhagen, Denmark
– reference: Chen, A., He, J., Xu, L., Gey, F. C., & Meggs, J. (1997).
– start-page: 25
  year: 1995
  end-page: 48
  ident: BIB5
  article-title: New retrieval approaches using SMART: TREC 4
  publication-title: Proceedings of the Fourth Text REtrieval Conference, Gaithersburg, Maryland
– reference: . ACM SIGIR’97, Philadelphia, PA, USA, pp. 42–49
– volume: 59
  start-page: 109
  year: 1997
  end-page: 126
  ident: BIB16
  article-title: The development of indexing system evaluation — theory and practice
  publication-title: Chinese Library Society Report
– year: 1989
  ident: BIB12
  publication-title: Genetic algorithms in search, optimization and machine learning
– volume: 31
  start-page: 1208
  year: 1988
  end-page: 1218
  ident: BIB14
  article-title: Probabilistic and genetic algorithms in document retrieval
  publication-title: Communications of the ACM
– reference: , pp. 33–56
– volume: 28
  start-page: 11
  year: 1972
  end-page: 20
  ident: BIB18
  article-title: A statistical interpretation of term specificity and its application in retrieval
  publication-title: J. Documentation
– start-page: 603
  year: 1993
  end-page: 611
  ident: BIB35
  article-title: Query optimization in information retrieval using genetic algorithms
  publication-title: Proceedings of the Fifth International Conference on Genetic Algorithms, Urbana, IL
– volume: 24
  start-page: 513
  year: 1988
  end-page: 523
  ident: BIB32
  article-title: Term-weighting approaches in automatic text retrieval
  publication-title: Information Processing and Management
– reference: Yang, Y.-Y., Chang, J.-S., & Chen, K.-J. (1993).
– start-page: 191
  year: 1996
  end-page: 198
  ident: BIB17
  article-title: Berkeley Chinese information retrieval at TREC-5: technical report
  publication-title: Proceeding of the Fifth Text REtrieval Conference, Gaithersburg, Maryland
– volume: 37
  start-page: 133
  year: 1991
  end-page: 136
  ident: BIB26
  article-title: An algorithm for string matching with a sequence of don’t cares
  publication-title: Information Processing Letters
– start-page: 335
  year: 1996
  end-page: 340
  ident: BIB37
  article-title: Experiments on Chinese text indexing — CLARIT TREC-5 Chinese track report
  publication-title: Proceedings of the Fifth Text Retrieval Conference, Gaithersburg, Maryland
– reference: Nie, J.-Y., Brisebois, M., & Ren, X. (1996).
– volume: 12
  start-page: 252
  year: 1994
  end-page: 277
  ident: BIB34
  article-title: An example-based mapping method for text categorization and retrieval
  publication-title: ACM Transactions on Information Systems
– start-page: 313
  year: 1971
  end-page: 323
  ident: BIB29
  article-title: Relevance feedback in information retrieval
  publication-title: The SMART retrieval system: Experiments in automatic document processing
– reference: Kwok, K. L. (1997).
– reference: . Ph.D. thesis, Department of CSIE, National Taiwan University
– reference: Chien, L.-F., Huang, T.-I., & Chien, M.-C. (1997).
– reference: . ACM SIGIR’97, Philadelphia, PA, USA, pp. 34–41
– reference: Chang, C.-H., & Hsu, C.-C. (1997). Information searching and exploring agent applying clustering and genetic algorithm.
– reference: . ACM SIGIR’97, Philadelphia, PA, USA pp. 50–59
– volume: 22
  start-page: 73
  year: 1995
  end-page: 83
  ident: BIB33
  article-title: A genetic approach to the quardatic assignment problem
  publication-title: Computers and Operations Research
– year: 1983
  ident: BIB31
  publication-title: Introduciton to modern information retrieval
– reference: Fung, P., & Wu, D. (1994). Statistical augmentation of a Chinese machine-readable dictionary.
– reference: Chang, C.-H., & Hsu, C.-C. (1999).
– start-page: 270
  year: 1996
  end-page: 278
  ident: BIB1
  article-title: Incremental relevance feedback for information filtering
  publication-title: Proceedings of the Nineteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Zurich, Switzerland
– ident: 10.1016/S0306-4573(00)00008-X_BIB36
– ident: 10.1016/S0306-4573(00)00008-X_BIB11
– ident: 10.1016/S0306-4573(00)00008-X_BIB8
  doi: 10.1145/258525.258532
– volume: 12
  start-page: 278
  issue: 3
  year: 1994
  ident: 10.1016/S0306-4573(00)00008-X_BIB22
  article-title: Text categorization for multiple users based on semantic features from a machine-readable dictionary
  publication-title: ACM Transaction on Information Systems
  doi: 10.1145/183422.183425
– start-page: 465
  year: 1992
  ident: 10.1016/S0306-4573(00)00008-X_BIB2
  article-title: Text retrieval: theory and practice
– start-page: 16
  year: 1997
  ident: 10.1016/S0306-4573(00)00008-X_BIB25
  article-title: Improving relevance feedback in the vector space model
– start-page: 351
  year: 1995
  ident: 10.1016/S0306-4573(00)00008-X_BIB4
  article-title: Optimization of relevance feedback weights
– volume: 12
  start-page: 252
  issue: 3
  year: 1994
  ident: 10.1016/S0306-4573(00)00008-X_BIB34
  article-title: An example-based mapping method for text categorization and retrieval
  publication-title: ACM Transactions on Information Systems
  doi: 10.1145/183422.183424
– ident: 10.1016/S0306-4573(00)00008-X_BIB7
– volume: 28
  start-page: 11
  issue: 1
  year: 1972
  ident: 10.1016/S0306-4573(00)00008-X_BIB18
  article-title: A statistical interpretation of term specificity and its application in retrieval
  publication-title: J. Documentation
  doi: 10.1108/eb026526
– start-page: 603
  year: 1993
  ident: 10.1016/S0306-4573(00)00008-X_BIB35
  article-title: Query optimization in information retrieval using genetic algorithms
– start-page: 1
  year: 1992
  ident: 10.1016/S0306-4573(00)00008-X_BIB15
  article-title: Relevance feedback revisited
– volume: 37
  start-page: 133
  year: 1991
  ident: 10.1016/S0306-4573(00)00008-X_BIB26
  article-title: An algorithm for string matching with a sequence of don’t cares
  publication-title: Information Processing Letters
  doi: 10.1016/0020-0190(91)90032-D
– year: 1989
  ident: 10.1016/S0306-4573(00)00008-X_BIB12
– volume: 8
  start-page: 64
  year: 1987
  ident: 10.1016/S0306-4573(00)00008-X_BIB23
  article-title: New advances in computers and natural language processing in China
  publication-title: Information Science
– ident: 10.1016/S0306-4573(00)00008-X_BIB10
  doi: 10.1145/258525.258534
– volume: 59
  start-page: 109
  year: 1997
  ident: 10.1016/S0306-4573(00)00008-X_BIB16
  article-title: The development of indexing system evaluation — theory and practice
  publication-title: Chinese Library Society Report
– start-page: 191
  year: 1996
  ident: 10.1016/S0306-4573(00)00008-X_BIB17
  article-title: Berkeley Chinese information retrieval at TREC-5: technical report
– start-page: 335
  year: 1996
  ident: 10.1016/S0306-4573(00)00008-X_BIB37
  article-title: Experiments on Chinese text indexing — CLARIT TREC-5 Chinese track report
– start-page: 25
  year: 1995
  ident: 10.1016/S0306-4573(00)00008-X_BIB5
  article-title: New retrieval approaches using SMART: TREC 4
– volume: 24
  start-page: 513
  issue: 5
  year: 1988
  ident: 10.1016/S0306-4573(00)00008-X_BIB32
  article-title: Term-weighting approaches in automatic text retrieval
  publication-title: Information Processing and Management
  doi: 10.1016/0306-4573(88)90021-0
– volume: 22
  start-page: 73
  issue: 1
  year: 1995
  ident: 10.1016/S0306-4573(00)00008-X_BIB33
  article-title: A genetic approach to the quardatic assignment problem
  publication-title: Computers and Operations Research
  doi: 10.1016/0305-0548(93)E0020-T
– start-page: 270
  year: 1996
  ident: 10.1016/S0306-4573(00)00008-X_BIB1
  article-title: Incremental relevance feedback for information filtering
– start-page: 66
  year: 1992
  ident: 10.1016/S0306-4573(00)00008-X_BIB13
  article-title: New indices for text PAT trees and PAT arrays
– ident: 10.1016/S0306-4573(00)00008-X_BIB21
  doi: 10.1145/243199.243277
– volume: 6
  start-page: 25
  year: 1989
  ident: 10.1016/S0306-4573(00)00008-X_BIB24
  article-title: Comparing and combining the effectiveness of latent semantic indexing and the ordinary vector space model for information retrieval
  publication-title: Journal of Information Science
– start-page: 313
  year: 1971
  ident: 10.1016/S0306-4573(00)00008-X_BIB29
  article-title: Relevance feedback in information retrieval
– ident: 10.1016/S0306-4573(00)00008-X_BIB19
  doi: 10.1145/258525.258531
– ident: 10.1016/S0306-4573(00)00008-X_BIB27
  doi: 10.1145/243199.243270
– year: 1983
  ident: 10.1016/S0306-4573(00)00008-X_BIB31
– start-page: 69
  year: 1994
  ident: 10.1016/S0306-4573(00)00008-X_BIB3
  article-title: Automatic query expansion using SMART: TREC 3
– ident: 10.1016/S0306-4573(00)00008-X_BIB6
– volume: 31
  start-page: 1208
  year: 1988
  ident: 10.1016/S0306-4573(00)00008-X_BIB14
  article-title: Probabilistic and genetic algorithms in document retrieval
  publication-title: Communications of the ACM
  doi: 10.1145/63039.63044
SSID ssj0004512
Score 1.90022
Snippet This paper proposes a novel approach to automatically retrieve keywords and then uses genetic algorithms to adapt the keyword weights. One of the contributions...
Proposes a novel approach to automatically retrieve keywords and then uses genetic algorithms to adapt the keyword weights. Discusses Chinese text retrieval,...
This paper proposes a novel approach to automatically retrieve keywords and then uses genetic algorithms to adapt the keyword weights.
Proposes a novel approach to automatically retrieve keywords and uses genetic algorithms to adapt the keyword weights. Combines the bigram model and PAT-tree...
SourceID proquest
pascalfrancis
eric
crossref
elsevier
SourceType Aggregation Database
Index Database
Enrichment Source
Publisher
StartPage 737
SubjectTerms Algorithms
Bigram Strategy
Chinese
Content analysis
Exact sciences and technology
Genetic algorithms
Genetics
Indexing. Classification. Abstracting. Syntheses
Information and communication sciences
Information and document structure and analysis
Information processing and retrieval
Information Retrieval
Information science. Documentation
Keyword
Keywords
Mathematical Formulas
Online information retrieval
Optimization
PAT-tree
Query Processing
Relevance (Information Retrieval)
Retrieval
Sciences and techniques of general use
Searching
Studies
Vector Spaces
Weighted Term Searching
Weighting
Title Applying genetic algorithms to query optimization in document retrieval
URI https://dx.doi.org/10.1016/S0306-4573(00)00008-X
http://eric.ed.gov/ERICWebPortal/detail?accno=EJ606818
https://www.proquest.com/docview/194903974
https://www.proquest.com/docview/57478708
Volume 36
WOSCitedRecordID wos000087256800003&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVESC
  databaseName: Elsevier SD Freedom Collection Journals 2021
  customDbUrl:
  eissn: 1873-5371
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0004512
  issn: 0306-4573
  databaseCode: AIEXJ
  dateStart: 19950101
  isFulltext: true
  titleUrlDefault: https://www.sciencedirect.com
  providerName: Elsevier
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1bb9MwFLbKxgMSQlyGKKPgB4ZAyCNN7CR-nEbLmKaCRJDCk5WkDivqktKbJn4Iv5fjW9JqgsEDL1GUxBf5fDk-PleEnsMexGQhKclhAyA0LymJWZaRQPqFlwc5HUeFLjYRjUZxmvKPnc5PFwuznkZVFV9e8tl_JTU8A2Kr0Nl_IHfTKTyAeyA6XIHscP0rwiu5UscuwRdSp2Odfq3nk-X5hU7mAPuAsqoDp7iwIZjaIbYuVtotYK4rbK3tIN-cn3sT4_h6ZiILjIYhtN6vm-4zJ_XcefnCDUl-1HZz1E4E0I4cOyX1F3m-pXXwGrcqqwpz4TCOA7U-SDoUywsJZaZGyaE0nDWOAsICU2_FsV6T-8RCjG3w0chkgrnC342q4VMzAEjhqpg116IvSdtNzRnyRx_E8PPZmUgGabL9Vu_hSjcAJ07_IBjOvhNVi0zZ7A-CtwYXN9CuHzEODH_36P0gPd3IQ9-39ikzjzY27E07uZee98pO7HdSj_Wxvz3LFvBLlqaWyhWxQMs6yV10xx5S8JEB1z3UkdV91LMhLvgF3sADtqR5gN454GELPNwCDy9rrIGHN4GHJxV2wMMN8PZQMhwkxyfElukgBUh_SzLOyjiPg3Asx9FYwnGfZXlZShXjnBd5FnCV8ofD0ZUpkzcNM-mXkZ_FJVDNL7LgIdqp6ko-QjiWgWQU-vI9SmVY8H5ZKoGfUxrKLMq7iLpVFIVNYa8qqUxF66sIiy_U4gtP5b1VNVbTLjpsms1MDpfrGsSORMIKokbAFIDD65ruKZI24wxOQy8EobiLels0bidCGYcuumjf0VxYZrIQfU65BwcG2kXPmrfA_pVNL6tkvVoIpupfRF78-I_t99Gt9id-gnaW85XsoZvFejlZzJ9acP8Cms3KmQ
linkProvider Elsevier
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Applying+genetic+algorithms+to+query+optimization+in+document+retrieval&rft.jtitle=Information+processing+%26+management&rft.au=Horng%2C+Jorng-Tzong&rft.au=Ching-Chang%2C+Yeh&rft.date=2000-09-01&rft.pub=Elsevier+Science+Ltd&rft.issn=0306-4573&rft.eissn=1873-5371&rft.volume=36&rft.issue=5&rft.spage=737&rft_id=info:doi/10.1016%2FS0306-4573%2800%2900008-X&rft.externalDBID=NO_FULL_TEXT&rft.externalDocID=69078992
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0306-4573&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0306-4573&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0306-4573&client=summon