Applying genetic algorithms to query optimization in document retrieval
This paper proposes a novel approach to automatically retrieve keywords and then uses genetic algorithms to adapt the keyword weights. One of the contributions of the paper is to combine the Bigram (Chen, A., He, J., Xu, L., Gey, F. C., & Meggs, J. 1997. Chinese text retrieval without using a di...
Saved in:
| Published in: | Information processing & management Vol. 36; no. 5; pp. 737 - 759 |
|---|---|
| Main Authors: | , |
| Format: | Journal Article |
| Language: | English |
| Published: |
Oxford
Elsevier Ltd
01.09.2000
Elsevier Science Elsevier Science Ltd |
| Subjects: | |
| ISSN: | 0306-4573, 1873-5371 |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Abstract | This paper proposes a novel approach to automatically retrieve keywords and then uses genetic algorithms to adapt the keyword weights. One of the contributions of the paper is to combine the Bigram (Chen, A., He, J., Xu, L., Gey, F. C., & Meggs, J. 1997.
Chinese text retrieval without using a dictionary, ACM SIGIR’97, Philadelphia, PA, USA, pp. 42–49; Yang, Y.-Y., Chang, J.-S., & Chen, K.-J. 1993),
Document automatic classification and ranking, Master thesis, Department of Computer Science, National Tsing Hua University) model and PAT-tree structure (Chien, L.-F., Huang, T.-I., & Chien, M.-C. 1997
Pat-tree-based keyword extraction for Chinese information retrieval, ACM SIGIR’97, Philadelphia, PA, US, pp. 50–59) to retrieve keywords. The approach extracts bigrams from documents and uses the bigrams to construct a PAT-tree to retrieve keywords. The proposed approach can retrieve any type of keywords such as technical keywords and a person’s name. Effectiveness of the proposed approach is demonstrated by comparing how effective are the keywords found by both this approach and the PAT-tree based approach. This comparison reveals that our keyword retrieval approach is as accurate as the PAT-tree based approach, yet our approach is faster and uses less memory. The study then applies genetic algorithms to tune the weight of retrieved keywords. Moreover, several documents obtained from web sites are tested and experimental results are compared with those of other approaches, indicating that the proposed approach is highly promising for applications. |
|---|---|
| AbstractList | Proposes a novel approach to automatically retrieve keywords and then uses genetic algorithms to adapt the keyword weights. Discusses Chinese text retrieval, term frequency rating formulas, vector space models, bigrams, the PAT-tree structure for information retrieval, query vectors, and relevance feedback. (Author/LRW) This paper proposes a novel approach to automatically retrieve keywords and then uses genetic algorithms to adapt the keyword weights. One of the contributions of the paper is to combine the Bigram (Chen, A., He, J., Xu, L., Gey, F. C., & Meggs, J. 1997. Chinese text retrieval without using a dictionary, ACM SIGIR’97, Philadelphia, PA, USA, pp. 42–49; Yang, Y.-Y., Chang, J.-S., & Chen, K.-J. 1993), Document automatic classification and ranking, Master thesis, Department of Computer Science, National Tsing Hua University) model and PAT-tree structure (Chien, L.-F., Huang, T.-I., & Chien, M.-C. 1997 Pat-tree-based keyword extraction for Chinese information retrieval, ACM SIGIR’97, Philadelphia, PA, US, pp. 50–59) to retrieve keywords. The approach extracts bigrams from documents and uses the bigrams to construct a PAT-tree to retrieve keywords. The proposed approach can retrieve any type of keywords such as technical keywords and a person’s name. Effectiveness of the proposed approach is demonstrated by comparing how effective are the keywords found by both this approach and the PAT-tree based approach. This comparison reveals that our keyword retrieval approach is as accurate as the PAT-tree based approach, yet our approach is faster and uses less memory. The study then applies genetic algorithms to tune the weight of retrieved keywords. Moreover, several documents obtained from web sites are tested and experimental results are compared with those of other approaches, indicating that the proposed approach is highly promising for applications. This paper proposes a novel approach to automatically retrieve keywords and then uses genetic algorithms to adapt the keyword weights. Proposes a novel approach to automatically retrieve keywords and uses genetic algorithms to adapt the keyword weights. Combines the bigram model and PAT-tree structure to retrieve keywords. The approach extracts bigrams from documents and uses the bigrams to construct a PAT-tree to retrieve keywords. The proposed approach can retrieve any type of keywords such as technical keywords and a person's name. Effectiveness of the proposed approach is demonstrated by comparing how effective are the keywords found by both this approach and the PAT-tree based approach, but this approach is faster and uses less memory. Applies genetic algorithms to tune the weight of retrieved keywords. Several documents obtained from web sites are tested and experimental results are compared with those of other approaches, indicating that the proposed approach is promising for applications. (Original abstract - amended) |
| Author | Yeh, Ching-Chang Horng, Jorng-Tzong |
| Author_xml | – sequence: 1 givenname: Jorng-Tzong surname: Horng fullname: Horng, Jorng-Tzong email: horng@db.csie.ncu.edu.tw – sequence: 2 givenname: Ching-Chang surname: Yeh fullname: Yeh, Ching-Chang |
| BackLink | http://eric.ed.gov/ERICWebPortal/detail?accno=EJ606818$$DView record in ERIC http://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=1459573$$DView record in Pascal Francis |
| BookMark | eNqFkc1rFTEUxYNU8LX6HygMImIXo8nkawYXUkpblYILu-gu5GXuPG-ZScYkr_D868374C26aTZZ3N-593DOKTnxwQMh7xj9zChTX35TTlUtpOafKD2n5bX1_QuyYK3mteSanZDFEXlFTlN6KIyQrFmQm4t5HjfoV9UKPGR0lR1XIWL-M6Uqh-rvGuKmCnPGCf_ZjMFX6Ks-uPUEPlcRckR4tONr8nKwY4I3h_-M3F1f3V1-r29_3fy4vLitnWAy170d2mXLVQ-97kGIVtrlMIDulFi6peUdVZR1jZSSKyGFstAMurHt0FHaOMvPyMf92jmGYi1lM2FyMI7WQ1gnI7XQraZtAd8_AR_COvpizbBOdJR3WhTowwGyydlxiNY7TGaOONm4MUzIriRWsLd7DCK64_Tqp6KqZdtTX_djF0NKEQbjMO-iytHiaBg125rMriaz7cBQanY1mfuilk_Ux_PP6L4dTJW0HxGiSQ7BO-gxgsumD_jMhv9-dqtR |
| CODEN | IPMADK |
| CitedBy_id | crossref_primary_10_1016_j_ijar_2003_07_010 crossref_primary_10_1023_A_1023293820057 crossref_primary_10_1002_asi_10179 crossref_primary_10_1016_j_amc_2006_07_044 crossref_primary_10_1016_S0305_0548_03_00194_1 crossref_primary_10_1007_s10791_006_1682_6 crossref_primary_10_1016_j_eswa_2008_06_024 crossref_primary_10_1016_S0306_4573_01_00061_9 crossref_primary_10_1186_s13634_016_0324_4 crossref_primary_10_1007_s10710_006_7008_z crossref_primary_10_1016_j_ipm_2008_09_002 crossref_primary_10_1016_S0306_4573_02_00044_4 crossref_primary_10_1016_S0306_4573_02_00048_1 crossref_primary_10_1002_asi_10119 crossref_primary_10_1109_TEVC_2004_842093 crossref_primary_10_1108_02635570810847608 crossref_primary_10_1109_TCSVT_2021_3070129 crossref_primary_10_1007_s10844_012_0197_4 crossref_primary_10_1016_j_artint_2012_06_006 crossref_primary_10_1016_j_neucom_2013_07_045 crossref_primary_10_3233_IDT_220007 crossref_primary_10_1016_j_datak_2009_10_010 crossref_primary_10_1016_j_ipm_2005_02_006 crossref_primary_10_1016_j_is_2004_04_002 crossref_primary_10_1177_0165551514533771 crossref_primary_10_1007_s40747_021_00450_6 crossref_primary_10_1016_j_ipm_2003_10_003 crossref_primary_10_1108_00220411011052939 crossref_primary_10_1007_s12652_019_01247_9 crossref_primary_10_1177_01655515211018401 crossref_primary_10_1023_A_1011262119636 crossref_primary_10_1109_TEVC_2005_863130 crossref_primary_10_1007_s10462_005_9001_y crossref_primary_10_1016_j_asoc_2016_04_042 crossref_primary_10_1007_s11042_020_09172_2 crossref_primary_10_1007_s00530_011_0231_3 crossref_primary_10_1002_asi_20009 crossref_primary_10_1504_IJCAT_2009_026605 crossref_primary_10_1016_j_lcats_2012_05_001 |
| Cites_doi | 10.1145/258525.258532 10.1145/183422.183425 10.1145/183422.183424 10.1108/eb026526 10.1016/0020-0190(91)90032-D 10.1145/258525.258534 10.1016/0306-4573(88)90021-0 10.1016/0305-0548(93)E0020-T 10.1145/243199.243277 10.1145/258525.258531 10.1145/243199.243270 10.1145/63039.63044 |
| ContentType | Journal Article |
| Copyright | 2000 Elsevier Science Ltd 2000 INIST-CNRS Copyright Pergamon Press Inc. Sep 2000 |
| Copyright_xml | – notice: 2000 Elsevier Science Ltd – notice: 2000 INIST-CNRS – notice: Copyright Pergamon Press Inc. Sep 2000 |
| DBID | AAYXX CITATION 7SW BJH BNH BNI BNJ BNO ERI PET REK WWN IQODW E3H F2A |
| DOI | 10.1016/S0306-4573(00)00008-X |
| DatabaseName | CrossRef ERIC ERIC (Ovid) ERIC ERIC ERIC (Legacy Platform) ERIC( SilverPlatter ) ERIC ERIC PlusText (Legacy Platform) Education Resources Information Center (ERIC) ERIC Pascal-Francis Library & Information Sciences Abstracts (LISA) Library & Information Science Abstracts (LISA) |
| DatabaseTitle | CrossRef ERIC Library and Information Science Abstracts (LISA) |
| DatabaseTitleList | ERIC Library and Information Science Abstracts (LISA) Library and Information Science Abstracts (LISA) |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Library & Information Science |
| EISSN | 1873-5371 |
| ERIC | EJ606818 |
| EndPage | 759 |
| ExternalDocumentID | 69078992 1459573 EJ606818 10_1016_S0306_4573_00_00008_X S030645730000008X |
| Genre | Feature |
| GroupedDBID | --K --M -~X .DC .~1 0B8 0R~ 1B1 1RT 1~. 1~5 29I 4.4 41~ 457 4G. 5GY 5VS 7-5 71M 77K 8P~ 9JN 9JO AABNK AACTN AAEDT AAEDW AAFJI AAIAV AAIKJ AAKOC AALRI AAOAW AAQFI AAQXK AAXUO AAYFN AAYOK ABBOA ABFNM ABFRF ABJNI ABMAC ABMMH ABPPZ ABXDB ABYKQ ACDAQ ACGFS ACHQT ACNNM ACRLP ACZNC ADBBV ADEZE ADJOM ADMUD AEBSH AEFWE AEKER AENEX AFKWA AFTJW AGHFR AGUBO AGYEJ AHHHB AHZHX AIALX AIEXJ AIKHN AITUG AJBFU AJOXV AKYCK ALMA_UNASSIGNED_HOLDINGS AMFUW AMRAJ AOMHK AOUOD ASPBG AVARZ AVWKF AXJTR AZFZN BKOJK BLXMC CS3 DU5 EBS EFJIC EFLBG EJD EO8 EO9 EP2 EP3 FDB FEDTE FGOYB FIRID FNPLU FYGXN G-2 G-Q GBLVA GBOLZ HLZ HMY HVGLF HZ~ H~9 IHE J1W KOM LG9 LPU LY1 M3Y M41 MO0 MS~ MVM N9A O-L O9- OAUVE OHT OZT P-8 P-9 P2P PC. PQQKQ PRBVW Q38 R2- RIG ROL RPZ SBC SDF SDG SDP SDS SES SEW SPC SPCBC SSB SSO SSS SSV SSZ T5K TN5 U5U UHB UHS UNMZH WUQ XFK ZMT ~G- 77I 9DU AATTM AAXKI AAYWO AAYXX ABWVN ACLOT ACRPL ACVFH ADCNI ADMHG ADNMO AEIPS AEUPX AFJKZ AFPUW AGQPQ AIGII AIIUN AKBMS AKRWK AKYEP ANKPU APXCP CITATION EFKBS ~HD 7SW BJH BNH BNI BNJ BNO ERI PET REK WWN 08R ABPIF IQODW E3H F2A |
| ID | FETCH-LOGICAL-c415t-daf8b836ded7de4485abffe7964bcba39060192555364546ae2f72a8f9002ca3 |
| ISICitedReferencesCount | 70 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000087256800003&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| ISSN | 0306-4573 |
| IngestDate | Sun Sep 28 04:29:10 EDT 2025 Wed Nov 19 00:35:30 EST 2025 Sun Oct 22 16:06:59 EDT 2023 Tue Dec 02 16:51:09 EST 2025 Tue Nov 18 21:41:07 EST 2025 Sat Nov 29 01:48:31 EST 2025 Fri Feb 23 02:20:09 EST 2024 |
| IsDoiOpenAccess | false |
| IsOpenAccess | false |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 5 |
| Keywords | Keyword Information retrieval PAT-tree Performance evaluation Automatic classification Weighting Genetic algorithm Automatic indexing Feedback regulation Cluster Method Comparative study Optimization |
| Language | English |
| License | https://www.elsevier.com/tdm/userlicense/1.0 CC BY 4.0 |
| LinkModel | OpenURL |
| MergedId | FETCHMERGED-LOGICAL-c415t-daf8b836ded7de4485abffe7964bcba39060192555364546ae2f72a8f9002ca3 |
| Notes | SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 14 ObjectType-Article-1 ObjectType-Feature-2 content type line 23 |
| PQID | 194903974 |
| PQPubID | 46166 |
| PageCount | 23 |
| ParticipantIDs | proquest_miscellaneous_57478708 proquest_journals_194903974 pascalfrancis_primary_1459573 eric_primary_EJ606818 crossref_citationtrail_10_1016_S0306_4573_00_00008_X crossref_primary_10_1016_S0306_4573_00_00008_X elsevier_sciencedirect_doi_10_1016_S0306_4573_00_00008_X |
| PublicationCentury | 2000 |
| PublicationDate | 2000-09-01 |
| PublicationDateYYYYMMDD | 2000-09-01 |
| PublicationDate_xml | – month: 09 year: 2000 text: 2000-09-01 day: 01 |
| PublicationDecade | 2000 |
| PublicationPlace | Oxford |
| PublicationPlace_xml | – name: Oxford |
| PublicationTitle | Information processing & management |
| PublicationYear | 2000 |
| Publisher | Elsevier Ltd Elsevier Science Elsevier Science Ltd |
| Publisher_xml | – name: Elsevier Ltd – name: Elsevier Science – name: Elsevier Science Ltd |
| References | Yang, Chute (BIB34) 1994; 12 Fung, P., & Wu, D. (1994). Statistical augmentation of a Chinese machine-readable dictionary. ACM SIGIR’96, Zurich, Switzerland, pp. 298–306 Buckley, Salton (BIB4) 1995 Taipei, Taiwan Chang, C.-H., & Hsu, C.-C. (1999). Lochbaum, Streeter (BIB24) 1989; 6 Harman (BIB15) 1992 Buckley, Salton, Allan, Singhal (BIB3) 1994 ACM SIGIR’97, Philadelphia, PA, USA, pp. 42–49 Ph.D. thesis, Department of CSIE, National Taiwan University Lundquist, Grossman, Frieder (BIB25) 1997 Nie, J.-Y., Brisebois, M., & Ren, X. (1996). Rocchio (BIB29) 1971 Yang, Y.-Y., Chang, J.-S., & Chen, K.-J. (1993). Chen, A., He, J., Xu, L., Gey, F. C., & Meggs, J. (1997). Yang, Korfhage (BIB35) 1993 Zhai, Tong, Milic-Frayling, Evans (BIB37) 1996 Baeza-Yates (BIB2) 1992 ACM SIGIR’96, Zurich, Switzerland, pp. 225–233 Jones (BIB18) 1972; 28 Manber, Baeza-Yates (BIB26) 1991; 37 ACM SIGIR’97, Philadelphia, PA, USA pp. 50–59 Buckley, Singhal, Mitra (BIB5) 1995 Lewis, D. D., Schapire, R. E., Callan, J. P., & Papka, R. (1996). Gonnet, Baeza-Yates, Snider (BIB13) 1992 Chien, L.-F., Huang, T.-I., & Chien, M.-C. (1997). Liu (BIB23) 1987; 8 Allan (BIB1) 1996 pp. 33–56 Tate, Smith (BIB33) 1995; 22 Salton, MacGill (BIB31) 1983 ACM SIGIR’97, Philadelphia, PA, USA, pp. 34–41 Kwok, K. L. (1997). Chang, C.-H., & Hsu, C.-C. (1997). Information searching and exploring agent applying clustering and genetic algorithm. He, Xu, Chen, Meggs, Gey (BIB17) 1996 Goldberg (BIB12) 1989 Salton, Buckley (BIB32) 1988; 24 Liddy, Paik, Yu (BIB22) 1994; 12 Master thesis, Department of Computer Science, National Tsing Hua University Gordon (BIB14) 1988; 31 Huang (BIB16) 1997; 59 Allan (10.1016/S0306-4573(00)00008-X_BIB1) 1996 Buckley (10.1016/S0306-4573(00)00008-X_BIB5) 1995 Rocchio (10.1016/S0306-4573(00)00008-X_BIB29) 1971 Salton (10.1016/S0306-4573(00)00008-X_BIB32) 1988; 24 Gordon (10.1016/S0306-4573(00)00008-X_BIB14) 1988; 31 Buckley (10.1016/S0306-4573(00)00008-X_BIB4) 1995 He (10.1016/S0306-4573(00)00008-X_BIB17) 1996 Liu (10.1016/S0306-4573(00)00008-X_BIB23) 1987; 8 10.1016/S0306-4573(00)00008-X_BIB27 Huang (10.1016/S0306-4573(00)00008-X_BIB16) 1997; 59 10.1016/S0306-4573(00)00008-X_BIB21 Harman (10.1016/S0306-4573(00)00008-X_BIB15) 1992 Salton (10.1016/S0306-4573(00)00008-X_BIB31) 1983 Manber (10.1016/S0306-4573(00)00008-X_BIB26) 1991; 37 Lundquist (10.1016/S0306-4573(00)00008-X_BIB25) 1997 Baeza-Yates (10.1016/S0306-4573(00)00008-X_BIB2) 1992 Lochbaum (10.1016/S0306-4573(00)00008-X_BIB24) 1989; 6 Liddy (10.1016/S0306-4573(00)00008-X_BIB22) 1994; 12 10.1016/S0306-4573(00)00008-X_BIB19 Buckley (10.1016/S0306-4573(00)00008-X_BIB3) 1994 Yang (10.1016/S0306-4573(00)00008-X_BIB34) 1994; 12 Gonnet (10.1016/S0306-4573(00)00008-X_BIB13) 1992 Yang (10.1016/S0306-4573(00)00008-X_BIB35) 1993 Goldberg (10.1016/S0306-4573(00)00008-X_BIB12) 1989 10.1016/S0306-4573(00)00008-X_BIB8 10.1016/S0306-4573(00)00008-X_BIB11 Tate (10.1016/S0306-4573(00)00008-X_BIB33) 1995; 22 10.1016/S0306-4573(00)00008-X_BIB36 Zhai (10.1016/S0306-4573(00)00008-X_BIB37) 1996 10.1016/S0306-4573(00)00008-X_BIB7 10.1016/S0306-4573(00)00008-X_BIB10 Jones (10.1016/S0306-4573(00)00008-X_BIB18) 1972; 28 10.1016/S0306-4573(00)00008-X_BIB6 |
| References_xml | – start-page: 465 year: 1992 end-page: 476 ident: BIB2 article-title: Text retrieval: theory and practice publication-title: International Federation for Information Processing Congress, Vol. 1, Madrid, Spain – reference: . ACM SIGIR’96, Zurich, Switzerland, pp. 225–233 – reference: , Taipei, Taiwan – start-page: 16 year: 1997 end-page: 23 ident: BIB25 article-title: Improving relevance feedback in the vector space model publication-title: Proceedings of the Sixth International Conference on Information and Knowledge Management, Las Vegas, Nevada – volume: 8 start-page: 64 year: 1987 end-page: 70 ident: BIB23 article-title: New advances in computers and natural language processing in China publication-title: Information Science – start-page: 351 year: 1995 end-page: 357 ident: BIB4 article-title: Optimization of relevance feedback weights publication-title: Proceedings of the Eighteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Seattle, Washington, USA – reference: . Master thesis, Department of Computer Science, National Tsing Hua University – start-page: 69 year: 1994 end-page: 80 ident: BIB3 article-title: Automatic query expansion using SMART: TREC 3 publication-title: Proceedings of the Third Text REtrieval Conference, Gaithersburg, Maryland – volume: 6 start-page: 25 year: 1989 end-page: 33 ident: BIB24 article-title: Comparing and combining the effectiveness of latent semantic indexing and the ordinary vector space model for information retrieval publication-title: Journal of Information Science – reference: . ACM SIGIR’96, Zurich, Switzerland, pp. 298–306 – reference: Lewis, D. D., Schapire, R. E., Callan, J. P., & Papka, R. (1996). – start-page: 66 year: 1992 end-page: 82 ident: BIB13 article-title: New indices for text PAT trees and PAT arrays publication-title: Information retrieval data structures and algorithms – volume: 12 start-page: 278 year: 1994 end-page: 295 ident: BIB22 article-title: Text categorization for multiple users based on semantic features from a machine-readable dictionary publication-title: ACM Transaction on Information Systems – start-page: 1 year: 1992 end-page: 10 ident: BIB15 article-title: Relevance feedback revisited publication-title: Proceedings of the Fifteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Copenhagen, Denmark – reference: Chen, A., He, J., Xu, L., Gey, F. C., & Meggs, J. (1997). – start-page: 25 year: 1995 end-page: 48 ident: BIB5 article-title: New retrieval approaches using SMART: TREC 4 publication-title: Proceedings of the Fourth Text REtrieval Conference, Gaithersburg, Maryland – reference: . ACM SIGIR’97, Philadelphia, PA, USA, pp. 42–49 – volume: 59 start-page: 109 year: 1997 end-page: 126 ident: BIB16 article-title: The development of indexing system evaluation — theory and practice publication-title: Chinese Library Society Report – year: 1989 ident: BIB12 publication-title: Genetic algorithms in search, optimization and machine learning – volume: 31 start-page: 1208 year: 1988 end-page: 1218 ident: BIB14 article-title: Probabilistic and genetic algorithms in document retrieval publication-title: Communications of the ACM – reference: , pp. 33–56 – volume: 28 start-page: 11 year: 1972 end-page: 20 ident: BIB18 article-title: A statistical interpretation of term specificity and its application in retrieval publication-title: J. Documentation – start-page: 603 year: 1993 end-page: 611 ident: BIB35 article-title: Query optimization in information retrieval using genetic algorithms publication-title: Proceedings of the Fifth International Conference on Genetic Algorithms, Urbana, IL – volume: 24 start-page: 513 year: 1988 end-page: 523 ident: BIB32 article-title: Term-weighting approaches in automatic text retrieval publication-title: Information Processing and Management – reference: Yang, Y.-Y., Chang, J.-S., & Chen, K.-J. (1993). – start-page: 191 year: 1996 end-page: 198 ident: BIB17 article-title: Berkeley Chinese information retrieval at TREC-5: technical report publication-title: Proceeding of the Fifth Text REtrieval Conference, Gaithersburg, Maryland – volume: 37 start-page: 133 year: 1991 end-page: 136 ident: BIB26 article-title: An algorithm for string matching with a sequence of don’t cares publication-title: Information Processing Letters – start-page: 335 year: 1996 end-page: 340 ident: BIB37 article-title: Experiments on Chinese text indexing — CLARIT TREC-5 Chinese track report publication-title: Proceedings of the Fifth Text Retrieval Conference, Gaithersburg, Maryland – reference: Nie, J.-Y., Brisebois, M., & Ren, X. (1996). – volume: 12 start-page: 252 year: 1994 end-page: 277 ident: BIB34 article-title: An example-based mapping method for text categorization and retrieval publication-title: ACM Transactions on Information Systems – start-page: 313 year: 1971 end-page: 323 ident: BIB29 article-title: Relevance feedback in information retrieval publication-title: The SMART retrieval system: Experiments in automatic document processing – reference: Kwok, K. L. (1997). – reference: . Ph.D. thesis, Department of CSIE, National Taiwan University – reference: Chien, L.-F., Huang, T.-I., & Chien, M.-C. (1997). – reference: . ACM SIGIR’97, Philadelphia, PA, USA, pp. 34–41 – reference: Chang, C.-H., & Hsu, C.-C. (1997). Information searching and exploring agent applying clustering and genetic algorithm. – reference: . ACM SIGIR’97, Philadelphia, PA, USA pp. 50–59 – volume: 22 start-page: 73 year: 1995 end-page: 83 ident: BIB33 article-title: A genetic approach to the quardatic assignment problem publication-title: Computers and Operations Research – year: 1983 ident: BIB31 publication-title: Introduciton to modern information retrieval – reference: Fung, P., & Wu, D. (1994). Statistical augmentation of a Chinese machine-readable dictionary. – reference: Chang, C.-H., & Hsu, C.-C. (1999). – start-page: 270 year: 1996 end-page: 278 ident: BIB1 article-title: Incremental relevance feedback for information filtering publication-title: Proceedings of the Nineteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Zurich, Switzerland – ident: 10.1016/S0306-4573(00)00008-X_BIB36 – ident: 10.1016/S0306-4573(00)00008-X_BIB11 – ident: 10.1016/S0306-4573(00)00008-X_BIB8 doi: 10.1145/258525.258532 – volume: 12 start-page: 278 issue: 3 year: 1994 ident: 10.1016/S0306-4573(00)00008-X_BIB22 article-title: Text categorization for multiple users based on semantic features from a machine-readable dictionary publication-title: ACM Transaction on Information Systems doi: 10.1145/183422.183425 – start-page: 465 year: 1992 ident: 10.1016/S0306-4573(00)00008-X_BIB2 article-title: Text retrieval: theory and practice – start-page: 16 year: 1997 ident: 10.1016/S0306-4573(00)00008-X_BIB25 article-title: Improving relevance feedback in the vector space model – start-page: 351 year: 1995 ident: 10.1016/S0306-4573(00)00008-X_BIB4 article-title: Optimization of relevance feedback weights – volume: 12 start-page: 252 issue: 3 year: 1994 ident: 10.1016/S0306-4573(00)00008-X_BIB34 article-title: An example-based mapping method for text categorization and retrieval publication-title: ACM Transactions on Information Systems doi: 10.1145/183422.183424 – ident: 10.1016/S0306-4573(00)00008-X_BIB7 – volume: 28 start-page: 11 issue: 1 year: 1972 ident: 10.1016/S0306-4573(00)00008-X_BIB18 article-title: A statistical interpretation of term specificity and its application in retrieval publication-title: J. Documentation doi: 10.1108/eb026526 – start-page: 603 year: 1993 ident: 10.1016/S0306-4573(00)00008-X_BIB35 article-title: Query optimization in information retrieval using genetic algorithms – start-page: 1 year: 1992 ident: 10.1016/S0306-4573(00)00008-X_BIB15 article-title: Relevance feedback revisited – volume: 37 start-page: 133 year: 1991 ident: 10.1016/S0306-4573(00)00008-X_BIB26 article-title: An algorithm for string matching with a sequence of don’t cares publication-title: Information Processing Letters doi: 10.1016/0020-0190(91)90032-D – year: 1989 ident: 10.1016/S0306-4573(00)00008-X_BIB12 – volume: 8 start-page: 64 year: 1987 ident: 10.1016/S0306-4573(00)00008-X_BIB23 article-title: New advances in computers and natural language processing in China publication-title: Information Science – ident: 10.1016/S0306-4573(00)00008-X_BIB10 doi: 10.1145/258525.258534 – volume: 59 start-page: 109 year: 1997 ident: 10.1016/S0306-4573(00)00008-X_BIB16 article-title: The development of indexing system evaluation — theory and practice publication-title: Chinese Library Society Report – start-page: 191 year: 1996 ident: 10.1016/S0306-4573(00)00008-X_BIB17 article-title: Berkeley Chinese information retrieval at TREC-5: technical report – start-page: 335 year: 1996 ident: 10.1016/S0306-4573(00)00008-X_BIB37 article-title: Experiments on Chinese text indexing — CLARIT TREC-5 Chinese track report – start-page: 25 year: 1995 ident: 10.1016/S0306-4573(00)00008-X_BIB5 article-title: New retrieval approaches using SMART: TREC 4 – volume: 24 start-page: 513 issue: 5 year: 1988 ident: 10.1016/S0306-4573(00)00008-X_BIB32 article-title: Term-weighting approaches in automatic text retrieval publication-title: Information Processing and Management doi: 10.1016/0306-4573(88)90021-0 – volume: 22 start-page: 73 issue: 1 year: 1995 ident: 10.1016/S0306-4573(00)00008-X_BIB33 article-title: A genetic approach to the quardatic assignment problem publication-title: Computers and Operations Research doi: 10.1016/0305-0548(93)E0020-T – start-page: 270 year: 1996 ident: 10.1016/S0306-4573(00)00008-X_BIB1 article-title: Incremental relevance feedback for information filtering – start-page: 66 year: 1992 ident: 10.1016/S0306-4573(00)00008-X_BIB13 article-title: New indices for text PAT trees and PAT arrays – ident: 10.1016/S0306-4573(00)00008-X_BIB21 doi: 10.1145/243199.243277 – volume: 6 start-page: 25 year: 1989 ident: 10.1016/S0306-4573(00)00008-X_BIB24 article-title: Comparing and combining the effectiveness of latent semantic indexing and the ordinary vector space model for information retrieval publication-title: Journal of Information Science – start-page: 313 year: 1971 ident: 10.1016/S0306-4573(00)00008-X_BIB29 article-title: Relevance feedback in information retrieval – ident: 10.1016/S0306-4573(00)00008-X_BIB19 doi: 10.1145/258525.258531 – ident: 10.1016/S0306-4573(00)00008-X_BIB27 doi: 10.1145/243199.243270 – year: 1983 ident: 10.1016/S0306-4573(00)00008-X_BIB31 – start-page: 69 year: 1994 ident: 10.1016/S0306-4573(00)00008-X_BIB3 article-title: Automatic query expansion using SMART: TREC 3 – ident: 10.1016/S0306-4573(00)00008-X_BIB6 – volume: 31 start-page: 1208 year: 1988 ident: 10.1016/S0306-4573(00)00008-X_BIB14 article-title: Probabilistic and genetic algorithms in document retrieval publication-title: Communications of the ACM doi: 10.1145/63039.63044 |
| SSID | ssj0004512 |
| Score | 1.90022 |
| Snippet | This paper proposes a novel approach to automatically retrieve keywords and then uses genetic algorithms to adapt the keyword weights. One of the contributions... Proposes a novel approach to automatically retrieve keywords and then uses genetic algorithms to adapt the keyword weights. Discusses Chinese text retrieval,... This paper proposes a novel approach to automatically retrieve keywords and then uses genetic algorithms to adapt the keyword weights. Proposes a novel approach to automatically retrieve keywords and uses genetic algorithms to adapt the keyword weights. Combines the bigram model and PAT-tree... |
| SourceID | proquest pascalfrancis eric crossref elsevier |
| SourceType | Aggregation Database Index Database Enrichment Source Publisher |
| StartPage | 737 |
| SubjectTerms | Algorithms Bigram Strategy Chinese Content analysis Exact sciences and technology Genetic algorithms Genetics Indexing. Classification. Abstracting. Syntheses Information and communication sciences Information and document structure and analysis Information processing and retrieval Information Retrieval Information science. Documentation Keyword Keywords Mathematical Formulas Online information retrieval Optimization PAT-tree Query Processing Relevance (Information Retrieval) Retrieval Sciences and techniques of general use Searching Studies Vector Spaces Weighted Term Searching Weighting |
| Title | Applying genetic algorithms to query optimization in document retrieval |
| URI | https://dx.doi.org/10.1016/S0306-4573(00)00008-X http://eric.ed.gov/ERICWebPortal/detail?accno=EJ606818 https://www.proquest.com/docview/194903974 https://www.proquest.com/docview/57478708 |
| Volume | 36 |
| WOSCitedRecordID | wos000087256800003&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVESC databaseName: Elsevier SD Freedom Collection Journals 2021 customDbUrl: eissn: 1873-5371 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0004512 issn: 0306-4573 databaseCode: AIEXJ dateStart: 19950101 isFulltext: true titleUrlDefault: https://www.sciencedirect.com providerName: Elsevier |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1bb9MwFLbKxgMSQlyGKKPgB4ZAyCNN7CR-nEbLmKaCRJDCk5WkDivqktKbJn4Iv5fjW9JqgsEDL1GUxBf5fDk-PleEnsMexGQhKclhAyA0LymJWZaRQPqFlwc5HUeFLjYRjUZxmvKPnc5PFwuznkZVFV9e8tl_JTU8A2Kr0Nl_IHfTKTyAeyA6XIHscP0rwiu5UscuwRdSp2Odfq3nk-X5hU7mAPuAsqoDp7iwIZjaIbYuVtotYK4rbK3tIN-cn3sT4_h6ZiILjIYhtN6vm-4zJ_XcefnCDUl-1HZz1E4E0I4cOyX1F3m-pXXwGrcqqwpz4TCOA7U-SDoUywsJZaZGyaE0nDWOAsICU2_FsV6T-8RCjG3w0chkgrnC342q4VMzAEjhqpg116IvSdtNzRnyRx_E8PPZmUgGabL9Vu_hSjcAJ07_IBjOvhNVi0zZ7A-CtwYXN9CuHzEODH_36P0gPd3IQ9-39ikzjzY27E07uZee98pO7HdSj_Wxvz3LFvBLlqaWyhWxQMs6yV10xx5S8JEB1z3UkdV91LMhLvgF3sADtqR5gN454GELPNwCDy9rrIGHN4GHJxV2wMMN8PZQMhwkxyfElukgBUh_SzLOyjiPg3Asx9FYwnGfZXlZShXjnBd5FnCV8ofD0ZUpkzcNM-mXkZ_FJVDNL7LgIdqp6ko-QjiWgWQU-vI9SmVY8H5ZKoGfUxrKLMq7iLpVFIVNYa8qqUxF66sIiy_U4gtP5b1VNVbTLjpsms1MDpfrGsSORMIKokbAFIDD65ruKZI24wxOQy8EobiLels0bidCGYcuumjf0VxYZrIQfU65BwcG2kXPmrfA_pVNL6tkvVoIpupfRF78-I_t99Gt9id-gnaW85XsoZvFejlZzJ9acP8Cms3KmQ |
| linkProvider | Elsevier |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Applying+genetic+algorithms+to+query+optimization+in+document+retrieval&rft.jtitle=Information+processing+%26+management&rft.au=Horng%2C+Jorng-Tzong&rft.au=Ching-Chang%2C+Yeh&rft.date=2000-09-01&rft.pub=Elsevier+Science+Ltd&rft.issn=0306-4573&rft.eissn=1873-5371&rft.volume=36&rft.issue=5&rft.spage=737&rft_id=info:doi/10.1016%2FS0306-4573%2800%2900008-X&rft.externalDBID=NO_FULL_TEXT&rft.externalDocID=69078992 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0306-4573&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0306-4573&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0306-4573&client=summon |