Telecom paristech at imageclefphoto 2008: Bi-modal text and image retrieval with diversity enhancement

Uloženo v:
Podrobná bibliografie
Název: Telecom paristech at imageclefphoto 2008: Bi-modal text and image retrieval with diversity enhancement
Autoři: Marin Ferecatu, Hichem Sahbi
Přispěvatelé: The Pennsylvania State University CiteSeerX Archives
Zdroj: http://clef.isti.cnr.it/2008/working_notes/ferecatu-paperCLEF2008.pdf.
Rok vydání: 2008
Sbírka: CiteSeerX
Témata: Categories and Subject Descriptors H.3 [Information Storage and Retrieval, H.3.1 Content Analysis and Indexing, H.3.3 Information Search and Retrieval, H.3.4 Systems and Software, H.3.7 Digital Libraries, H.2.3 [Database Manage- ment, Languages—Query Languages General Terms Measurement, Performance, Experimentation. Keywords Image retrieval, Reranking, Support Vector Machines, Hybrid Text and Image Search
Popis: In this paper we describe the participation of TELECOM ParisTech in the ImageClefphoto 2008 challenge. This edition focuses on promoting diversity in the results produced by the retrieval systems. Given the high level semantic content of the topics, search engines based solely on text or visual descriptors are unlikely to offer satisfactory results. Our system uses several text and visual descriptors, as well as several combination algorithms to improve the overall retrieval performance. The text part includes a collection of manually built boolean queries and a set of textual descriptors extracted automatically using dictionary filtering and dimensionality reduction. Text and visual descriptors are combined using two strategies: ad-hoc concatenation and re-ranking. Diversity makes it possible to reduce the redundancy in the final results and it is obtained using two techniques, threshold clustering and maxmin exploration. Several runs were submitted to the challenge, including individual (text or visual), combined, and with different settings of diversity. The results show that the combined runs outperform by a significant amount the individual runs. These results clearly corroborate (i) the complementarity of text and visual descriptors and (ii) the effectiveness of boolean queries suggesting promising future research directions.
Druh dokumentu: text
Popis souboru: application/pdf
Jazyk: English
Relation: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.505.1655
Dostupnost: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.505.1655
http://clef.isti.cnr.it/2008/working_notes/ferecatu-paperCLEF2008.pdf
Rights: Metadata may be used without restrictions as long as the oai identifier remains attached to it.
Přístupové číslo: edsbas.6456C2C
Databáze: BASE
FullText Text:
  Availability: 0
CustomLinks:
  – Url: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.505.1655#
    Name: EDS - BASE (s4221598)
    Category: fullText
    Text: View record from BASE
  – Url: https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=EBSCO&SrcAuth=EBSCO&DestApp=WOS&ServiceName=TransferToWoS&DestLinkType=GeneralSearchSummary&Func=Links&author=Ferecatu%20M
    Name: ISI
    Category: fullText
    Text: Nájsť tento článok vo Web of Science
    Icon: https://imagesrvr.epnet.com/ls/20docs.gif
    MouseOverText: Nájsť tento článok vo Web of Science
Header DbId: edsbas
DbLabel: BASE
An: edsbas.6456C2C
RelevancyScore: 839
AccessLevel: 3
PubType: Academic Journal
PubTypeId: academicJournal
PreciseRelevancyScore: 838.664306640625
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: Telecom paristech at imageclefphoto 2008: Bi-modal text and image retrieval with diversity enhancement
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Marin+Ferecatu%22">Marin Ferecatu</searchLink><br /><searchLink fieldCode="AR" term="%22Hichem+Sahbi%22">Hichem Sahbi</searchLink>
– Name: Author
  Label: Contributors
  Group: Au
  Data: The Pennsylvania State University CiteSeerX Archives
– Name: TitleSource
  Label: Source
  Group: Src
  Data: <i>http://clef.isti.cnr.it/2008/working_notes/ferecatu-paperCLEF2008.pdf</i>.
– Name: DatePubCY
  Label: Publication Year
  Group: Date
  Data: 2008
– Name: Subset
  Label: Collection
  Group: HoldingsInfo
  Data: CiteSeerX
– Name: Subject
  Label: Subject Terms
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22Categories+and+Subject+Descriptors+H%2E3+[Information+Storage+and+Retrieval%22">Categories and Subject Descriptors H.3 [Information Storage and Retrieval</searchLink><br /><searchLink fieldCode="DE" term="%22H%2E3%2E1+Content+Analysis+and+Indexing%22">H.3.1 Content Analysis and Indexing</searchLink><br /><searchLink fieldCode="DE" term="%22H%2E3%2E3+Information+Search+and+Retrieval%22">H.3.3 Information Search and Retrieval</searchLink><br /><searchLink fieldCode="DE" term="%22H%2E3%2E4+Systems+and+Software%22">H.3.4 Systems and Software</searchLink><br /><searchLink fieldCode="DE" term="%22H%2E3%2E7+Digital+Libraries%22">H.3.7 Digital Libraries</searchLink><br /><searchLink fieldCode="DE" term="%22H%2E2%2E3+[Database+Manage-+ment%22">H.2.3 [Database Manage- ment</searchLink><br /><searchLink fieldCode="DE" term="%22Languages—Query+Languages+General+Terms+Measurement%22">Languages—Query Languages General Terms Measurement</searchLink><br /><searchLink fieldCode="DE" term="%22Performance%22">Performance</searchLink><br /><searchLink fieldCode="DE" term="%22Experimentation%2E+Keywords+Image+retrieval%22">Experimentation. Keywords Image retrieval</searchLink><br /><searchLink fieldCode="DE" term="%22Reranking%22">Reranking</searchLink><br /><searchLink fieldCode="DE" term="%22Support+Vector+Machines%22">Support Vector Machines</searchLink><br /><searchLink fieldCode="DE" term="%22Hybrid+Text+and+Image+Search%22">Hybrid Text and Image Search</searchLink>
– Name: Abstract
  Label: Description
  Group: Ab
  Data: In this paper we describe the participation of TELECOM ParisTech in the ImageClefphoto 2008 challenge. This edition focuses on promoting diversity in the results produced by the retrieval systems. Given the high level semantic content of the topics, search engines based solely on text or visual descriptors are unlikely to offer satisfactory results. Our system uses several text and visual descriptors, as well as several combination algorithms to improve the overall retrieval performance. The text part includes a collection of manually built boolean queries and a set of textual descriptors extracted automatically using dictionary filtering and dimensionality reduction. Text and visual descriptors are combined using two strategies: ad-hoc concatenation and re-ranking. Diversity makes it possible to reduce the redundancy in the final results and it is obtained using two techniques, threshold clustering and maxmin exploration. Several runs were submitted to the challenge, including individual (text or visual), combined, and with different settings of diversity. The results show that the combined runs outperform by a significant amount the individual runs. These results clearly corroborate (i) the complementarity of text and visual descriptors and (ii) the effectiveness of boolean queries suggesting promising future research directions.
– Name: TypeDocument
  Label: Document Type
  Group: TypDoc
  Data: text
– Name: Format
  Label: File Description
  Group: SrcInfo
  Data: application/pdf
– Name: Language
  Label: Language
  Group: Lang
  Data: English
– Name: NoteTitleSource
  Label: Relation
  Group: SrcInfo
  Data: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.505.1655
– Name: URL
  Label: Availability
  Group: URL
  Data: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.505.1655<br />http://clef.isti.cnr.it/2008/working_notes/ferecatu-paperCLEF2008.pdf
– Name: Copyright
  Label: Rights
  Group: Cpyrght
  Data: Metadata may be used without restrictions as long as the oai identifier remains attached to it.
– Name: AN
  Label: Accession Number
  Group: ID
  Data: edsbas.6456C2C
PLink https://erproxy.cvtisr.sk/sfx/access?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edsbas&AN=edsbas.6456C2C
RecordInfo BibRecord:
  BibEntity:
    Languages:
      – Text: English
    Subjects:
      – SubjectFull: Categories and Subject Descriptors H.3 [Information Storage and Retrieval
        Type: general
      – SubjectFull: H.3.1 Content Analysis and Indexing
        Type: general
      – SubjectFull: H.3.3 Information Search and Retrieval
        Type: general
      – SubjectFull: H.3.4 Systems and Software
        Type: general
      – SubjectFull: H.3.7 Digital Libraries
        Type: general
      – SubjectFull: H.2.3 [Database Manage- ment
        Type: general
      – SubjectFull: Languages—Query Languages General Terms Measurement
        Type: general
      – SubjectFull: Performance
        Type: general
      – SubjectFull: Experimentation. Keywords Image retrieval
        Type: general
      – SubjectFull: Reranking
        Type: general
      – SubjectFull: Support Vector Machines
        Type: general
      – SubjectFull: Hybrid Text and Image Search
        Type: general
    Titles:
      – TitleFull: Telecom paristech at imageclefphoto 2008: Bi-modal text and image retrieval with diversity enhancement
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Marin Ferecatu
      – PersonEntity:
          Name:
            NameFull: Hichem Sahbi
      – PersonEntity:
          Name:
            NameFull: The Pennsylvania State University CiteSeerX Archives
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 01
              M: 01
              Type: published
              Y: 2008
          Identifiers:
            – Type: issn-locals
              Value: edsbas
            – Type: issn-locals
              Value: edsbas.oa
          Titles:
            – TitleFull: http://clef.isti.cnr.it/2008/working_notes/ferecatu-paperCLEF2008.pdf
              Type: main
ResultId 1