CRIM's content-based audio copy detection system for TRECVID 2009.

Saved in:
Bibliographic Details
Title: CRIM's content-based audio copy detection system for TRECVID 2009.
Authors: Gupta, Vishwa, Boulianne, Gilles, Cardinal, Patrick
Source: Multimedia Tools & Applications; Sep2012, Vol. 60 Issue 2, p371-387, 17p
Subject Terms: AUDIO communication, DEMODULATION, COPYING, COMPUTER file sharing, ALGORITHMS
Abstract: We report results on audio copy detection for TRECVID 2009 copy detection task. This task involves searching for transformed audio queries in over 385 h of test audio. The queries were transformed in seven different ways, three of them involved mixing unrelated speech to the original query, making it a much more difficult task. We give results with two different audio fingerprints and show that mapping each test frame to the nearest query frame (nearest-neighbor fingerprint) results in robust audio copy detection. The most difficult task in TRECVID 2009 was to detect audio copies using predetermined thresholds computed from 2008 data. We show that the nearest-neighbor fingerprints were robust to even this task and gave actual minimal normalized detection cost rate (NDCR) of around 0.06 for all the transformations. These results are close to those obtained by using the optimal threshold for each transform. This result shows the robustness of the nearest-neighbor fingerprints. These nearest-neighbor fingerprints can be efficiently computed on a graphics processing unit, leading to a very fast search. [ABSTRACT FROM AUTHOR]
Copyright of Multimedia Tools & Applications is the property of Springer Nature and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Complementary Index
FullText Text:
  Availability: 0
CustomLinks:
  – Url: https://resolver.ebscohost.com/openurl?sid=EBSCO:edb&genre=article&issn=13807501&ISBN=&volume=60&issue=2&date=20120915&spage=371&pages=371-387&title=Multimedia Tools & Applications&atitle=CRIM%27s%20content-based%20audio%20copy%20detection%20system%20for%20TRECVID%202009.&aulast=Gupta%2C%20Vishwa&id=DOI:10.1007/s11042-010-0608-x
    Name: Full Text Finder
    Category: fullText
    Text: Full Text Finder
    Icon: https://imageserver.ebscohost.com/branding/images/FTF.gif
    MouseOverText: Full Text Finder
  – Url: https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=EBSCO&SrcAuth=EBSCO&DestApp=WOS&ServiceName=TransferToWoS&DestLinkType=GeneralSearchSummary&Func=Links&author=Gupta%20V
    Name: ISI
    Category: fullText
    Text: Nájsť tento článok vo Web of Science
    Icon: https://imagesrvr.epnet.com/ls/20docs.gif
    MouseOverText: Nájsť tento článok vo Web of Science
Header DbId: edb
DbLabel: Complementary Index
An: 76573653
RelevancyScore: 834
AccessLevel: 6
PubType: Academic Journal
PubTypeId: academicJournal
PreciseRelevancyScore: 834.346252441406
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: CRIM's content-based audio copy detection system for TRECVID 2009.
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Gupta%2C+Vishwa%22">Gupta, Vishwa</searchLink><br /><searchLink fieldCode="AR" term="%22Boulianne%2C+Gilles%22">Boulianne, Gilles</searchLink><br /><searchLink fieldCode="AR" term="%22Cardinal%2C+Patrick%22">Cardinal, Patrick</searchLink>
– Name: TitleSource
  Label: Source
  Group: Src
  Data: Multimedia Tools & Applications; Sep2012, Vol. 60 Issue 2, p371-387, 17p
– Name: Subject
  Label: Subject Terms
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22AUDIO+communication%22">AUDIO communication</searchLink><br /><searchLink fieldCode="DE" term="%22DEMODULATION%22">DEMODULATION</searchLink><br /><searchLink fieldCode="DE" term="%22COPYING%22">COPYING</searchLink><br /><searchLink fieldCode="DE" term="%22COMPUTER+file+sharing%22">COMPUTER file sharing</searchLink><br /><searchLink fieldCode="DE" term="%22ALGORITHMS%22">ALGORITHMS</searchLink>
– Name: Abstract
  Label: Abstract
  Group: Ab
  Data: We report results on audio copy detection for TRECVID 2009 copy detection task. This task involves searching for transformed audio queries in over 385 h of test audio. The queries were transformed in seven different ways, three of them involved mixing unrelated speech to the original query, making it a much more difficult task. We give results with two different audio fingerprints and show that mapping each test frame to the nearest query frame (nearest-neighbor fingerprint) results in robust audio copy detection. The most difficult task in TRECVID 2009 was to detect audio copies using predetermined thresholds computed from 2008 data. We show that the nearest-neighbor fingerprints were robust to even this task and gave actual minimal normalized detection cost rate (NDCR) of around 0.06 for all the transformations. These results are close to those obtained by using the optimal threshold for each transform. This result shows the robustness of the nearest-neighbor fingerprints. These nearest-neighbor fingerprints can be efficiently computed on a graphics processing unit, leading to a very fast search. [ABSTRACT FROM AUTHOR]
– Name: Abstract
  Label:
  Group: Ab
  Data: <i>Copyright of Multimedia Tools & Applications is the property of Springer Nature and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract.</i> (Copyright applies to all Abstracts.)
PLink https://erproxy.cvtisr.sk/sfx/access?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edb&AN=76573653
RecordInfo BibRecord:
  BibEntity:
    Identifiers:
      – Type: doi
        Value: 10.1007/s11042-010-0608-x
    Languages:
      – Code: eng
        Text: English
    PhysicalDescription:
      Pagination:
        PageCount: 17
        StartPage: 371
    Subjects:
      – SubjectFull: AUDIO communication
        Type: general
      – SubjectFull: DEMODULATION
        Type: general
      – SubjectFull: COPYING
        Type: general
      – SubjectFull: COMPUTER file sharing
        Type: general
      – SubjectFull: ALGORITHMS
        Type: general
    Titles:
      – TitleFull: CRIM's content-based audio copy detection system for TRECVID 2009.
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Gupta, Vishwa
      – PersonEntity:
          Name:
            NameFull: Boulianne, Gilles
      – PersonEntity:
          Name:
            NameFull: Cardinal, Patrick
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 15
              M: 09
              Text: Sep2012
              Type: published
              Y: 2012
          Identifiers:
            – Type: issn-print
              Value: 13807501
          Numbering:
            – Type: volume
              Value: 60
            – Type: issue
              Value: 2
          Titles:
            – TitleFull: Multimedia Tools & Applications
              Type: main
ResultId 1