CRIM's content-based audio copy detection system for TRECVID 2009.
Saved in:
| Title: | CRIM's content-based audio copy detection system for TRECVID 2009. |
|---|---|
| Authors: | Gupta, Vishwa, Boulianne, Gilles, Cardinal, Patrick |
| Source: | Multimedia Tools & Applications; Sep2012, Vol. 60 Issue 2, p371-387, 17p |
| Subject Terms: | AUDIO communication, DEMODULATION, COPYING, COMPUTER file sharing, ALGORITHMS |
| Abstract: | We report results on audio copy detection for TRECVID 2009 copy detection task. This task involves searching for transformed audio queries in over 385 h of test audio. The queries were transformed in seven different ways, three of them involved mixing unrelated speech to the original query, making it a much more difficult task. We give results with two different audio fingerprints and show that mapping each test frame to the nearest query frame (nearest-neighbor fingerprint) results in robust audio copy detection. The most difficult task in TRECVID 2009 was to detect audio copies using predetermined thresholds computed from 2008 data. We show that the nearest-neighbor fingerprints were robust to even this task and gave actual minimal normalized detection cost rate (NDCR) of around 0.06 for all the transformations. These results are close to those obtained by using the optimal threshold for each transform. This result shows the robustness of the nearest-neighbor fingerprints. These nearest-neighbor fingerprints can be efficiently computed on a graphics processing unit, leading to a very fast search. [ABSTRACT FROM AUTHOR] |
| Copyright of Multimedia Tools & Applications is the property of Springer Nature and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.) | |
| Database: | Complementary Index |
| FullText | Text: Availability: 0 CustomLinks: – Url: https://resolver.ebscohost.com/openurl?sid=EBSCO:edb&genre=article&issn=13807501&ISBN=&volume=60&issue=2&date=20120915&spage=371&pages=371-387&title=Multimedia Tools & Applications&atitle=CRIM%27s%20content-based%20audio%20copy%20detection%20system%20for%20TRECVID%202009.&aulast=Gupta%2C%20Vishwa&id=DOI:10.1007/s11042-010-0608-x Name: Full Text Finder Category: fullText Text: Full Text Finder Icon: https://imageserver.ebscohost.com/branding/images/FTF.gif MouseOverText: Full Text Finder – Url: https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=EBSCO&SrcAuth=EBSCO&DestApp=WOS&ServiceName=TransferToWoS&DestLinkType=GeneralSearchSummary&Func=Links&author=Gupta%20V Name: ISI Category: fullText Text: Nájsť tento článok vo Web of Science Icon: https://imagesrvr.epnet.com/ls/20docs.gif MouseOverText: Nájsť tento článok vo Web of Science |
|---|---|
| Header | DbId: edb DbLabel: Complementary Index An: 76573653 RelevancyScore: 834 AccessLevel: 6 PubType: Academic Journal PubTypeId: academicJournal PreciseRelevancyScore: 834.346252441406 |
| IllustrationInfo | |
| Items | – Name: Title Label: Title Group: Ti Data: CRIM's content-based audio copy detection system for TRECVID 2009. – Name: Author Label: Authors Group: Au Data: <searchLink fieldCode="AR" term="%22Gupta%2C+Vishwa%22">Gupta, Vishwa</searchLink><br /><searchLink fieldCode="AR" term="%22Boulianne%2C+Gilles%22">Boulianne, Gilles</searchLink><br /><searchLink fieldCode="AR" term="%22Cardinal%2C+Patrick%22">Cardinal, Patrick</searchLink> – Name: TitleSource Label: Source Group: Src Data: Multimedia Tools & Applications; Sep2012, Vol. 60 Issue 2, p371-387, 17p – Name: Subject Label: Subject Terms Group: Su Data: <searchLink fieldCode="DE" term="%22AUDIO+communication%22">AUDIO communication</searchLink><br /><searchLink fieldCode="DE" term="%22DEMODULATION%22">DEMODULATION</searchLink><br /><searchLink fieldCode="DE" term="%22COPYING%22">COPYING</searchLink><br /><searchLink fieldCode="DE" term="%22COMPUTER+file+sharing%22">COMPUTER file sharing</searchLink><br /><searchLink fieldCode="DE" term="%22ALGORITHMS%22">ALGORITHMS</searchLink> – Name: Abstract Label: Abstract Group: Ab Data: We report results on audio copy detection for TRECVID 2009 copy detection task. This task involves searching for transformed audio queries in over 385 h of test audio. The queries were transformed in seven different ways, three of them involved mixing unrelated speech to the original query, making it a much more difficult task. We give results with two different audio fingerprints and show that mapping each test frame to the nearest query frame (nearest-neighbor fingerprint) results in robust audio copy detection. The most difficult task in TRECVID 2009 was to detect audio copies using predetermined thresholds computed from 2008 data. We show that the nearest-neighbor fingerprints were robust to even this task and gave actual minimal normalized detection cost rate (NDCR) of around 0.06 for all the transformations. These results are close to those obtained by using the optimal threshold for each transform. This result shows the robustness of the nearest-neighbor fingerprints. These nearest-neighbor fingerprints can be efficiently computed on a graphics processing unit, leading to a very fast search. [ABSTRACT FROM AUTHOR] – Name: Abstract Label: Group: Ab Data: <i>Copyright of Multimedia Tools & Applications is the property of Springer Nature and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract.</i> (Copyright applies to all Abstracts.) |
| PLink | https://erproxy.cvtisr.sk/sfx/access?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edb&AN=76573653 |
| RecordInfo | BibRecord: BibEntity: Identifiers: – Type: doi Value: 10.1007/s11042-010-0608-x Languages: – Code: eng Text: English PhysicalDescription: Pagination: PageCount: 17 StartPage: 371 Subjects: – SubjectFull: AUDIO communication Type: general – SubjectFull: DEMODULATION Type: general – SubjectFull: COPYING Type: general – SubjectFull: COMPUTER file sharing Type: general – SubjectFull: ALGORITHMS Type: general Titles: – TitleFull: CRIM's content-based audio copy detection system for TRECVID 2009. Type: main BibRelationships: HasContributorRelationships: – PersonEntity: Name: NameFull: Gupta, Vishwa – PersonEntity: Name: NameFull: Boulianne, Gilles – PersonEntity: Name: NameFull: Cardinal, Patrick IsPartOfRelationships: – BibEntity: Dates: – D: 15 M: 09 Text: Sep2012 Type: published Y: 2012 Identifiers: – Type: issn-print Value: 13807501 Numbering: – Type: volume Value: 60 – Type: issue Value: 2 Titles: – TitleFull: Multimedia Tools & Applications Type: main |
| ResultId | 1 |
Full Text Finder
Nájsť tento článok vo Web of Science