Query-Driven Video Summarization for Long Video Footage Analysis Using Faster-RCNN and Determinantal Point Processes

With ever growing volume of video content, video summarization becomes crucial for efficiently condensing lengthy videos into informative representations. This paper introduces a novel approach for video summarization by combining object identification with Determinantal Point Processes (DPP). Objec...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Procedia computer science Ročník 258; s. 3989 - 3999
Hlavní autoři: Bhute, Maitrey M., Tare, Sanskar S., S, Sridhar Raj
Médium: Journal Article
Jazyk:angličtina
Vydáno: Elsevier B.V 2025
Témata:
ISSN:1877-0509, 1877-0509
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:With ever growing volume of video content, video summarization becomes crucial for efficiently condensing lengthy videos into informative representations. This paper introduces a novel approach for video summarization by combining object identification with Determinantal Point Processes (DPP). Object identification, powered by advanced computer vision techniques, tracks and recognizes objects across video frames (Convolutional Neural Network/Computer Vision), allowing for identification and prioritization of frames with key objects and respective interactions. The proposed algorithm leverages DPP and R-CNN concurrently to select frames while avoiding redundancy and preserving diversity Extensive experiments using diverse video datasets demonstrates that this method performs competitively with existing summarization techniques while tested on TVSum dataset [12], a popular baseline. The results highlight the efficacy of the proposed algorithm in striking a balance between information retention and summary length. It provides accuracy of 84.34% and recall of 13.09%. This research contributes to the field of multimedia content analysis, with potential applications in video indexing, retrieval, and content recommendation systems. Through integration of object identification and DPP, this work introduces an innovative dimension to video summarization. The proposed approach holds promise for generating more informative and visually coherent video summaries. This work holds the potential to advance the state-of-the-art in video summarization techniques, benefiting various multimedia applications by aiding content analysis and user experience.
ISSN:1877-0509
1877-0509
DOI:10.1016/j.procs.2025.04.650