Enhancing Video Anomaly Detection Using Spatio-Temporal Autoencoders and Convolutional LSTM Networks

Identifying suspicious activities or behaviors is essential in the domain of Anomaly Detection (AD). In crowded scenes, the presence of inter-object occlusions often complicates the detection of such behaviors. Therefore, developing a robust method capable of accurately detecting and locating anomal...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:SN computer science Jg. 5; H. 1; S. 190
Hauptverfasser: Almahadin, Ghayth, Subburaj, Maheswari, Hiari, Mohammad, Sathasivam Singaram, Saranya, Kolla, Bhanu Prakash, Dadheech, Pankaj, Vibhute, Amol D., Sengan, Sudhakar
Format: Journal Article
Sprache:Englisch
Veröffentlicht: Singapore Springer Nature Singapore 11.01.2024
Springer Nature B.V
Schlagworte:
ISSN:2661-8907, 2662-995X, 2661-8907
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Identifying suspicious activities or behaviors is essential in the domain of Anomaly Detection (AD). In crowded scenes, the presence of inter-object occlusions often complicates the detection of such behaviors. Therefore, developing a robust method capable of accurately detecting and locating anomalous activities within video sequences becomes crucial, especially in densely populated environments. This research initiative aims to address this challenge by proposing a novel approach focusing on AD behaviors in crowded settings. By leveraging a spatio-temporal method, the proposed approach harnesses the power of both spatial and temporal dimensions. This enables the method to effectively capture and analyze the intricate motion patterns and spatial information embedded within the continuous frames of video data. The objective is to create a comprehensive model that can efficiently detect and precisely locate anomalies within complex video sequences, specifically those featuring human crowds. The efficacy of the proposed model will be rigorously evaluated using a benchmark dataset encompassing diverse scenarios involving crowded environments. The dataset is designed to simulate real-world conditions where millions of video footage need to be continuously monitored in real time. The focus is on identifying anomalies, which might occur within short time frames, sometimes as brief as five minutes or even less. Given the challenges posed by the massive volume of data and the requirement for rapid AD, the research emphasizes the limitations of traditional Supervised Learning (SL) methods in this context.
Bibliographie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:2661-8907
2662-995X
2661-8907
DOI:10.1007/s42979-023-02542-1