DSpace at KIST: Where and What: Contextual Dynamics-Aware Anomaly Detection in Surveillance Videos

Full metadata record

DC Field	Value	Language
dc.contributor.author	Ahn, Deok-Hyun	-
dc.contributor.author	Jo, Yong-Jin	-
dc.contributor.author	Kim, Dong-Bum	-
dc.contributor.author	Nam, Gi Pyo	-
dc.contributor.author	Han, Jae-Ho	-
dc.contributor.author	Kim, Haksub	-
dc.date.accessioned	2025-11-26T07:32:37Z	-
dc.date.available	2025-11-26T07:32:37Z	-
dc.date.created	2025-11-25	-
dc.date.issued	2025-10	-
dc.identifier.issn	1057-7149	-
dc.identifier.uri	https://pubs.kist.re.kr/handle/201004/153649	-
dc.description.abstract	In surveillance environments, detecting anomalies requires understanding the contextual dynamics of the environment, human behaviors, and movements within a scene. Effective anomaly detection must address both the where and the what of events, but existing approaches such as unimodal action-based methods or LLM-integrated multimodal frameworks have limitations. These methods either rely on implicit scene information, making it difficult to localize where anomalies occur, or fail to adapt to surveillance-specific challenges such as view changes, subtle actions, low-light conditions, and crowded scenes. As a result, these challenges hinder accurate detection of what occurs. To overcome these limitations, our system takes advantage of features from a lightweight scene classification model to discern where an event occurs, acquiring explicit location-based context. To identify what events occur, it focuses on atomic actions, which remain underexplored in this field and are better suited to interpreting intricate abnormal behaviors than conventional abstract action features. To achieve robust anomaly detection, the proposed Temporal-Semantic Relationship Network (TSRN) models spatio-temporal relationships among multimodal features and employs a Segment-selective Focal Margin loss (SFML) to effectively address class imbalance, outperforming conventional MIL-based methods. Experimental results on public datasets demonstrate that the proposed system effectively reduces false alarms while maintaining robustness and practicality for real-world surveillance applications.	-
dc.language	English	-
dc.publisher	Institute of Electrical and Electronics Engineers	-
dc.title	Where and What: Contextual Dynamics-Aware Anomaly Detection in Surveillance Videos	-
dc.type	Article	-
dc.identifier.doi	10.1109/tip.2025.3623392	-
dc.description.journalClass	1	-
dc.identifier.bibliographicCitation	IEEE Transactions on Image Processing, v.34, pp.6993 - 7007	-
dc.citation.title	IEEE Transactions on Image Processing	-
dc.citation.volume	34	-
dc.citation.startPage	6993	-
dc.citation.endPage	7007	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.identifier.wosid	001608943900001	-
dc.relation.journalWebOfScienceCategory	Computer Science, Artificial Intelligence	-
dc.relation.journalWebOfScienceCategory	Engineering, Electrical & Electronic	-
dc.relation.journalResearchArea	Computer Science	-
dc.relation.journalResearchArea	Engineering	-
dc.type.docType	Article	-
dc.subject.keywordAuthor	Surveillance videos	-
dc.subject.keywordAuthor	weakly-supervised video anomaly detection	-
dc.subject.keywordAuthor	contextual dynamics	-

Appears in Collections: KIST Article > 2025

KIST Library Institutional Repository