Full metadata record

DC Field Value Language
dc.contributor.authorAhn, Deok-Hyun-
dc.contributor.authorJo, Yong-Jin-
dc.contributor.authorKim, Dong-Bum-
dc.contributor.authorNam, Gi Pyo-
dc.contributor.authorHan, Jae-Ho-
dc.contributor.authorKim, Haksub-
dc.date.accessioned2025-11-26T07:32:37Z-
dc.date.available2025-11-26T07:32:37Z-
dc.date.created2025-11-25-
dc.date.issued2025-10-
dc.identifier.issn1057-7149-
dc.identifier.urihttps://pubs.kist.re.kr/handle/201004/153649-
dc.description.abstractIn surveillance environments, detecting anomalies requires understanding the contextual dynamics of the environment, human behaviors, and movements within a scene. Effective anomaly detection must address both the where and what of events, but existing approaches such as unimodal action-based methods or LLM-integrated multimodal frameworks have limitations. These methods either rely on implicit scene information, making it difficult to localize where anomalies occur, or fail to adapt to surveillance specific challenges such as view changes, subtle actions, low light conditions, and crowded scenes. As a result, these challenges hinder accurate detection of what occurs. To overcome these limitations, our system takes advantage of features from a lightweight scene classification model to discern where an event occurs, acquiring explicit location-based context. To identify what events occur, it focuses on atomic actions, which remain underexplored in this field and are better suited to interpreting intricate abnormal behaviors than conventional abstract action features. To achieve robust anomaly detection, the proposed Temporal-Semantic Relationship Network (TSRN) models spatio-temporal relationships among multimodal features and employs a Segment-selective Focal Margin loss (SFML) to effectively address class imbalance, outperforming conventional MIL-based methods. Experimental results on public datasets demonstrate that the proposed system effectively reduces false alarms while maintaining robustness and practicality for real-world surveillance applications.-
dc.languageEnglish-
dc.publisherInstitute of Electrical and Electronics Engineers-
dc.titleWhere and What: Contextual Dynamics-Aware Anomaly Detection in Surveillance Videos-
dc.typeArticle-
dc.identifier.doi10.1109/tip.2025.3623392-
dc.description.journalClass1-
dc.identifier.bibliographicCitationIEEE Transactions on Image Processing, v.34, pp.6993 - 7007-
dc.citation.titleIEEE Transactions on Image Processing-
dc.citation.volume34-
dc.citation.startPage6993-
dc.citation.endPage7007-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.identifier.wosid001608943900001-
dc.relation.journalWebOfScienceCategoryComputer Science, Artificial Intelligence-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaEngineering-
dc.type.docTypeArticle-
dc.subject.keywordAuthorSurveillance videos-
dc.subject.keywordAuthorcontextual dynamics-
dc.subject.keywordAuthorweakly-supervised video anomaly detection-
dc.subject.keywordAuthorweakly-supervised video anomaly detection-
dc.subject.keywordAuthorweakly-supervised video anomaly detection-
Appears in Collections:
KIST Article > 2025
Export
RIS (EndNote)
XLS (Excel)
XML

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE