Full metadata record
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Ahn, Deok-Hyun | - |
| dc.contributor.author | Jo, Yong-Jin | - |
| dc.contributor.author | Kim, Dong-Bum | - |
| dc.contributor.author | Nam, Gi Pyo | - |
| dc.contributor.author | Han, Jae-Ho | - |
| dc.contributor.author | Kim, Haksub | - |
| dc.date.accessioned | 2025-11-26T07:32:37Z | - |
| dc.date.available | 2025-11-26T07:32:37Z | - |
| dc.date.created | 2025-11-25 | - |
| dc.date.issued | 2025-10 | - |
| dc.identifier.issn | 1057-7149 | - |
| dc.identifier.uri | https://pubs.kist.re.kr/handle/201004/153649 | - |
| dc.description.abstract | In surveillance environments, detecting anomalies requires understanding the contextual dynamics of the environment, human behaviors, and movements within a scene. Effective anomaly detection must address both the where and what of events, but existing approaches such as unimodal action-based methods or LLM-integrated multimodal frameworks have limitations. These methods either rely on implicit scene information, making it difficult to localize where anomalies occur, or fail to adapt to surveillance-specific challenges such as view changes, subtle actions, low-light conditions, and crowded scenes. As a result, these challenges hinder accurate detection of what occurs. To overcome these limitations, our system takes advantage of features from a lightweight scene classification model to discern where an event occurs, acquiring explicit location-based context. To identify what events occur, it focuses on atomic actions, which remain underexplored in this field and are better suited to interpreting intricate abnormal behaviors than conventional abstract action features. To achieve robust anomaly detection, the proposed Temporal-Semantic Relationship Network (TSRN) models spatio-temporal relationships among multimodal features and employs a Segment-selective Focal Margin loss (SFML) to effectively address class imbalance, outperforming conventional MIL-based methods. Experimental results on public datasets demonstrate that the proposed system effectively reduces false alarms while maintaining robustness and practicality for real-world surveillance applications. | - |
| dc.language | English | - |
| dc.publisher | Institute of Electrical and Electronics Engineers | - |
| dc.title | Where and What: Contextual Dynamics-Aware Anomaly Detection in Surveillance Videos | - |
| dc.type | Article | - |
| dc.identifier.doi | 10.1109/tip.2025.3623392 | - |
| dc.description.journalClass | 1 | - |
| dc.identifier.bibliographicCitation | IEEE Transactions on Image Processing, v.34, pp.6993 - 7007 | - |
| dc.citation.title | IEEE Transactions on Image Processing | - |
| dc.citation.volume | 34 | - |
| dc.citation.startPage | 6993 | - |
| dc.citation.endPage | 7007 | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.identifier.wosid | 001608943900001 | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Artificial Intelligence | - |
| dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalResearchArea | Engineering | - |
| dc.type.docType | Article | - |
| dc.subject.keywordAuthor | Surveillance videos | - |
| dc.subject.keywordAuthor | contextual dynamics | - |
| dc.subject.keywordAuthor | weakly-supervised video anomaly detection | - |