Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Lee, Jiyoung | - |
dc.contributor.author | Kim, Seungryong | - |
dc.contributor.author | Kim, Sunok | - |
dc.contributor.author | Sohn, Kwanghoon | - |
dc.date.accessioned | 2024-07-18T05:00:06Z | - |
dc.date.available | 2024-07-18T05:00:06Z | - |
dc.date.created | 2024-07-18 | - |
dc.date.issued | 2024-11 | - |
dc.identifier.issn | 0031-3203 | - |
dc.identifier.uri | https://pubs.kist.re.kr/handle/201004/150243 | - |
dc.description.abstract | We propose a novel framework for spatiotemporal action detection using only video -level class labels as weak supervision. Traditional fully -supervised approaches rely on labor-intensive manual annotation of bounding boxes for each frame. In contrast, collecting video -level class labels is significantly less tedious and more feasible compared to annotating frame -level sequences with bounding boxes. To address this challenge, we propose a discriminative action tubelet detector, called DAT-detector, designed to discern discriminative tubelets from action tubelet proposals (ATPs). Whereas the previous approaches have only focused on tubelet selection among the predefined object proposals, our DAT-detector prioritizes the generation of more precise action tubelets using regression and attention modules. Moreover, we introduce an ATP generation method that enhances the quality of tubelet proposals. Our approach achieves state-of-the-art performance on several benchmarks, and also demonstrates competitive performance even with fully -supervised approaches. | - |
dc.language | English | - |
dc.publisher | Pergamon Press | - |
dc.title | Discriminative action tubelet detector for weakly-supervised action detection | - |
dc.type | Article | - |
dc.identifier.doi | 10.1016/j.patcog.2024.110704 | - |
dc.description.journalClass | 1 | - |
dc.identifier.bibliographicCitation | Pattern Recognition, v.155 | - |
dc.citation.title | Pattern Recognition | - |
dc.citation.volume | 155 | - |
dc.description.isOpenAccess | N | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
dc.identifier.wosid | 001261570900001 | - |
dc.identifier.scopusid | 2-s2.0-85196843107 | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Artificial Intelligence | - |
dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
dc.relation.journalResearchArea | Computer Science | - |
dc.relation.journalResearchArea | Engineering | - |
dc.type.docType | Article | - |
dc.subject.keywordAuthor | Weakly-supervised learning | - |
dc.subject.keywordAuthor | Spatiotemporal action detection | - |
dc.subject.keywordAuthor | Action proposal | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.