DSpace at KIST: Discriminative action tubelet detector for weakly-supervised action detection

Browse

DSpace at KISTKIST Article 2024

Full metadata record

DC Field	Value	Language
dc.contributor.author	Lee, Jiyoung	-
dc.contributor.author	Kim, Seungryong	-
dc.contributor.author	Kim, Sunok	-
dc.contributor.author	Sohn, Kwanghoon	-
dc.date.accessioned	2024-07-18T05:00:06Z	-
dc.date.available	2024-07-18T05:00:06Z	-
dc.date.created	2024-07-18	-
dc.date.issued	2024-11	-
dc.identifier.issn	0031-3203	-
dc.identifier.uri	https://pubs.kist.re.kr/handle/201004/150243	-
dc.description.abstract	We propose a novel framework for spatiotemporal action detection using only video -level class labels as weak supervision. Traditional fully -supervised approaches rely on labor-intensive manual annotation of bounding boxes for each frame. In contrast, collecting video -level class labels is significantly less tedious and more feasible compared to annotating frame -level sequences with bounding boxes. To address this challenge, we propose a discriminative action tubelet detector, called DAT-detector, designed to discern discriminative tubelets from action tubelet proposals (ATPs). Whereas the previous approaches have only focused on tubelet selection among the predefined object proposals, our DAT-detector prioritizes the generation of more precise action tubelets using regression and attention modules. Moreover, we introduce an ATP generation method that enhances the quality of tubelet proposals. Our approach achieves state-of-the-art performance on several benchmarks, and also demonstrates competitive performance even with fully -supervised approaches.	-
dc.language	English	-
dc.publisher	Pergamon Press	-
dc.title	Discriminative action tubelet detector for weakly-supervised action detection	-
dc.type	Article	-
dc.identifier.doi	10.1016/j.patcog.2024.110704	-
dc.description.journalClass	1	-
dc.identifier.bibliographicCitation	Pattern Recognition, v.155	-
dc.citation.title	Pattern Recognition	-
dc.citation.volume	155	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.identifier.wosid	001261570900001	-
dc.identifier.scopusid	2-s2.0-85196843107	-
dc.relation.journalWebOfScienceCategory	Computer Science, Artificial Intelligence	-
dc.relation.journalWebOfScienceCategory	Engineering, Electrical & Electronic	-
dc.relation.journalResearchArea	Computer Science	-
dc.relation.journalResearchArea	Engineering	-
dc.type.docType	Article	-
dc.subject.keywordAuthor	Weakly-supervised learning	-
dc.subject.keywordAuthor	Spatiotemporal action detection	-
dc.subject.keywordAuthor	Action proposal	-

Appears in Collections:: KIST Article > 2024

Export: RIS (EndNote); XLS (Excel); XML

Show Simple Item Record

KIST Library Institutional Repository

Browse

BROWSE