DSpace at KIST: CLIPping Imbalances: A Novel Evaluation Baseline and PEARL Dataset for Pedestrian Attribute Recognition

Browse

DSpace at KISTKIST Conference Paper Others

Full metadata record

DC Field	Value	Language
dc.contributor.author	Thakare, Kamalakar Vijay	-
dc.contributor.author	Lohani, Lalit	-
dc.contributor.author	Nayak, Kamakshya Prasad	-
dc.contributor.author	Dogra, Debi Prosad	-
dc.contributor.author	Choi, Heeseung	-
dc.contributor.author	Jung, Hyungjoo	-
dc.contributor.author	Kim, Ig-Jae	-
dc.date.accessioned	2025-09-24T00:30:30Z	-
dc.date.available	2025-09-24T00:30:30Z	-
dc.date.created	2025-09-17	-
dc.date.issued	2025-02	-
dc.identifier.issn	2472-6737	-
dc.identifier.uri	https://pubs.kist.re.kr/handle/201004/153231	-
dc.description.abstract	Pedestrian Attribute Recognition (PAR) serves as a fundamental task in computer vision and is crucial for upgrading security systems. It helps in precisely identifying and characterizing various attributes of pedestrians. However, current PAR datasets have certain issues in representing a wide range of attributes correctly, which makes the existing PAR methods less effective in real-world scenarios. Addressing this limitation, this paper introduces PEARL, a comprehensive dataset comprising of diverse pedestrian images annotated with 146 attributes. These samples have been sourced from surveillance videos across twelve countries. This paper also formulates an image-based PAR using language-image fusion strategy and utilizes CLIP as a new evaluation baseline. Specifically, we leverage textual information by transforming sets of attributes into meaningful sentences. Addressing the inherent data imbalance in PAR, we provide three types of prompt settings to optimize the training of the CLIP model. Our evaluation encompasses a thorough assessment of the proposed baseline model across various datasets, including PEARL dataset as well as established PAR benchmarks such as PA100K, RAP, and PETA.	-
dc.language	English	-
dc.publisher	IEEE COMPUTER SOC	-
dc.title	CLIPping Imbalances: A Novel Evaluation Baseline and PEARL Dataset for Pedestrian Attribute Recognition	-
dc.type	Conference	-
dc.identifier.doi	10.1109/WACV61041.2025.00690	-
dc.description.journalClass	1	-
dc.identifier.bibliographicCitation	2025 Winter Conference on Applications of Computer Vision-WACV, pp.7102 - 7111	-
dc.citation.title	2025 Winter Conference on Applications of Computer Vision-WACV	-
dc.citation.startPage	7102	-
dc.citation.endPage	7111	-
dc.citation.conferencePlace	US	-
dc.citation.conferencePlace	Tucson, AZ	-
dc.citation.conferenceDate	2025-02-26	-
dc.relation.isPartOf	2025 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV)	-
dc.identifier.wosid	001521272600201	-
dc.identifier.scopusid	2-s2.0-105003624477	-

Appears in Collections:: KIST Conference Paper > Others

Export: RIS (EndNote); XLS (Excel); XML

Show Simple Item Record

KIST Library Institutional Repository

Browse

BROWSE