Full metadata record

DC Field Value Language
dc.contributor.authorThakare, Kamalakar Vijay-
dc.contributor.authorLohani, Lalit-
dc.contributor.authorNayak, Kamakshya Prasad-
dc.contributor.authorDogra, Debi Prosad-
dc.contributor.authorChoi, Heeseung-
dc.contributor.authorJung, Hyungjoo-
dc.contributor.authorKim, Ig-Jae-
dc.date.accessioned2025-09-24T00:30:30Z-
dc.date.available2025-09-24T00:30:30Z-
dc.date.created2025-09-17-
dc.date.issued2025-02-
dc.identifier.issn2472-6737-
dc.identifier.urihttps://pubs.kist.re.kr/handle/201004/153231-
dc.description.abstractPedestrian Attribute Recognition (PAR) serves as a fundamental task in computer vision and is crucial for upgrading security systems. It helps in precisely identifying and characterizing various attributes of pedestrians. However, current PAR datasets have certain issues in representing a wide range of attributes correctly, which makes the existing PAR methods less effective in real-world scenarios. Addressing this limitation, this paper introduces PEARL, a comprehensive dataset comprising of diverse pedestrian images annotated with 146 attributes. These samples have been sourced from surveillance videos across twelve countries. This paper also formulates an image-based PAR using language-image fusion strategy and utilizes CLIP as a new evaluation baseline. Specifically, we leverage textual information by transforming sets of attributes into meaningful sentences. Addressing the inherent data imbalance in PAR, we provide three types of prompt settings to optimize the training of the CLIP model. Our evaluation encompasses a thorough assessment of the proposed baseline model across various datasets, including PEARL dataset as well as established PAR benchmarks such as PA100K, RAP, and PETA.-
dc.languageEnglish-
dc.publisherIEEE COMPUTER SOC-
dc.titleCLIPping Imbalances: A Novel Evaluation Baseline and PEARL Dataset for Pedestrian Attribute Recognition-
dc.typeConference-
dc.identifier.doi10.1109/WACV61041.2025.00690-
dc.description.journalClass1-
dc.identifier.bibliographicCitation2025 Winter Conference on Applications of Computer Vision-WACV, pp.7102 - 7111-
dc.citation.title2025 Winter Conference on Applications of Computer Vision-WACV-
dc.citation.startPage7102-
dc.citation.endPage7111-
dc.citation.conferencePlaceUS-
dc.citation.conferencePlaceTucson, AZ-
dc.citation.conferenceDate2025-02-28-
dc.relation.isPartOf2025 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV)-
dc.identifier.wosid001521272600201-
dc.identifier.scopusid2-s2.0-105003624477-
Appears in Collections:
KIST Conference Paper > Others
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE