Full metadata record

DC Field | Value | Language
dc.contributor.author | Lee, Sanghoon | -
dc.contributor.author | Oh, Youngmin | -
dc.contributor.author | Baek, Donghyeon | -
dc.contributor.author | Lee, Junghyup | -
dc.contributor.author | Ham, Bumsub | -
dc.date.accessioned | 2024-02-07T05:16:01Z | -
dc.date.available | 2024-02-07T05:16:01Z | -
dc.date.created | 2023-01-30 | -
dc.date.issued | 2022-10 | -
dc.identifier.issn | 0302-9743 | -
dc.identifier.uri | https://pubs.kist.re.kr/handle/201004/148561 | -
dc.description.abstract | We address the task of person search, that is, localizing and re-identifying query persons from a set of raw scene images. Recent approaches are typically built upon OIMNet, a pioneer work on person search, that learns joint person representations for performing both detection and person re-identification (reID) tasks. To obtain the representations, they extract features from pedestrian proposals, and then project them on a unit hypersphere with L2 normalization. These methods also incorporate all positive proposals, that sufficiently overlap with the ground truth, equally to learn person representations for reID. We have found that 1) the L2 normalization without considering feature distributions degenerates the discriminative power of person representations, and 2) positive proposals often also depict background clutter and person overlaps, which could encode noisy features to person representations. In this paper, we introduce OIMNet++ that addresses the aforementioned limitations. To this end, we introduce a novel normalization layer, dubbed ProtoNorm, that calibrates features from pedestrian proposals, while considering a long-tail distribution of person IDs, enabling L2 normalized person representations to be discriminative. We also propose a localization-aware feature learning scheme that encourages better-aligned proposals to contribute more in learning discriminative representations. Experimental results and analysis on standard person search benchmarks demonstrate the effectiveness of OIMNet++. | -
dc.language | English | -
dc.publisher | SPRINGER INTERNATIONAL PUBLISHING AG | -
dc.title | OIMNet++: Prototypical Normalization and Localization-Aware Learning for Person Search | -
dc.type | Conference | -
dc.identifier.doi | 10.1007/978-3-031-20080-9_36 | -
dc.description.journalClass | 1 | -
dc.identifier.bibliographicCitation | 17th European Conference on Computer Vision (ECCV), pp.621 - 637 | -
dc.citation.title | 17th European Conference on Computer Vision (ECCV) | -
dc.citation.startPage | 621 | -
dc.citation.endPage | 637 | -
dc.citation.conferencePlace | SZ | -
dc.citation.conferencePlace | Tel Aviv, ISRAEL | -
dc.citation.conferenceDate | 2022-10-23 | -
dc.relation.isPartOf | COMPUTER VISION, ECCV 2022, PT X | -
dc.identifier.wosid | 000897089200036 | -
dc.identifier.scopusid | 2-s2.0-85144543335 | -
Appears in Collections:
KIST Conference Paper > 2022
Files in This Item:
There are no files associated with this item.

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
