Full metadata record

DC Field Value Language
dc.contributor.authorKwon, Hyeongjun-
dc.contributor.authorSong, Taeyong-
dc.contributor.authorJeong, Somi-
dc.contributor.authorKim, Jin-
dc.contributor.authorJang, Jinhyun-
dc.contributor.authorSohn, Kwanghoon-
dc.date.accessioned2024-01-12T02:46:02Z-
dc.date.available2024-01-12T02:46:02Z-
dc.date.created2023-11-17-
dc.date.issued2023-06-
dc.identifier.issn1063-6919-
dc.identifier.urihttps://pubs.kist.re.kr/handle/201004/76430-
dc.description.abstractRecent progress in deterministic prompt learning has become a promising alternative to various downstream vision tasks, enabling models to learn powerful visual representations with the help of pre-trained vision-language models. However, this approach results in limited performance for dense prediction tasks that require handling more complex and diverse objects, since a single and deterministic description cannot sufficiently represent the entire image. In this paper, we present a novel probabilistic prompt learning to fully exploit the vision-language knowledge in dense prediction tasks. First, we introduce learnable class-agnostic attribute prompts to describe universal attributes across the object class. The attributes are combined with class information and visual-context knowledge to define the class-specific textual distribution. Text representations are sampled and used to guide the dense prediction task using the probabilistic pixel-text matching loss, enhancing the stability and generalization capability of the proposed method. Extensive experiments on different dense prediction tasks and ablation studies demonstrate the effectiveness of our proposed method.-
dc.languageEnglish-
dc.publisherIEEE COMPUTER SOC-
dc.titleProbabilistic Prompt Learning for Dense Prediction-
dc.typeConference-
dc.identifier.doi10.1109/CVPR52729.2023.00654-
dc.description.journalClass1-
dc.identifier.bibliographicCitationIEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.6768 - 6777-
dc.citation.titleIEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)-
dc.citation.startPage6768-
dc.citation.endPage6777-
dc.citation.conferencePlaceUS-
dc.citation.conferencePlaceVancouver, CANADA-
dc.citation.conferenceDate2023-06-17-
dc.relation.isPartOf2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR-
dc.identifier.wosid001058542607012-
dc.identifier.scopusid2-s2.0-85162130548-
Appears in Collections:
KIST Conference Paper > 2023
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE