Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Kwon, Hyeongjun | - |
dc.contributor.author | Song, Taeyong | - |
dc.contributor.author | Jeong, Somi | - |
dc.contributor.author | Kim, Jin | - |
dc.contributor.author | Jang, Jinhyun | - |
dc.contributor.author | Sohn, Kwanghoon | - |
dc.date.accessioned | 2024-01-12T02:46:02Z | - |
dc.date.available | 2024-01-12T02:46:02Z | - |
dc.date.created | 2023-11-17 | - |
dc.date.issued | 2023-06 | - |
dc.identifier.issn | 1063-6919 | - |
dc.identifier.uri | https://pubs.kist.re.kr/handle/201004/76430 | - |
dc.description.abstract | Recent progress in deterministic prompt learning has become a promising alternative to various downstream vision tasks, enabling models to learn powerful visual representations with the help of pre-trained vision-language models. However, this approach results in limited performance for dense prediction tasks that require handling more complex and diverse objects, since a single and deterministic description cannot sufficiently represent the entire image. In this paper, we present a novel probabilistic prompt learning to fully exploit the vision-language knowledge in dense prediction tasks. First, we introduce learnable class-agnostic attribute prompts to describe universal attributes across the object class. The attributes are combined with class information and visual-context knowledge to define the class-specific textual distribution. Text representations are sampled and used to guide the dense prediction task using the probabilistic pixel-text matching loss, enhancing the stability and generalization capability of the proposed method. Extensive experiments on different dense prediction tasks and ablation studies demonstrate the effectiveness of our proposed method. | - |
dc.language | English | - |
dc.publisher | IEEE COMPUTER SOC | - |
dc.title | Probabilistic Prompt Learning for Dense Prediction | - |
dc.type | Conference | - |
dc.identifier.doi | 10.1109/CVPR52729.2023.00654 | - |
dc.description.journalClass | 1 | - |
dc.identifier.bibliographicCitation | IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.6768 - 6777 | - |
dc.citation.title | IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) | - |
dc.citation.startPage | 6768 | - |
dc.citation.endPage | 6777 | - |
dc.citation.conferencePlace | US | - |
dc.citation.conferencePlace | Vancouver, CANADA | - |
dc.citation.conferenceDate | 2023-06-17 | - |
dc.relation.isPartOf | 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR | - |
dc.identifier.wosid | 001058542607012 | - |
dc.identifier.scopusid | 2-s2.0-85162130548 | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.