Probabilistic Prompt Learning for Dense Prediction
- Authors
- Kwon, Hyeongjun; Song, Taeyong; Jeong, Somi; Kim, Jin; Jang, Jinhyun; Sohn, Kwanghoon
- Issue Date
- 2023-06
- Publisher
- IEEE COMPUTER SOC
- Citation
- IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.6768 - 6777
- Abstract
- Recent progress in deterministic prompt learning has become a promising alternative to various downstream vision tasks, enabling models to learn powerful visual representations with the help of pre-trained vision-language models. However, this approach results in limited performance for dense prediction tasks that require handling more complex and diverse objects, since a single and deterministic description cannot sufficiently represent the entire image. In this paper, we present a novel probabilistic prompt learning to fully exploit the vision-language knowledge in dense prediction tasks. First, we introduce learnable class-agnostic attribute prompts to describe universal attributes across the object class. The attributes are combined with class information and visual-context knowledge to define the class-specific textual distribution. Text representations are sampled and used to guide the dense prediction task using the probabilistic pixel-text matching loss, enhancing the stability and generalization capability of the proposed method. Extensive experiments on different dense prediction tasks and ablation studies demonstrate the effectiveness of our proposed method.
- ISSN
- 1063-6919
- URI
- https://pubs.kist.re.kr/handle/201004/76430
- DOI
- 10.1109/CVPR52729.2023.00654
- Appears in Collections:
- KIST Conference Paper > 2023
- Files in This Item:
There are no files associated with this item.
- Export
- RIS (EndNote)
- XLS (Excel)
- XML
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.