Probabilistic Prompt Learning for Dense Prediction

Authors
Kwon, HyeongjunSong, TaeyongJeong, SomiKim, JinJang, JinhyunSohn, Kwanghoon
Issue Date
2023-06
Publisher
IEEE COMPUTER SOC
Citation
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.6768 - 6777
Abstract
Recent progress in deterministic prompt learning has become a promising alternative to various downstream vision tasks, enabling models to learn powerful visual representations with the help of pre-trained vision-language models. However, this approach results in limited performance for dense prediction tasks that require handling more complex and diverse objects, since a single and deterministic description cannot sufficiently represent the entire image. In this paper, we present a novel probabilistic prompt learning to fully exploit the vision-language knowledge in dense prediction tasks. First, we introduce learnable class-agnostic attribute prompts to describe universal attributes across the object class. The attributes are combined with class information and visual-context knowledge to define the class-specific textual distribution. Text representations are sampled and used to guide the dense prediction task using the probabilistic pixel-text matching loss, enhancing the stability and generalization capability of the proposed method. Extensive experiments on different dense prediction tasks and ablation studies demonstrate the effectiveness of our proposed method.
ISSN
1063-6919
URI
https://pubs.kist.re.kr/handle/201004/76430
DOI
10.1109/CVPR52729.2023.00654
Appears in Collections:
KIST Conference Paper > 2023
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE