Residential College | false |
Status | 已發表Published |
Explicit Visual Prompting for Low-Level Structure Segmentations | |
Weihuang Liu1![]() ![]() ![]() ![]() | |
2023-06 | |
Conference Name | 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) |
Source Publication | Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
![]() |
Volume | 2023-June |
Pages | 19434-19445 |
Conference Date | JUN 17-24, 2023 |
Conference Place | Vancouver, BC, Canada |
Country | Canada |
Publication Place | USA |
Publisher | IEEE Computer Society |
Abstract | We consider the generic problem of detecting low-level structures in images, which includes segmenting the manipulated parts, identifying out-of-focus pixels, separating shadow regions, and detecting concealed objects. Whereas each such topic has been typically addressed with a domainspecific solution, we show that a unified approach performs well across all of them. We take inspiration from the widelyused pre-training and then prompt tuning protocols in NLP and propose a new visual prompting model, named Explicit Visual Prompting (EVP). Different from the previous visual prompting which is typically a dataset-level implicit embedding, our key insight is to enforce the tunable parameters focusing on the explicit visual content from each individual image, i.e., the features from frozen patch embeddings and the input’s high-frequency components. The proposed EVP significantly outperforms other parameter-efficient tuning protocols under the same amount of tunable parameters (5.7% extra trainable parameters of each task). EVP also achieves state-of-the-art performances on diverse lowlevel structure segmentation tasks compared to task-specific solutions. Our code is available at: https://github.com/NiFangBaAGe/Explicit-Visual-Prompt. |
DOI | 10.1109/CVPR52729.2023.01862 |
URL | View the original |
Indexed By | CPCI-S |
Language | 英語English |
WOS Research Area | Computer Science |
WOS Subject | Computer Science, Artificial Intelligence |
WOS ID | WOS:001062531303071 |
Scopus ID | 2-s2.0-85165567435 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | Faculty of Science and Technology DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Corresponding Author | Chi-Man Pun; Xiaodong Cun |
Affiliation | 1.University of Macau 2.Tencent AI Lab |
First Author Affilication | University of Macau |
Corresponding Author Affilication | University of Macau |
Recommended Citation GB/T 7714 | Weihuang Liu,Xi Shen,Chi-Man Pun,et al. Explicit Visual Prompting for Low-Level Structure Segmentations[C], USA:IEEE Computer Society, 2023, 19434-19445. |
APA | Weihuang Liu., Xi Shen., Chi-Man Pun., & Xiaodong Cun (2023). Explicit Visual Prompting for Low-Level Structure Segmentations. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2023-June, 19434-19445. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment