Residential College | false |
Status | 已發表Published |
A Transformer-based Adaptive Prototype Matching Network for Few-Shot Semantic Segmentation | |
Chen, Sihan1; Chen, Yadang1; Zheng, Yuhui2; Yang, Zhi Xin3; Wu, Enhua4 | |
2024 | |
Conference Name | 33rd International Joint Conference on Artificial Intelligence, IJCAI 2024 |
Source Publication | Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence |
Pages | 659-667 |
Conference Date | 3-9 August 2024 |
Conference Place | Jeju, South Korea |
Country | Republic of Korea |
Publisher | International Joint Conferences on Artificial Intelligence |
Abstract | Few-shot semantic segmentation (FSS) aims to generate a model for segmenting novel classes using a limited number of annotated samples. Previous FSS methods have shown sensitivity to background noise due to inherent bias, attention bias, and spatial-aware bias. In this study, we propose a Transformer-Based Adaptive Prototype Matching Network to establish robust matching relationships by improving the semantic and spatial perception of query features. The model includes three modules: target enhancement module (TEM), dual constraint aggregation module (DCAM), and dual classification module (DCM). In particular, TEM mitigates inherent bias by exploring the relevance of multi-scale local context to enhance foreground features. Then, DCAM addresses attention bias through the dual semantic-aware attention mechanism to strengthen constraints. Finally, the DCM module decouples the segmentation task into semantic alignment and spatial alignment to alleviate spatial-aware bias. Extensive experiments on PASCAL-5i and COCO-20i confirm the effectiveness of our approach. |
Keyword | Computer Vision Segmentation Representation Learning Scene Analysis And understAnding Transfer, Low-shot, Semi- And Un- Supervised Learning |
DOI | 10.24963/ijcai.2024/73 |
URL | View the original |
Language | 英語English |
Scopus ID | 2-s2.0-85204284881 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU) DEPARTMENT OF ELECTROMECHANICAL ENGINEERING |
Corresponding Author | Zheng, Yuhui |
Affiliation | 1.School of Computer Science, Nanjing University of Information Science and Technology, China 2.College of Computer, Qinghai Normal University, China 3.State Key Laboratory of Internet of Things for Smart City, University of Macau, Macao 4.Key Laboratory of System Software, State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences, China |
Recommended Citation GB/T 7714 | Chen, Sihan,Chen, Yadang,Zheng, Yuhui,et al. A Transformer-based Adaptive Prototype Matching Network for Few-Shot Semantic Segmentation[C]:International Joint Conferences on Artificial Intelligence, 2024, 659-667. |
APA | Chen, Sihan., Chen, Yadang., Zheng, Yuhui., Yang, Zhi Xin., & Wu, Enhua (2024). A Transformer-based Adaptive Prototype Matching Network for Few-Shot Semantic Segmentation. Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 659-667. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment