Residential College | false |
Status | 已發表Published |
Weakly supervised semantic segmentation via saliency perception with uncertainty-guided noise suppression | |
Liu, Xinyi1; Huang, Guoheng1; Yuan, Xiaochen2; Zheng, Zewen1; Zhong, Guo3; Chen, Xuhang4; Pun, Chi Man5 | |
2024-07-26 | |
Source Publication | Visual Computer |
ISSN | 0178-2789 |
Abstract | Weakly Supervised Semantic Segmentation (WSSS) has become increasingly popular for achieving remarkable segmentation with only image-level labels. Current WSSS approaches extract Class Activation Mapping (CAM) from classification models to produce pseudo-masks for segmentation supervision. However, due to the gap between image-level supervised classification loss and pixel-level CAM generation tasks, the model tends to activate discriminative regions at the image level rather than pursuing pixel-level classification results. Moreover, insufficient supervision leads to unrestricted attention diffusion in the model, further introducing inter-class recognition noise. In this paper, we introduce a framework that employs Saliency Perception and Uncertainty, which includes a Saliency Perception Module (SPM) with Pixel-wise Transfer Loss (SP-PT), and an Uncertainty-guided Noise Suppression method. Specifically, within the SPM, we employ a hybrid attention mechanism to expand the receptive field of the module and enhance its ability to perceive salient object features. Meanwhile, a Pixel-wise Transfer Loss is designed to guide the attention diffusion of the classification model to non-discriminative regions at the pixel-level, thereby mitigating the bias of the model. To further enhance the robustness of CAM for obtaining more accurate pseudo-masks, we propose a noise suppression method based on uncertainty estimation, which applies a confidence matrix to the loss function to suppress the propagation of erroneous information and correct it, thus making the model more robust to noise. We conducted experiments on the PASCAL VOC 2012 and MS COCO 2014, and the experimental results demonstrate the effectiveness of our proposed framework. Code is available at https://github.com/pur-suit/SPU. |
Keyword | Attention Mechanism Class Activation Mapping Uncertainty Estimation Weakly Supervised Semantic Segmentation |
DOI | 10.1007/s00371-024-03574-1 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Computer Science |
WOS Subject | Computer Science, Software Engineering |
WOS ID | WOS:001276948900003 |
Publisher | SPRINGER, ONE NEW YORK PLAZA, SUITE 4600 , NEW YORK, NY 10004, UNITED STATES |
Scopus ID | 2-s2.0-85199668192 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | Faculty of Science and Technology DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Corresponding Author | Huang, Guoheng; Yuan, Xiaochen |
Affiliation | 1.Guangdong University of Technology, Guangzhou, China 2.Macao Polytechnic University, Macao 3.Guangdong University of Foreign Studies, Guangzhou, China 4.Huizhou University, Huizhou, China 5.University of Macau, Macao |
Recommended Citation GB/T 7714 | Liu, Xinyi,Huang, Guoheng,Yuan, Xiaochen,et al. Weakly supervised semantic segmentation via saliency perception with uncertainty-guided noise suppression[J]. Visual Computer, 2024. |
APA | Liu, Xinyi., Huang, Guoheng., Yuan, Xiaochen., Zheng, Zewen., Zhong, Guo., Chen, Xuhang., & Pun, Chi Man (2024). Weakly supervised semantic segmentation via saliency perception with uncertainty-guided noise suppression. Visual Computer. |
MLA | Liu, Xinyi,et al."Weakly supervised semantic segmentation via saliency perception with uncertainty-guided noise suppression".Visual Computer (2024). |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment