Weakly Supervised Visual Saliency Prediction

doi:10.1109/TIP.2022.3158064

UM > Faculty of Science and Technology

Residential College	false
Status	已發表Published
	Weakly Supervised Visual Saliency Prediction
	Qiuxia La 1; Zhou, Tianfei 2; Khan, Salman 3; Sun, Hanqiu 4; Shen, Jianbing 5; Shao, Ling 3
	2022-04-05
Source Publication	IEEE Transactions on Image Processing
ISSN	1057-7149
Volume	31 Pages:3111-3124
Abstract	The success of current deep saliency models heavily depends on large amounts of annotated human fixation data to fit the highly non-linear mapping between the stimuli and visual saliency. Such fully supervised data-driven approaches are annotation-intensive and often fail to consider the underlying mechanisms of visual attention. In contrast, in this paper, we introduce a model based on various cognitive theories of visual saliency, which learns visual attention patterns in a weakly supervised manner. Our approach incorporates insights from cognitive science as differentiable submodules, resulting in a unified, end-to-end trainable framework. Specifically, our model encapsulates the following important components motivated from biological vision. (a) As scene semantics are closely related to visually attentive regions, our model encodes discriminative spatial information for scene understanding through spatial visual semantics embedding. (b) To model the objectness factors in visual attention deployment, we incorporate object-level semantics embedding and object relation information. (c) Considering the 'winner-take-all' mechanism in visual stimuli processing, we model the competition mechanism among objects with softmax based neural attention. (d) Lastly, a conditional center prior is learned to mimic the spatial distribution bias of visual attention. Furthermore, we propose novel loss functions to utilize supervision cues from image-level semantics, saliency prior knowledge, and self-information compression. Experiments show that our method achieves promising results, and even outperforms many of its fully supervised counterparts. Overall, our weakly supervised saliency method makes an essential step towards reducing the annotation budget of current approaches, as well as providing a more comprehensive understanding of the visual attention mechanism. Our code is available at: https://github.com/ashleylqx/WeakFixation.git.
Keyword	Deep Learning Saliency Prediction Visual Attention Prediction Weakly Supervised Learning
DOI	10.1109/TIP.2022.3158064
URL	View the original
Indexed By	SCIE
Language	英語English
WOS Research Area	Computer Science ; Engineering
WOS Subject	Computer Science, Artificial Intelligence ; Engineering, Electrical & Electronic
WOS ID	WOS:000784189200001
Publisher	IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC, 445 HOES LANE, PISCATAWAY, NJ 08855-4141
Scopus ID	2-s2.0-85127805341
Fulltext Access	View Full-Text via DOI View Full-Text via Web of Science View Full-Text via Scopus
Citation statistics
Document Type	Journal article
Collection	Faculty of Science and Technology THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU) DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Corresponding Author	Zhou, Tianfei
Affiliation	1.State Key Laboratory of Media Convergence and Communication, Neuroscience and Intelligent Media Institute, Communication University of China (CUC), Beijing, 100024, China 2.Computer Vision Laboratory, Eth Zürich, Zürich, 8092, Switzerland 3.Inception Institute of Artificial Intelligence, Abu Dhabi, United Arab Emirates 4.School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, 610056, China 5.Department of Computer and Information Science, State Key Laboratory of Internet of Things for Smart City, University of Macau, Macao
Recommended Citation GB/T 7714	Qiuxia La,Zhou, Tianfei,Khan, Salman,et al. Weakly Supervised Visual Saliency Prediction[J]. IEEE Transactions on Image Processing, 2022, 31, 3111-3124.
APA	Qiuxia La., Zhou, Tianfei., Khan, Salman., Sun, Hanqiu., Shen, Jianbing., & Shao, Ling (2022). Weakly Supervised Visual Saliency Prediction. IEEE Transactions on Image Processing, 31, 3111-3124.
MLA	Qiuxia La,et al."Weakly Supervised Visual Saliency Prediction".IEEE Transactions on Image Processing 31(2022):3111-3124.

Files in This Item:
There are no files associated with this item.

If you have any objections to this item, please fill out the form below and the administrator will contact you as soon as possible.
Content:
Email：	*
Affiliation No.
Verification Code:	Refresh

Any comments and suggestions are welcomed.
Title:	*
Content:
Email：	*
Verification Code:	Refresh