Residential College | false |
Status | 已發表Published |
Spatial constraint for efficient semi-supervised video object segmentation | |
Chen, Yadang1,2; Ji, Chuanjun1,2; Yang, Zhi Xin3; Wu, Enhua4 | |
2023-12-01 | |
Source Publication | Computer Vision and Image Understanding |
ISSN | 1077-3142 |
Volume | 237Pages:103843 |
Abstract | Semi-supervised video object segmentation is the process of tracking and segmenting objects in a video sequence based on annotated masks for one or more frames. Recently, memory-based methods have attracted a significant amount of attention due to their strong performance. Having too much redundant information stored in memory, however, makes such methods inefficient and inaccurate. Moreover, a global matching strategy is usually used for memory reading, so these methods are susceptible to interference from semantically similar objects and are prone to incorrect segmentation. We propose a spatial constraint network to overcome these problems. In particular, we introduce a time-varying sensor and a dynamic feature memory to adaptively store pixel information to facilitate the modeling of the target object, which greatly reduces information redundancy in the memory without missing critical information. Furthermore, we propose an efficient memory reader that is less computationally intensive and has a smaller footprint. More importantly, we introduce a spatial constraint module to learn spatial consistency to obtain more precise segmentation; the target and distractors can be identified by the learned spatial response. The experimental results indicate that our method is competitive with state-of-the-art methods on several benchmark datasets. Our method also achieves an approximately 30 FPS inference speed, which is close to the requirement for real-time systems. |
Keyword | Memory-based Methods Redundant Information Semantically Similar Objects Video Object Segmentation |
DOI | 10.1016/j.cviu.2023.103843 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Computer Science ; Engineering |
WOS Subject | Computer Science, Artificial Intelligence ; Engineering, Electrical & Electronic |
WOS ID | WOS:001089100900001 |
Publisher | ACADEMIC PRESS INC ELSEVIER SCIENCE, 525 B ST, STE 1900, SAN DIEGO, CA 92101-4495 |
Scopus ID | 2-s2.0-85173157016 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | DEPARTMENT OF ELECTROMECHANICAL ENGINEERING Faculty of Science and Technology THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU) |
Corresponding Author | Ji, Chuanjun |
Affiliation | 1.Engineering Research Center of Digital Forensics, Ministry of Education, Nanjing University of Information Science and Technology, 210044, China 2.School of Computer and Software, Nanjing University of Information Science and Technology, Nanjing, 210044, China 3.The State Key Laboratory of Internet of Things for Smart City and Department of Electromechanical Engineering, University of Macau, 999078, China 4.The State Key Laboratory of Computer Science, Institute of Software, University of Chinese Academy of Sciences, Beijing, 100190, China |
Recommended Citation GB/T 7714 | Chen, Yadang,Ji, Chuanjun,Yang, Zhi Xin,et al. Spatial constraint for efficient semi-supervised video object segmentation[J]. Computer Vision and Image Understanding, 2023, 237, 103843. |
APA | Chen, Yadang., Ji, Chuanjun., Yang, Zhi Xin., & Wu, Enhua (2023). Spatial constraint for efficient semi-supervised video object segmentation. Computer Vision and Image Understanding, 237, 103843. |
MLA | Chen, Yadang,et al."Spatial constraint for efficient semi-supervised video object segmentation".Computer Vision and Image Understanding 237(2023):103843. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment