Residential College | false |
Status | 已發表Published |
Boosting Video Object Segmentation via Robust and Efficient Memory Network | |
Chen, Yadang1; Zhang, Dingwei1; Zheng, Yuhui1; Yang, Zhi Xin2; Wu, Enhua3; Zhao, Haixing4 | |
2024-05 | |
Source Publication | IEEE Transactions on Circuits and Systems for Video Technology |
ISSN | 1051-8215 |
Volume | 34Issue:5Pages:3340-3352 |
Abstract | Recently, memory-based methods have exhibited remarkable performance in Video Object Segmentation (VOS) by employing non-local pixel-wise matching between the query and memory. Nevertheless, these methods suffer from two limitations: 1) Non-local pixel-wise matching can result in the incorrect segmentation of background distractor objects, and 2) memory features with substantial temporal redundancy consume significant computing resources and reduce the inference speed. To address the limitations, we first propose a local attention mechanism to suppress background features, and we introduce a novel training framework based on contrast learning to ensure the network learns reliable and robust pixel-wise correspondence between query and memory. We adaptively determine whether to update the memory based on the variation of foreground objects. Next, we propose a dynamic memory bank, which utilizes a lightweight and differentiable soft modulation gate to determine the number of memory features to remove along the temporal dimension. This allows efficient and flexible management of memory features. Our network achieves competitive results (e.g., 92.1% on DAVIS 2016 val, 87.6%/81.3% on DAVIS 2017 val/test, 87.0% on YouTube-VOS 2018 val) compared with the state-of-the-art methods while maintaining a faster inference speed of 25+FPS. Moreover, our network demonstrates a favorable balance between performance and speed when dealing with the long-time video dataset. |
Keyword | Object Segmentation Robustness Redundancy Semi-supervised Learning |
DOI | 10.1109/TCSVT.2023.3321977 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Engineering |
WOS Subject | Engineering, Electrical & Electronic |
WOS ID | WOS:001221132000077 |
Publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC, 445 HOES LANE, PISCATAWAY, NJ 08855-4141 |
Scopus ID | 2-s2.0-85174811833 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | Faculty of Science and Technology THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU) DEPARTMENT OF ELECTROMECHANICAL ENGINEERING |
Corresponding Author | Zheng, Yuhui |
Affiliation | 1.School of Computer Science, Nanjing University of Information Science and Technology, Nanjing, China 2.Department of Electromechanical Engineering, State Key Laboratory of Internet of Things for Smart City, University of Macau, Macau, China 3.Institute of Software, State Key Laboratory of Computer Science, University of Chinese Academy of Sciences, Beijing, China 4.Key Laboratory of Tibetan Information Processing, the College of Computer, Qinghai Normal University, Xining, China |
Recommended Citation GB/T 7714 | Chen, Yadang,Zhang, Dingwei,Zheng, Yuhui,et al. Boosting Video Object Segmentation via Robust and Efficient Memory Network[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2024, 34(5), 3340-3352. |
APA | Chen, Yadang., Zhang, Dingwei., Zheng, Yuhui., Yang, Zhi Xin., Wu, Enhua., & Zhao, Haixing (2024). Boosting Video Object Segmentation via Robust and Efficient Memory Network. IEEE Transactions on Circuits and Systems for Video Technology, 34(5), 3340-3352. |
MLA | Chen, Yadang,et al."Boosting Video Object Segmentation via Robust and Efficient Memory Network".IEEE Transactions on Circuits and Systems for Video Technology 34.5(2024):3340-3352. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment