Residential College | false |
Status | 即將出版Forthcoming |
Space-time Reinforcement Network for Video Object Segmentation | |
Chen, Yadang1; Zhu, Wentao1; Yang, Zhi Xin2; Wu, Enhua3 | |
2024 | |
Conference Name | 2024 IEEE International Conference on Multimedia and Expo, ICME 2024 |
Source Publication | Proceedings - IEEE International Conference on Multimedia and Expo |
Pages | 203042 |
Conference Date | 15 July 2024through 19 July 2024 |
Conference Place | Niagra Falls |
Publisher | IEEE Computer Society |
Abstract | Recently, video object segmentation (VOS) networks typically use memory-based methods: for each query frame, the mask is predicted by space-time matching to memory frames. Despite these methods having superior performance, they suffer from two issues: 1) Challenging data can destroy the space-time coherence between adjacent video frames. 2) Pixel-level matching will lead to undesired mismatching caused by the noises or distractors. To address the aforementioned issues, we first propose to generate an auxiliary frame between adjacent frames, serving as an implicit short-temporal reference for the query one. Next, we learn a prototype for each video object and prototype-level matching can be implemented between the query and memory. The experiment demonstrated that our network outperforms the state-of-the-art method on the DAVIS 2017, achieving a ℐ&ℱ score of 86.4%, and attains a competitive result 85.0% on YouTube VOS 2018. In addition, our network exhibits a high inference speed of 32+ FPS. |
Keyword | Auxiliary Frame Memory-based Methods Prototype Learning Video Object Segmentation |
DOI | 10.1109/ICME57554.2024.10687407 |
URL | View the original |
Language | 英語English |
Scopus ID | 2-s2.0-85206566288 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU) DEPARTMENT OF ELECTROMECHANICAL ENGINEERING |
Affiliation | 1.Nanjing University of Information Science and Technology, School of Computer Science, Nanjing, China 2.University of Macau, State Key Laboratory of Internet of Things for Smart City, Macao 3.Chinese Academy of Sciences, Key Laboratory of System Software, State Key Laboratory of Computer Science Institute of Software, Beijing, China |
Recommended Citation GB/T 7714 | Chen, Yadang,Zhu, Wentao,Yang, Zhi Xin,et al. Space-time Reinforcement Network for Video Object Segmentation[C]:IEEE Computer Society, 2024, 203042. |
APA | Chen, Yadang., Zhu, Wentao., Yang, Zhi Xin., & Wu, Enhua (2024). Space-time Reinforcement Network for Video Object Segmentation. Proceedings - IEEE International Conference on Multimedia and Expo, 203042. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment