Space-time Reinforcement Network for Video Object Segmentation

doi:10.1109/ICME57554.2024.10687407

UM > THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU)

Residential College	false
Status	即將出版Forthcoming
	Space-time Reinforcement Network for Video Object Segmentation
	Chen, Yadang 1; Zhu, Wentao 1; Yang, Zhi Xin2 ; Wu, Enhua 3
	2024
Conference Name	2024 IEEE International Conference on Multimedia and Expo, ICME 2024
Source Publication	Proceedings - IEEE International Conference on Multimedia and Expo
Pages	203042
Conference Date	15 July 2024through 19 July 2024
Conference Place	Niagra Falls
Publisher	IEEE Computer Society
Abstract	Recently, video object segmentation (VOS) networks typically use memory-based methods: for each query frame, the mask is predicted by space-time matching to memory frames. Despite these methods having superior performance, they suffer from two issues: 1) Challenging data can destroy the space-time coherence between adjacent video frames. 2) Pixel-level matching will lead to undesired mismatching caused by the noises or distractors. To address the aforementioned issues, we first propose to generate an auxiliary frame between adjacent frames, serving as an implicit short-temporal reference for the query one. Next, we learn a prototype for each video object and prototype-level matching can be implemented between the query and memory. The experiment demonstrated that our network outperforms the state-of-the-art method on the DAVIS 2017, achieving a ℐ&ℱ score of 86.4%, and attains a competitive result 85.0% on YouTube VOS 2018. In addition, our network exhibits a high inference speed of 32+ FPS.
Keyword	Auxiliary Frame Memory-based Methods Prototype Learning Video Object Segmentation
DOI	10.1109/ICME57554.2024.10687407
URL	View the original
Language	英語English
Scopus ID	2-s2.0-85206566288
Fulltext Access	View Full-Text via DOI View Full-Text via Scopus
Citation statistics
Document Type	Conference paper
Collection	THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU) DEPARTMENT OF ELECTROMECHANICAL ENGINEERING
Affiliation	1.Nanjing University of Information Science and Technology, School of Computer Science, Nanjing, China 2.University of Macau, State Key Laboratory of Internet of Things for Smart City, Macao 3.Chinese Academy of Sciences, Key Laboratory of System Software, State Key Laboratory of Computer Science Institute of Software, Beijing, China
Recommended Citation GB/T 7714	Chen, Yadang,Zhu, Wentao,Yang, Zhi Xin,et al. Space-time Reinforcement Network for Video Object Segmentation[C]:IEEE Computer Society, 2024, 203042.
APA	Chen, Yadang., Zhu, Wentao., Yang, Zhi Xin., & Wu, Enhua (2024). Space-time Reinforcement Network for Video Object Segmentation. Proceedings - IEEE International Conference on Multimedia and Expo, 203042.

Files in This Item:
There are no files associated with this item.

If you have any objections to this item, please fill out the form below and the administrator will contact you as soon as possible.
Content:
Email：	*
Affiliation No.
Verification Code:	Refresh

Any comments and suggestions are welcomed.
Title:	*
Content:
Email：	*
Verification Code:	Refresh