SCNet: Scale-aware coupling-structure network for efficient video object detection

doi:10.1016/j.neucom.2020.03.110

Residential College	false
Status	Published
Title	SCNet: Scale-aware coupling-structure network for efficient video object detection
Author	Wang, Fengchao 1; Xu, Zhewei 1; Gan, Yanfen 2; Vong, Chi Man 3; Liu, Qiong 1
Date	2020-09
Source Publication	NEUROCOMPUTING
ISSN	0925-2312
Volume	404
Pages	283-293
Abstract	In recent years, outstanding image-based object detectors have been extended to video object detection. Such an extension must address two challenging problems in videos, namely scale variation and deformation anomaly. Most image-based works deal with scale variation by resizing the input image for multi-scale training and testing. This approach incurs a very high computational cost, which becomes even higher in videos. For deformation anomaly, low-quality RoI features can be caused by motion blur, video defocus, and rare poses, resulting in poor detection accuracy in videos. In this paper, an end-to-end scale-aware coupling-structure network (SCNet) focusing on these two issues is proposed for video object detection with high accuracy and affordable computation. In SCNet, a lightweight scale-aware module, consisting mainly of a set of dilated convolutional layers with parameter constraint, is built to flexibly model object scale variation. Furthermore, a coupling-structure RoI (region of interest) module is designed to extract robust RoI features with position-sensitive and context-sensitive information for accuracy improvement. In addition, the feature aggregation strategy is simplified for efficiency. Experiments are conducted on the ImageNet VID dataset. SCNet achieves state-of-the-art detection performance of 79.5% mAP, a 5.9-point improvement over the strong single-frame baseline.
Keyword	Coupling-structure RoI Module; Deformation Anomaly; Scale Variation; Scale-aware Module; Video Object Detection
DOI	10.1016/j.neucom.2020.03.110
Indexed By	SCIE
Language	English
WOS Research Area	Computer Science
WOS Subject	Computer Science, Artificial Intelligence
WOS ID	WOS:000542634100010
Publisher	ELSEVIER, RADARWEG 29, 1043 NX AMSTERDAM, NETHERLANDS
Scopus ID	2-s2.0-85084942233
Document Type	Journal article
Collection	Faculty of Science and Technology DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Corresponding Author	Liu,Qiong
Affiliation	1. South China University of Technology, Guangzhou, 510006, China; 2. South China Business College, Guangdong University of Foreign Studies, Guangzhou, 510545, China; 3. University of Macau, Macau, 999078, China
Recommended Citation GB/T 7714	Wang, Fengchao, Xu, Zhewei, Gan, Yanfen, et al. SCNet: Scale-aware coupling-structure network for efficient video object detection[J]. NEUROCOMPUTING, 2020, 404: 283-293.
APA	Wang, Fengchao, Xu, Zhewei, Gan, Yanfen, Vong, Chi Man, & Liu, Qiong (2020). SCNet: Scale-aware coupling-structure network for efficient video object detection. NEUROCOMPUTING, 404, 283-293.
MLA	Wang, Fengchao, et al. "SCNet: Scale-aware coupling-structure network for efficient video object detection." NEUROCOMPUTING 404 (2020): 283-293.
