Residential College | false |
Status | 已發表Published |
SCNet: Scale-aware coupling-structure network for efficient video object detection | |
Wang,Fengchao1; Xu,Zhewei1; Gan,Yanfen2; Vong,Chi Man3; Liu,Qiong1 | |
2020-09 | |
Source Publication | NEUROCOMPUTING |
ISSN | 0925-2312 |
Volume | 404Pages:283-293 |
Abstract | In recent years, outstanding image-based object detectors were extended for video object detection. Such extension requires to address two challenging problems in videos, namely, scale variation and deformation anomaly. Most image-based works deal with scale variation by resizing the input image for multi-scale training and testing. Such way incurs very high computational cost, which becomes even higher in videos. For deformation anomaly, low-quality RoI feature can be caused by motion blur, video defocus and rare poses, resulting to poor detection accuracy in videos. In this paper, an end-to-end scale-aware coupling-structure network (SCNet) focusing on the two issues is proposed for video object detection with high accuracy and affordable computation. In SCNet, a lightweight scale-aware module is structured to flexibly model the object scale variation, which mainly consists of a set of dilated convolutional layers with parameter constraint. Furthermore, a coupling-structure RoI (region of interest) module is designed to extract robust RoI feature with position-sensitive and context-sensitive information for accuracy improvement. Besides, the feature aggregation strategy is simplified for efficiency. Experiments are conducted on the ImageNet VID dataset. SCNet achieves the state-of-the-art detection performance, exactly 79.5% mAP, with 5.9 points improvement compared to the strong single-frame baseline. |
Keyword | Coupling-structure Roi Module Deformation Anomaly Scale Variation Scale-aware Module Video Object Detection |
DOI | 10.1016/j.neucom.2020.03.110 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Computer Science |
WOS Subject | Computer Science, Artificial Intelligence |
WOS ID | WOS:000542634100010 |
Publisher | ELSEVIERRADARWEG 29, 1043 NX AMSTERDAM, NETHERLANDS |
Scopus ID | 2-s2.0-85084942233 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | Faculty of Science and Technology DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Corresponding Author | Liu,Qiong |
Affiliation | 1.South China University of Technology,Guangzhou,510006,China 2.South China Business College Guangdong University of Foreign Studies,Guangzhou,510545,China 3.University of Macau,Macau,999078,China |
Recommended Citation GB/T 7714 | Wang,Fengchao,Xu,Zhewei,Gan,Yanfen,et al. SCNet: Scale-aware coupling-structure network for efficient video object detection[J]. NEUROCOMPUTING, 2020, 404, 283-293. |
APA | Wang,Fengchao., Xu,Zhewei., Gan,Yanfen., Vong,Chi Man., & Liu,Qiong (2020). SCNet: Scale-aware coupling-structure network for efficient video object detection. NEUROCOMPUTING, 404, 283-293. |
MLA | Wang,Fengchao,et al."SCNet: Scale-aware coupling-structure network for efficient video object detection".NEUROCOMPUTING 404(2020):283-293. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment