Residential College | false |
Status | 已發表Published |
Novel up-scale feature aggregation for object detection in aerial images | |
Lin,Hu1; Zhou,Jingkai1; Gan,Yanfen2; Vong,Chi Man3; Liu,Qiong1 | |
2020-10-21 | |
Source Publication | NEUROCOMPUTING |
ISSN | 0925-2312 |
Volume | 411Pages:364-374 |
Abstract | Object detection is a pivotal task for many unmanned aerial vehicle (UAV) applications. Compared to general scenes, the objects in aerial images are typically much smaller. For this reason, most general object detectors suffer from two critical challenges while dealing with aerial images: 1) The widely exploited Feature Pyramid Network works by integrating high-level features to lower levels progressively. However, this manner does not transfer equivalent information from each level of backbone network to the generated features, and the shared detection head faces an unbalanced sources of information flow, damaging the detection accuracy. 2) Up-sampling is commonly used to expand feature resolution for feature fusion or feature aggregation. However, existing up-sampling methods are ineffective to reconstruct high resolution feature maps. To address these two challenges, two works are proposed: 1) An up-scale feature aggregation framework that fully utilizes multi-scale complementary information, and 2) a novel up-sampling method that further improve detection accuracy. These two proposals are integrated into an end-to-end single-stage object detector namely HawkNet. Extensive experiments are conducted on VisDrone-DET2018, UAVDT and DIOR datasets. Compared to the RetinaNet baseline, our HawkNet achieves absolute gains of 6.0%, 1.2% and 5.9% in average precision (AP) on VisDrone-DET2018, UAVDT and DIOR datasets, respectively. For a 800 × 1333 input on the UAVDT dataset, HawkNet with ResNet-50 backbone surpasses existing methods for single-scale inference and achieves the best performance (37.4 AP), while operating at 10.6 frames per second on a single Nvidia GTX 1080Ti GPU. |
Keyword | Aerial Images Feature Aggregation Object Detection Up-sampling |
DOI | 10.1016/j.neucom.2020.06.011 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Computer Science |
WOS Subject | Computer Science, Artificial Intelligence |
WOS ID | WOS:000571895700016 |
Scopus ID | 2-s2.0-85087329363 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Corresponding Author | Liu,Qiong |
Affiliation | 1.South China University of Technology,Guangzhou,510006,China 2.South China Business College,Guangdong University of Foreign Studies,Guangzhou,510545,China 3.University of Macau,Macau,999078,China |
Recommended Citation GB/T 7714 | Lin,Hu,Zhou,Jingkai,Gan,Yanfen,et al. Novel up-scale feature aggregation for object detection in aerial images[J]. NEUROCOMPUTING, 2020, 411, 364-374. |
APA | Lin,Hu., Zhou,Jingkai., Gan,Yanfen., Vong,Chi Man., & Liu,Qiong (2020). Novel up-scale feature aggregation for object detection in aerial images. NEUROCOMPUTING, 411, 364-374. |
MLA | Lin,Hu,et al."Novel up-scale feature aggregation for object detection in aerial images".NEUROCOMPUTING 411(2020):364-374. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment