Residential College | false |
Status | 已發表Published |
An Underwater Organism Image Dataset and a Lightweight Module Designed for Object Detection Networks | |
Huang, Jiafeng1; Zhang, Tianjun1; Zhao, Shengjie1; Zhang, Lin1; Zhou, Yicong2 | |
2024-05 | |
Source Publication | ACM Transactions on Multimedia Computing, Communications and Applications |
ISSN | 1551-6857 |
Volume | 20Issue:5Pages:147 |
Abstract | Long-term monitoring and recognition of underwater organism objects are of great significance in marine ecology, fisheries science and many other disciplines. Traditional techniques in this field, including manual fishing-based ones and sonar-based ones, are usually flawed. Specifically, the method based on manual fishing is time-consuming and unsuitable for scientific researches, while the sonar-based one, has the defects of low acoustic image accuracy and large echo errors. In recent years, the rapid development of deep learning and its excellent performance in computer vision tasks make vision-based solutions feasible. However, the researches in this area are still relatively insufficient in mainly two aspects. First, to our knowledge, there is still a lack of large-scale datasets of underwater organism images with accurate annotations. Second, in consideration of the limitation on hardware resources of underwater devices, an underwater organism detection algorithm that is both accurate and lightweight enough to be able to infer in real time is still lacking. As an attempt to fill in the aforementioned research gaps to some extent, we established the Multiple Kinds of Underwater Organisms (MKUO) dataset with accurate bounding box annotations of taxonomic information, which consists of 10,043 annotated images, covering eighty-four underwater organism categories. Based on our benchmark dataset, we evaluated a series of existing object detection algorithms to obtain their accuracy and complexity indicators as the baseline for future reference. In addition, we also propose a novel lightweight module, namely Sparse Ghost Module, designed especially for object detection networks. By substituting the standard convolution with our proposed one, the network complexity can be significantly reduced and the inference speed can be greatly improved without obvious detection accuracy loss. To make our results reproducible, the dataset and the source code are available online at https://cslinzhang.github.io/MKUO-and-Sparse-Ghost-Module/. |
Keyword | Benchmark Dataset Lightweight Module Object Detection |
DOI | 10.1145/3640465 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Computer Science |
WOS Subject | Computer Science, Information Systems ; Computer Science, Software Engineering ; Computer Science, Theory & Methods |
WOS ID | WOS:001192177900027 |
Publisher | ASSOC COMPUTING MACHINERY, 1601 Broadway, 10th Floor, NEW YORK, NY, UNITED STATES |
Scopus ID | 2-s2.0-85185225335 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | Faculty of Science and Technology DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Corresponding Author | Zhao, Shengjie; Zhang, Lin |
Affiliation | 1.School of Software Engineering, Tongji University, Shanghai, No. 4800, CaoAn Rd, 200092, China 2.Department of Computer and Information Science, University of Macau, Taipa University Road, 999078, Macao |
Recommended Citation GB/T 7714 | Huang, Jiafeng,Zhang, Tianjun,Zhao, Shengjie,et al. An Underwater Organism Image Dataset and a Lightweight Module Designed for Object Detection Networks[J]. ACM Transactions on Multimedia Computing, Communications and Applications, 2024, 20(5), 147. |
APA | Huang, Jiafeng., Zhang, Tianjun., Zhao, Shengjie., Zhang, Lin., & Zhou, Yicong (2024). An Underwater Organism Image Dataset and a Lightweight Module Designed for Object Detection Networks. ACM Transactions on Multimedia Computing, Communications and Applications, 20(5), 147. |
MLA | Huang, Jiafeng,et al."An Underwater Organism Image Dataset and a Lightweight Module Designed for Object Detection Networks".ACM Transactions on Multimedia Computing, Communications and Applications 20.5(2024):147. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment