UM  > Faculty of Science and Technology
Residential Collegefalse
Status已發表Published
Pre-trained SAM as data augmentation for image segmentation
Wu, Junjun1,2; Rao, Yunbo2; Zeng, Shaoning1; Zhang, Bob3
2024-10
Source PublicationCAAI Transactions on Intelligence Technology
ISSN2468-6557
Abstract

Data augmentation plays an important role in training deep neural model by expanding the size and diversity of the dataset. Initially, data augmentation mainly involved some simple transformations of images. Later, in order to increase the diversity and complexity of data, more advanced methods appeared and evolved to sophisticated generative models. However, these methods required a mass of computation of training or searching. In this paper, a novel training-free method that utilises the Pre-Trained Segment Anything Model (SAM) model as a data augmentation tool (PTSAM-DA) is proposed to generate the augmented annotations for images. Without the need for training, it obtains prompt boxes from the original annotations and then feeds the boxes to the pre-trained SAM to generate diverse and improved annotations. In this way, annotations are augmented more ingenious than simple manipulations without incurring huge computation for training a data augmentation model. Multiple comparative experiments on three datasets are conducted, including an in-house dataset, ADE20K and COCO2017. On this in-house dataset, namely Agricultural Plot Segmentation Dataset, maximum improvements of 3.77% and 8.92% are gained in two mainstream metrics, mIoU and mAcc, respectively. Consequently, large vision models like SAM are proven to be promising not only in image segmentation but also in data augmentation.

KeywordData Augmentation Image Segmentation Large Model Segment Anything Model
DOI10.1049/cit2.12381
URLView the original
Indexed BySCIE
Language英語English
WOS Research AreaComputer Science
WOS SubjectComputer Science, Artificial Intelligence
WOS IDWOS:001336276800001
PublisherWILEY, 111 RIVER ST, HOBOKEN 07030-5774, NJ
Scopus ID2-s2.0-85205920704
Fulltext Access
Citation statistics
Document TypeJournal article
CollectionFaculty of Science and Technology
DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Corresponding AuthorRao, Yunbo; Zeng, Shaoning
Affiliation1.Yangtze Delta Region Institute (Huzhou), University of Electronic Science and Technology of China, Huzhou, China
2.School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu, China
3.Pattern Analysis and Machine Intelligence Research Group, Department of Computer and Information Science, University of Macau, Macao, China
Recommended Citation
GB/T 7714
Wu, Junjun,Rao, Yunbo,Zeng, Shaoning,et al. Pre-trained SAM as data augmentation for image segmentation[J]. CAAI Transactions on Intelligence Technology, 2024.
APA Wu, Junjun., Rao, Yunbo., Zeng, Shaoning., & Zhang, Bob (2024). Pre-trained SAM as data augmentation for image segmentation. CAAI Transactions on Intelligence Technology.
MLA Wu, Junjun,et al."Pre-trained SAM as data augmentation for image segmentation".CAAI Transactions on Intelligence Technology (2024).
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Wu, Junjun]'s Articles
[Rao, Yunbo]'s Articles
[Zeng, Shaoning]'s Articles
Baidu academic
Similar articles in Baidu academic
[Wu, Junjun]'s Articles
[Rao, Yunbo]'s Articles
[Zeng, Shaoning]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Wu, Junjun]'s Articles
[Rao, Yunbo]'s Articles
[Zeng, Shaoning]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.