Residential Collegefalse
Status已發表Published
Adaptive swarm cluster-based dynamic multi-objective synthetic minority oversampling technique algorithm for tackling binary imbalanced datasets in biomedical data classification
Jinyan Li1; Simon Fong1; Yunsick Sung2; Kyungeun Cho3; Raymond Wong4; Kelvin K. L. Wong5,6
2016-12-01
Source PublicationBioData Mining
ISSN1756-0381
Volume9Issue:1
Abstract

Background: An imbalanced dataset is defined as a training dataset that has imbalanced proportions of data in both interesting and uninteresting classes. Often in biomedical applications, samples from the stimulating class are rare in a population, such as medical anomalies, positive clinical tests, and particular diseases. Although the target samples in the primitive dataset are small in number, the induction of a classification model over such training data leads to poor prediction performance due to insufficient training from the minority class.

Results: In this paper, we use a novel class-balancing method named adaptive swarm cluster-based dynamic multi-objective synthetic minority oversampling technique (ASCB_DmSMOTE) to solve this imbalanced dataset problem, which is common in biomedical applications. The proposed method combines under-sampling and over-sampling into a swarm optimisation algorithm. It adaptively selects suitable parameters for the rebalancing algorithm to find the best solution. Compared with the other versions of the SMOTE algorithm, significant improvements, which include higher accuracy and credibility, are observed with ASCB_DmSMOTE.

Conclusions: Our proposed method tactfully combines two rebalancing techniques together. It reasonably re-allocates the majority class in the details and dynamically optimises the two parameters of SMOTE to synthesise a reasonable scale of minority class for each clustered sub-imbalanced dataset. The proposed methods ultimately overcome other conventional methods and attains higher credibility with even greater accuracy of the classification model.

KeywordImbalanced Dataset Swarm Optimisation Under-sampling Smote Dynamic Multi-objective Classification Biomedical Data
DOI10.1186/s13040-016-0117-1
URLView the original
Indexed BySCIE
Language英語English
WOS Research AreaMathematical & Computational Biology
WOS SubjectMathematical & Computational Biology
WOS IDWOS:000389492800001
PublisherBMC, CAMPUS, 4 CRINAN ST, LONDON N1 9XW, ENGLAND
The Source to ArticleScopus
Scopus ID2-s2.0-85000420349
Fulltext Access
Citation statistics
Document TypeJournal article
CollectionDEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Corresponding AuthorKelvin K. L. Wong
Affiliation1.Department of Computer and Information Science, University of Macau, Taipa, Macau, S.A.R., China
2.Computer Engineering Division, Keimyung University, Daegu, South Korea
3.Department of Multimedia Engineering, College of Engineering, Dongguk University, Dongdaeipgu, South Korea
4.School of Computer Science and Engineering, University of New South Wales, Sydney, NSW 2000, Australia
5.Centre for Biomedical Engineering, School of Electrical & Electronic Engineering, University of Adelaide, Adelaide, Australia
6.School of Medicine, Western Sydney University, Campbelltown, Sydney, Australia
First Author AffilicationUniversity of Macau
Recommended Citation
GB/T 7714
Jinyan Li,Simon Fong,Yunsick Sung,et al. Adaptive swarm cluster-based dynamic multi-objective synthetic minority oversampling technique algorithm for tackling binary imbalanced datasets in biomedical data classification[J]. BioData Mining, 2016, 9(1).
APA Jinyan Li., Simon Fong., Yunsick Sung., Kyungeun Cho., Raymond Wong., & Kelvin K. L. Wong (2016). Adaptive swarm cluster-based dynamic multi-objective synthetic minority oversampling technique algorithm for tackling binary imbalanced datasets in biomedical data classification. BioData Mining, 9(1).
MLA Jinyan Li,et al."Adaptive swarm cluster-based dynamic multi-objective synthetic minority oversampling technique algorithm for tackling binary imbalanced datasets in biomedical data classification".BioData Mining 9.1(2016).
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Jinyan Li]'s Articles
[Simon Fong]'s Articles
[Yunsick Sung]'s Articles
Baidu academic
Similar articles in Baidu academic
[Jinyan Li]'s Articles
[Simon Fong]'s Articles
[Yunsick Sung]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Jinyan Li]'s Articles
[Simon Fong]'s Articles
[Yunsick Sung]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.