Residential Collegefalse
Status已發表Published
Efficient variation-based feature selection for medical data classification
Fong, Simon2; Liang, Justin2; Siu, Shirley W. I.2; Chan, Jonathan H1
2015-09
Source PublicationJournal of Medical Imaging and Health Informatics
ISSN2156-7018
Volume5Issue:5Pages:1093-1098
Abstract

Medical data which collected from sophisticated and sometimes different instruments are described by a large number of feature variables and a historical archive of patients' records, known as multivariate medical dataset. In biomedical data mining, classification model is often built upon such dataset for predicting which particular type of disease that a new instance of record belongs to. One of the challenges in inferring a classification model with good prediction accuracy is to select the relevant features that contribute to maximum predictive power. Many feature selection techniques have been proposed and studied in the past, but none so far claimed to be the best. In this paper, an efficient feature selection method called Clustering Coefficients of Variation (CCV) is applied. CCV is based on a very simple principle of variance-bias which optimally balances the model training between generalization and over-fitting. Through a computer simulation experiment, eleven medical datasets with a substantially large number of features are tested by CCV in comparison to four popular feature selection techniques. Results show that CCV outperformed them in all aspects of averaged performances and speed. By the simplicity of design it is anticipated that CCV will be a useful feature selection method for classifying medical data especially those datasets that are characterized by many features.

KeywordClassification Feature Selection Medical Datasets
DOI10.1166/jmihi.2015.1501
URLView the original
Indexed BySCIE
Language英語English
WOS Research AreaMathematical & Computational Biology ; Radiology, Nuclear Medicine & Medical Imaging
WOS SubjectMathematical & Computational Biology ; Radiology, Nuclear Medicine & Medical Imaging
WOS IDWOS:000361271900029
PublisherAmerican Scientific Publishers
Scopus ID2-s2.0-84938313521
Fulltext Access
Citation statistics
Document TypeJournal article
CollectionDEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Affiliation1.King Mongkuts Univ Technol, Sch Informat Technol, Bangkok 10140, Thailand
2.Univ Macau, Dept Comp & Informat Sci, Taipa, Macau Sar, Peoples R China
First Author AffilicationUniversity of Macau
Recommended Citation
GB/T 7714
Fong, Simon,Liang, Justin,Siu, Shirley W. I.,et al. Efficient variation-based feature selection for medical data classification[J]. Journal of Medical Imaging and Health Informatics, 2015, 5(5), 1093-1098.
APA Fong, Simon., Liang, Justin., Siu, Shirley W. I.., & Chan, Jonathan H (2015). Efficient variation-based feature selection for medical data classification. Journal of Medical Imaging and Health Informatics, 5(5), 1093-1098.
MLA Fong, Simon,et al."Efficient variation-based feature selection for medical data classification".Journal of Medical Imaging and Health Informatics 5.5(2015):1093-1098.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Fong, Simon]'s Articles
[Liang, Justin]'s Articles
[Siu, Shirley W. I.]'s Articles
Baidu academic
Similar articles in Baidu academic
[Fong, Simon]'s Articles
[Liang, Justin]'s Articles
[Siu, Shirley W. I.]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Fong, Simon]'s Articles
[Liang, Justin]'s Articles
[Siu, Shirley W. I.]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.