Residential College | false |
Status | 已發表Published |
Efficient variation-based feature selection for medical data classification | |
Fong, Simon2; Liang, Justin2; Siu, Shirley W. I.2; Chan, Jonathan H1 | |
2015-09 | |
Source Publication | Journal of Medical Imaging and Health Informatics |
ISSN | 2156-7018 |
Volume | 5Issue:5Pages:1093-1098 |
Abstract | Medical data which collected from sophisticated and sometimes different instruments are described by a large number of feature variables and a historical archive of patients' records, known as multivariate medical dataset. In biomedical data mining, classification model is often built upon such dataset for predicting which particular type of disease that a new instance of record belongs to. One of the challenges in inferring a classification model with good prediction accuracy is to select the relevant features that contribute to maximum predictive power. Many feature selection techniques have been proposed and studied in the past, but none so far claimed to be the best. In this paper, an efficient feature selection method called Clustering Coefficients of Variation (CCV) is applied. CCV is based on a very simple principle of variance-bias which optimally balances the model training between generalization and over-fitting. Through a computer simulation experiment, eleven medical datasets with a substantially large number of features are tested by CCV in comparison to four popular feature selection techniques. Results show that CCV outperformed them in all aspects of averaged performances and speed. By the simplicity of design it is anticipated that CCV will be a useful feature selection method for classifying medical data especially those datasets that are characterized by many features. |
Keyword | Classification Feature Selection Medical Datasets |
DOI | 10.1166/jmihi.2015.1501 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Mathematical & Computational Biology ; Radiology, Nuclear Medicine & Medical Imaging |
WOS Subject | Mathematical & Computational Biology ; Radiology, Nuclear Medicine & Medical Imaging |
WOS ID | WOS:000361271900029 |
Publisher | American Scientific Publishers |
Scopus ID | 2-s2.0-84938313521 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Affiliation | 1.King Mongkuts Univ Technol, Sch Informat Technol, Bangkok 10140, Thailand 2.Univ Macau, Dept Comp & Informat Sci, Taipa, Macau Sar, Peoples R China |
First Author Affilication | University of Macau |
Recommended Citation GB/T 7714 | Fong, Simon,Liang, Justin,Siu, Shirley W. I.,et al. Efficient variation-based feature selection for medical data classification[J]. Journal of Medical Imaging and Health Informatics, 2015, 5(5), 1093-1098. |
APA | Fong, Simon., Liang, Justin., Siu, Shirley W. I.., & Chan, Jonathan H (2015). Efficient variation-based feature selection for medical data classification. Journal of Medical Imaging and Health Informatics, 5(5), 1093-1098. |
MLA | Fong, Simon,et al."Efficient variation-based feature selection for medical data classification".Journal of Medical Imaging and Health Informatics 5.5(2015):1093-1098. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment