Residential College | false |
Status | 已發表Published |
Addressing Low Dimensionality Feature Subset Selection: ReliefF(-k) or Extended Correlation-Based Feature Selection(eCFS)? | |
Antonio J. Tallón-Ballesteros1; Luís Cavique2; Simon Fong3 | |
2020 | |
Conference Name | 14th International Conference on Soft Computing Models in Industrial and Environmental Applications (SOCO) |
Source Publication | Advances in Intelligent Systems and Computing |
Volume | 950 |
Pages | 251-260 |
Conference Date | MAY 13-15, 2019 |
Conference Place | Seville, SPAIN |
Country | SPAIN |
Publication Place | SWITZERLAND |
Publisher | SPRINGER INTERNATIONAL PUBLISHING AG |
Abstract | This paper tackles problems where attribute selection is not only able to choose a few features but also to achieve a low performance classification in terms of accuracy compared to the full attribute set. Correlation-based feature selection (CFS) has been set as the baseline attribute subset selector due to its popularity and high performance. Around hundred data sets have been collected and submitted to CFS; then the problems fulfilling simultaneously the conditions: (a) a number of selected attributes lower than six and (b) a percentage of selected attributes lower than a forty per cent, have been tested onto two directions. Firstly, in the scope of data selection at the feature level, an advanced contemporary approach have been conducted as well as some options proposed in a prior work. Secondly, the pre-processed and initial problems have been tested with some sturdy classifiers. Moreover, this work introduces a new taxonomy of feature selection according to the solution type and the followed way to compute it. The test bed comprises seven problems featured by a low dimensionality after the CFS application, three out of them report a single selected attribute, another one with two extracted features and the three remaining data sets with four or five retained attributes; additionally, the initial feature set is between six and twenty-nine and the complexity of the problems, in terms of classes, fluctuates between two and twenty-one, throwing averages of sixteen and around five for both aforementioned properties. The contribution concluded that the advanced procedure (extended CFS) is suitable for problems where only one or two attributes are selected by CFS; for data sets with more than two selected features the baseline method is preferable to the advanced one, although the considered feature ranking method achieved intermediate results. |
Keyword | Cfs Correlation Extended Feature Subset Selection Feature Ranking Feature Subset Selection Machine Learning |
DOI | 10.1007/978-3-030-20055-8_24 |
URL | View the original |
Indexed By | CPCI-S |
Language | 英語English |
WOS Research Area | Computer Science |
WOS Subject | Computer Science, Artificial Intelligence ; Computer Science, Interdisciplinary Applications |
WOS ID | WOS:000490706700024 |
Scopus ID | 2-s2.0-85065917001 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Corresponding Author | Antonio J. Tallón-Ballesteros |
Affiliation | 1.Department of Electronic, Computer Systems and Automation Engineering, University of Huelva, Huelva, Spain 2.Universidade Aberta, Lisbon, Portugal 3.Department of Computer and Information Science, University of Macau, Taipa, China |
Recommended Citation GB/T 7714 | Antonio J. Tallón-Ballesteros,Luís Cavique,Simon Fong. Addressing Low Dimensionality Feature Subset Selection: ReliefF(-k) or Extended Correlation-Based Feature Selection(eCFS)?[C], SWITZERLAND:SPRINGER INTERNATIONAL PUBLISHING AG, 2020, 251-260. |
APA | Antonio J. Tallón-Ballesteros., Luís Cavique., & Simon Fong (2020). Addressing Low Dimensionality Feature Subset Selection: ReliefF(-k) or Extended Correlation-Based Feature Selection(eCFS)?. Advances in Intelligent Systems and Computing, 950, 251-260. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment