Residential Collegefalse
Status已發表Published
A Lightweight Data Preprocessing Strategy with Fast Contradiction Analysis for Incremental Classifier Learning
Simon Fong1; Robert P. Biuk-Aghai1; Yain-whar Si1; Bee Wah Yap2
2015-03-01
Source PublicationMathematical Problems in Engineering
ISSN1024-123X
Volume2015
Other Abstract

A prime objective in constructing data streaming mining models is to achieve good accuracy, fast learning, and robustness to noise. Although many techniques have been proposed in the past, efforts to improve the accuracy of classification models have been somewhat disparate. These techniques include, but are not limited to, feature selection, dimensionality reduction, and the removal of noise from training data. One limitation common to all of these techniques is the assumption that the full training dataset must be applied. Although this has been effective for traditional batch training, it may not be practical for incremental classifier learning, also known as data stream mining, where only a single pass of the data stream is seen at a time. Because data streams can amount to infinity and the so-called big data phenomenon, the data preprocessing time must be kept to a minimum. This paper introduces a new data preprocessing strategy suitable for the progressive purging of noisy data from the training dataset without the need to process the whole dataset at one time. This strategy is shown via a computer simulation to provide the significant benefit of allowing for the dynamic removal of bad records from the incremental classifier learning process.

DOI10.1155/2015/125781
URLView the original
Indexed BySCIE
Language英語English
WOS Research AreaEngineering ; Mathematics
WOS SubjectEngineering, Multidisciplinary ; Mathematics, Interdisciplinary Applications
WOS IDWOS:000352424900001
PublisherHINDAWI LTD, ADAM HOUSE, 3RD FLR, 1 FITZROY SQ, LONDON, W1T 5HF, ENGLAND
Scopus ID2-s2.0-84925337211
Fulltext Access
Citation statistics
Document TypeJournal article
CollectionDEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Corresponding AuthorSimon Fong
Affiliation1.Department of Computer and Information Science, University of Macau, Macau
2.Faculty of Computer and Mathematical Sciences, Universiti Teknologi MARA, 40450 Shah Alam, Selangor, Malaysia
First Author AffilicationUniversity of Macau
Corresponding Author AffilicationUniversity of Macau
Recommended Citation
GB/T 7714
Simon Fong,Robert P. Biuk-Aghai,Yain-whar Si,et al. A Lightweight Data Preprocessing Strategy with Fast Contradiction Analysis for Incremental Classifier Learning[J]. Mathematical Problems in Engineering, 2015, 2015.
APA Simon Fong., Robert P. Biuk-Aghai., Yain-whar Si., & Bee Wah Yap (2015). A Lightweight Data Preprocessing Strategy with Fast Contradiction Analysis for Incremental Classifier Learning. Mathematical Problems in Engineering, 2015.
MLA Simon Fong,et al."A Lightweight Data Preprocessing Strategy with Fast Contradiction Analysis for Incremental Classifier Learning".Mathematical Problems in Engineering 2015(2015).
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Simon Fong]'s Articles
[Robert P. Biuk-Aghai]'s Articles
[Yain-whar Si]'s Articles
Baidu academic
Similar articles in Baidu academic
[Simon Fong]'s Articles
[Robert P. Biuk-Aghai]'s Articles
[Yain-whar Si]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Simon Fong]'s Articles
[Robert P. Biuk-Aghai]'s Articles
[Yain-whar Si]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.