Residential Collegefalse
Status已發表Published
Incrementally Optimized Decision Tree for Mining Imperfect Data Streams
Hang Yang; Simon Fong
2012-12-01
Conference Name4th International Conference, NDT 2012
Source PublicationNetworked Digital Technologies
Volume293 PART 1
Pages281-296
Conference DateApril 24-26, 2012
Conference PlaceDubai, UAE
PublisherSpringer, Berlin, Heidelberg
Abstract

The Very Fast Decision Tree (VFDT) is one of the most important classification algorithms for real-time data stream mining. However, imperfections in data streams, such as noise and imbalanced class distribution, do exist in real world applications and they jeopardize the performance of VFDT. Traditional sampling techniques and post-pruning may be impractical for a non-stopping data stream. To deal with the adverse effects of imperfect data streams, we have invented an incremental optimization model that can be integrated into the decision tree model for data stream classification. It is called the Incrementally Optimized Very Fast Decision Tree (I-OVFDT) and it balances performance (in relation to prediction accuracy, tree size and learning time) and diminishes error and tree size dynamically. Furthermore, two new Functional Tree Leaf strategies are extended for I-OVFDT that result in superior performance compared to VFDT and its variant algorithms. Our new model works especially well for imperfect data streams. I-OVFDT is an anytime algorithm that can be integrated into those existing VFDT-extended algorithms based on Hoeffding bound in node splitting. The experimental results show that I-OVFDT has higher accuracy and more compact tree size than other existing data stream classification methods. 

KeywordData Stream Mining Decision Tree Classification Optimized Very Fast Decision Tree Incremental Optimization
DOI10.1007/978-3-642-30507-8_25
URLView the original
Language英語English
Scopus ID2-s2.0-84880497064
Fulltext Access
Citation statistics
Document TypeConference paper
CollectionDEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
AffiliationDepartment of Computer and Information Science, University of Macau, Taipa, Macau SAR, China
First Author AffilicationUniversity of Macau
Recommended Citation
GB/T 7714
Hang Yang,Simon Fong. Incrementally Optimized Decision Tree for Mining Imperfect Data Streams[C]:Springer, Berlin, Heidelberg, 2012, 281-296.
APA Hang Yang., & Simon Fong (2012). Incrementally Optimized Decision Tree for Mining Imperfect Data Streams. Networked Digital Technologies, 293 PART 1, 281-296.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Hang Yang]'s Articles
[Simon Fong]'s Articles
Baidu academic
Similar articles in Baidu academic
[Hang Yang]'s Articles
[Simon Fong]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Hang Yang]'s Articles
[Simon Fong]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.