Residential Collegefalse
Status已發表Published
Moderated VFDT in Stream Mining Using Adaptive Tie Threshold and Incremental Pruning
Hang Yang; Simon Fong
2011-09-07
Conference Name13th International Conference, DaWaK 2011
Source PublicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume6862 LNCS
Pages471-483
Conference DateAugust/September 2011
Conference PlaceToulouse, France
Abstract

Very Fast Decision Tree (VFDT) is one of the most popular decision tree algorithms in data stream mining. The tree building process is based on the principle of the Hoeffding bound to decide on splitting nodes with sufficient data statistics at the leaf. The original version of VFDT requires a user-defined tie threshold by which a split will be forced to break to control the tree size. It is an open problem that the tree size grows tremendously with noise as continuous data stream in and the classifier's accuracy drops. In this paper, we propose a Moderated VFDT (M-VFDT), which uses an adaptive tie threshold for node splitting control by incremental computing. The tree building process is as fast as that of the original VFDT. The accuracy of M-VFDT improves significantly even under the presence of noise in the data stream. To solve the explosion of tree size, which is still an inherent problem in VFDT, we propose two lightweight pre-pruning mechanisms for stream mining (post-pruning is not appropriate here because of the streaming operation). Experiments are conducted to verify the merits of our new methods. M-VFDT with a pruning mechanism shows a better performance than the original VFDT at all times. Our contribution is a new model that can efficiently achieve a compact decision tree and good accuracy as an optimal balance in data stream mining. 

KeywordData Stream Mining Hoeffding Bound Incremental Pruning
DOI10.1007/978-3-642-23544-3_36
URLView the original
Language英語English
Scopus ID2-s2.0-80052341936
Fulltext Access
Citation statistics
Document TypeConference paper
CollectionDEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
AffiliationDepartment of Science and Technology, University of Macau, Av. Padre Tomás Pereira Taipa, Macau, China
First Author AffilicationUniversity of Macau
Recommended Citation
GB/T 7714
Hang Yang,Simon Fong. Moderated VFDT in Stream Mining Using Adaptive Tie Threshold and Incremental Pruning[C], 2011, 471-483.
APA Hang Yang., & Simon Fong (2011). Moderated VFDT in Stream Mining Using Adaptive Tie Threshold and Incremental Pruning. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 6862 LNCS, 471-483.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Hang Yang]'s Articles
[Simon Fong]'s Articles
Baidu academic
Similar articles in Baidu academic
[Hang Yang]'s Articles
[Simon Fong]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Hang Yang]'s Articles
[Simon Fong]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.