Residential Collegefalse
Status已發表Published
Incrementally Optimized Decision Tree for Noisy Big Data
Hang Yang; Simon Fong
2012-09-28
Conference Name1st International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications
Source PublicationBigMine '12: Proceedings of the 1st International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications
Pages36-44
Conference DateAugust 12, 2012
Conference PlaceBeijing, China
Publication PlaceNew York, NY, USA
PublisherACM
Abstract

How to extract meaningful information from big data has been a popular open problem. Decision tree, which has a high degree of knowledge interpretation, has been favored in many real world applications. However noisy values commonly exist in high-speed data streams, e.g. real-time online data feeds that are prone to interference. When processing big data, it is hard to implement pre-processing and sampling in full batches. To solve this tradeoff, this paper proposes a new incremental decision tree algorithm so called incrementally optimized very fast decision tree (iOVFDT). The experiment evaluates the proposed algorithm in comparison to existing methods under noisy data streams environment. Result shows iOVFDT has outperformance on the aspects of higher accuracy and smaller model size. 

KeywordData Stream Mining Decision Tree Classification Optimized Very Fast Decision Tree Incremental Optimization
DOI10.1145/2351316.2351322
URLView the original
Language英語English
Scopus ID2-s2.0-84866615880
Fulltext Access
Citation statistics
Document TypeConference paper
CollectionDEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
AffiliationDepartment of Computer and Information Science University of Macau, Av. Padre Tomás Pereira Taipa Macau, China
First Author AffilicationUniversity of Macau
Recommended Citation
GB/T 7714
Hang Yang,Simon Fong. Incrementally Optimized Decision Tree for Noisy Big Data[C], New York, NY, USA:ACM, 2012, 36-44.
APA Hang Yang., & Simon Fong (2012). Incrementally Optimized Decision Tree for Noisy Big Data. BigMine '12: Proceedings of the 1st International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications, 36-44.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Hang Yang]'s Articles
[Simon Fong]'s Articles
Baidu academic
Similar articles in Baidu academic
[Hang Yang]'s Articles
[Simon Fong]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Hang Yang]'s Articles
[Simon Fong]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.