Residential College | false |
Status | 已發表Published |
Incrementally Optimized Decision Tree for Noisy Big Data | |
Hang Yang; Simon Fong | |
2012-09-28 | |
Conference Name | 1st International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications |
Source Publication | BigMine '12: Proceedings of the 1st International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications |
Pages | 36-44 |
Conference Date | August 12, 2012 |
Conference Place | Beijing, China |
Publication Place | New York, NY, USA |
Publisher | ACM |
Abstract | How to extract meaningful information from big data has been a popular open problem. Decision tree, which has a high degree of knowledge interpretation, has been favored in many real world applications. However noisy values commonly exist in high-speed data streams, e.g. real-time online data feeds that are prone to interference. When processing big data, it is hard to implement pre-processing and sampling in full batches. To solve this tradeoff, this paper proposes a new incremental decision tree algorithm so called incrementally optimized very fast decision tree (iOVFDT). The experiment evaluates the proposed algorithm in comparison to existing methods under noisy data streams environment. Result shows iOVFDT has outperformance on the aspects of higher accuracy and smaller model size. |
Keyword | Data Stream Mining Decision Tree Classification Optimized Very Fast Decision Tree Incremental Optimization |
DOI | 10.1145/2351316.2351322 |
URL | View the original |
Language | 英語English |
Scopus ID | 2-s2.0-84866615880 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Affiliation | Department of Computer and Information Science University of Macau, Av. Padre Tomás Pereira Taipa Macau, China |
First Author Affilication | University of Macau |
Recommended Citation GB/T 7714 | Hang Yang,Simon Fong. Incrementally Optimized Decision Tree for Noisy Big Data[C], New York, NY, USA:ACM, 2012, 36-44. |
APA | Hang Yang., & Simon Fong (2012). Incrementally Optimized Decision Tree for Noisy Big Data. BigMine '12: Proceedings of the 1st International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications, 36-44. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment