Residential College | false |
Status | 已發表Published |
Adaptive chunk-based dynamic weighted majority for imbalanced data streams with concept drift | |
Lu, Yang1,2; Cheung, Yiu Ming2; Yan Tang, Yuan3,4 | |
2019-12-05 | |
Source Publication | IEEE Transactions on Neural Networks and Learning Systems |
ISSN | 2162-237X |
Volume | 31Issue:8Pages:2764-2778 |
Abstract | One of the most challenging problems in the field of online learning is concept drift, which deeply influences the classification stability of streaming data. If the data stream is imbalanced, it is even more difficult to detect concept drifts and make an online learner adapt to them. Ensemble algorithms have been found effective for the classification of streaming data with concept drift, whereby an individual classifier is built for each incoming data chunk and its associated weight is adjusted to manage the drift. However, it is difficult to adjust the weights to achieve a balance between the stability and adaptability of the ensemble classifiers. In addition, when the data stream is imbalanced, the use of a size-fixed chunk to build a single classifier can create further problems; the data chunk may contain too few or even no minority class samples (i.e., only majority class samples). A classifier built on such a chunk is unstable in the ensemble. In this article, we propose a chunk-based incremental learning method called adaptive chunk-based dynamic weighted majority (ACDWM) to deal with imbalanced streaming data containing concept drift. ACDWM utilizes an ensemble framework by dynamically weighting the individual classifiers according to their classification performance on the current data chunk. The chunk size is adaptively selected by statistical hypothesis tests to access whether the classifier built on the current data chunk is sufficiently stable. ACDWM has four advantages compared with the existing methods as follows: 1) it can maintain stability when processing nondrifted streams and rapidly adapt to the new concept; 2) it is entirely incremental, i.e., no previous data need to be stored; 3) it stores a limited number of classifiers to ensure high efficiency; and 4) it adaptively selects the chunk size in the concept drift environment. Experiments on both synthetic and real data sets containing concept drift show that ACDWM outperforms both state-of-the-art chunk-based and online methods. |
Keyword | Concept Drift Ensemble Methods Imbalance Learning Online Learning |
DOI | 10.1109/TNNLS.2019.2951814 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Computer Science ; Engineering |
WOS Subject | Computer Science, Artificial Intelligence ; Computer Science, Hardware & Architecture ; Computer Science, Theory & Methods ; Engineering, Electrical & Electronic |
WOS ID | WOS:000557365700008 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Scopus ID | 2-s2.0-85089128769 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE Faculty of Science and Technology |
Corresponding Author | Cheung, Yiu Ming |
Affiliation | 1.Fujian Key Laboratory of Sensing and Computing for Smart City, School of Informatics, Xiamen University, Xiamen, 361005, China 2.Department of Computer Science, Hong Kong Baptist University, Hong Kong 3.Faculty of Science and Technology, UOW College Hong Kong, Community College of City University, Hong Kong 4.Department of Computer and Information Science, Faculty of Science and Technology, University of Macau, Macao |
Recommended Citation GB/T 7714 | Lu, Yang,Cheung, Yiu Ming,Yan Tang, Yuan. Adaptive chunk-based dynamic weighted majority for imbalanced data streams with concept drift[J]. IEEE Transactions on Neural Networks and Learning Systems, 2019, 31(8), 2764-2778. |
APA | Lu, Yang., Cheung, Yiu Ming., & Yan Tang, Yuan (2019). Adaptive chunk-based dynamic weighted majority for imbalanced data streams with concept drift. IEEE Transactions on Neural Networks and Learning Systems, 31(8), 2764-2778. |
MLA | Lu, Yang,et al."Adaptive chunk-based dynamic weighted majority for imbalanced data streams with concept drift".IEEE Transactions on Neural Networks and Learning Systems 31.8(2019):2764-2778. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment