Residential College | false |
Status | 已發表Published |
GraphSR: A Data Augmentation Algorithm for Imbalanced Node Classification | |
Zhou Mengting1,2; Gong ZG(鞏志國)1,2 | |
2023-06-27 | |
Conference Name | 37th AAAI Conference on Artificial Intelligence |
Source Publication | Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023 |
Volume | 37 |
Pages | 4954 - 4962 |
Conference Date | 2023/02/07-2023/02/14 |
Conference Place | Washington DC |
Country | USA |
Publisher | AAAI Press |
Abstract | Graph neural networks (GNNs) have achieved great success in node classification tasks. However, existing GNNs naturally bias towards the majority classes with more labelled data and ignore those minority classes with relatively few labelled ones. The traditional techniques often resort over-sampling methods, but they may cause overfitting problem. More recently, some works propose to synthesize additional nodes for minority classes from the labelled nodes, however, there is no any guarantee if those generated nodes really stand for the corresponding minority classes. In fact, improperly synthesized nodes may result in insufficient generalization of the algorithm. To resolve the problem, in this paper we seek to automatically augment the minority classes from the massive unlabelled nodes of the graph. Specifically, we propose GraphSR, a novel self-training strategy to augment the minority classes with significant diversity of unlabelled nodes, which is based on a Similarity-based selection module and a Reinforcement Learning(RL) selection module. The first module finds a subset of unlabelled nodes which are most similar to those labelled minority nodes, and the second one further determines the representative and reliable nodes from the subset via RL technique. Furthermore, the RL-based module can adaptively determine the sampling scale according to current training data. This strategy is general and can be easily combined with different GNNs models. Our experiments demonstrate the proposed approach outperforms the state-ofthe-art baselines on various class-imbalanced datasets. |
DOI | 10.48550/arXiv.2302.12814 |
URL | View the original |
Language | 英語English |
Scopus ID | 2-s2.0-85153768983 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU) DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Corresponding Author | Gong ZG(鞏志國) |
Affiliation | 1.State Key Laboratory of Internet of Things for Smart City, University of Macau, Macao 2.Guangdong-Macau Joint Laboratory for Advanced and Intelligent Computing, Macao |
First Author Affilication | University of Macau |
Corresponding Author Affilication | University of Macau |
Recommended Citation GB/T 7714 | Zhou Mengting,Gong ZG. GraphSR: A Data Augmentation Algorithm for Imbalanced Node Classification[C]:AAAI Press, 2023, 4954 - 4962. |
APA | Zhou Mengting., & Gong ZG (2023). GraphSR: A Data Augmentation Algorithm for Imbalanced Node Classification. Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023, 37, 4954 - 4962. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment