CGraphNet: Contrastive Graph Context Prediction for Sparse Unlabeled Short Text Representation Learning on Social Media

doi:10.1109/TCSS.2024.3452695

UM > THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU)

Residential College	false
Status	已發表Published
	CGraphNet: Contrastive Graph Context Prediction for Sparse Unlabeled Short Text Representation Learning on Social Media
	Chen, Junyang 1; Guo, Jingcai 2; Li, Xueliang 3; Wang, Huan 4; Xu, Zhenghua 5; Gong, Zhiguo6 ; Zhang, Liangjie 1; Leung, Victor C.M.1
	2024-10
Source Publication	IEEE Transactions on Computational Social Systems
ISSN	2329-924X
Abstract	Unlabeled text representation learning (UTRL), encompassing static word embeddings such as Word2Vec and contextualized word embeddings such as bidirectional encoder representations from transformer (BERT), aims to capture semantic word relationships in a low-dimensional space without the need for manual labeling. These word embeddings are invaluable for downstream tasks such as document classification and clustering. However, the surge of short texts generated daily on social media platforms results in sparse word cooccurrences, compromising UTRL outcomes. Contextualized models such as recurrent neural network (RNN) and BERT, while impressive, often struggle with predicting the next word due to sparse word sequences in short texts. To address this, we introduce CGraphNet, a contrastive graph context prediction model designed for UTRL. This approach converts short texts into graphs, establishing links between sequentially occurring words. Information from the next word and its neighbors informs the target prediction, a process referred to as graph context prediction, mitigating sparse word cooccurrence issues in brief sentences. To minimize noise, an attention mechanism assigns importance to neighbors, while a contrastive objective encourages more distinctive representations by comparing the target word with its neighbors. Our experiments demonstrate CGraphNet's superior performance over other baselines, particularly in classification and clustering tasks on real-world datasets.
Keyword	Contrastive Graph Context Prediction Sequential Learning Social Media Short Text Representation Learning Sparsity Problem Text Mining
DOI	10.1109/TCSS.2024.3452695
URL	View the original
Indexed By	SCIE
Language	英語English
WOS Research Area	Computer Science
WOS Subject	Computer Science, Cybernetics ; Computer Science, Information Systems
WOS ID	WOS:001328984200001
Publisher	IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC, 445 HOES LANE, PISCATAWAY, NJ 08855-4141
Scopus ID	2-s2.0-85205833043
Fulltext Access	View Full-Text via DOI View Full-Text via Web of Science View Full-Text via Scopus
Citation statistics
Document Type	Journal article
Collection	THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU) Faculty of Science and Technology DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Corresponding Author	Guo, Jingcai
Affiliation	1.Shenzhen University, College of Computer Science and Software Engineering, Shenzhen, 518060, China 2.The Hong Kong Polytechnic University, Department of Computing, Hong Kong, 999077, Hong Kong 3.Shenzhen University, National Engineering Laboratory for Big Data System Computing Technology, Shenzhen, 518060, China 4.Huazhong Agricultural University, College of Informatics, Wuhan, 430070, China 5.Hebei University of Technology, State Key Laboratory of Reliability and Intelligence of Electrical Equipment, Tianjin, 300401, China 6.University of Macau, State Key Laboratory of Internet of Things for Smart City, Department of Computer Information Science, 999078, Macao
Recommended Citation GB/T 7714	Chen, Junyang,Guo, Jingcai,Li, Xueliang,et al. CGraphNet: Contrastive Graph Context Prediction for Sparse Unlabeled Short Text Representation Learning on Social Media[J]. IEEE Transactions on Computational Social Systems, 2024.
APA	Chen, Junyang., Guo, Jingcai., Li, Xueliang., Wang, Huan., Xu, Zhenghua., Gong, Zhiguo., Zhang, Liangjie., & Leung, Victor C.M. (2024). CGraphNet: Contrastive Graph Context Prediction for Sparse Unlabeled Short Text Representation Learning on Social Media. IEEE Transactions on Computational Social Systems.
MLA	Chen, Junyang,et al."CGraphNet: Contrastive Graph Context Prediction for Sparse Unlabeled Short Text Representation Learning on Social Media".IEEE Transactions on Computational Social Systems (2024).

Files in This Item:
There are no files associated with this item.

If you have any objections to this item, please fill out the form below and the administrator will contact you as soon as possible.
Content:
Email：	*
Affiliation No.
Verification Code:	Refresh

Any comments and suggestions are welcomed.
Title:	*
Content:
Email：	*
Verification Code:	Refresh