Residential Collegefalse
Status已發表Published
CGraphNet: Contrastive Graph Context Prediction for Sparse Unlabeled Short Text Representation Learning on Social Media
Chen, Junyang1; Guo, Jingcai2; Li, Xueliang3; Wang, Huan4; Xu, Zhenghua5; Gong, Zhiguo6; Zhang, Liangjie1; Leung, Victor C.M.1
2024-10
Source PublicationIEEE Transactions on Computational Social Systems
ISSN2329-924X
Abstract

Unlabeled text representation learning (UTRL), encompassing static word embeddings such as Word2Vec and contextualized word embeddings such as bidirectional encoder representations from transformer (BERT), aims to capture semantic word relationships in a low-dimensional space without the need for manual labeling. These word embeddings are invaluable for downstream tasks such as document classification and clustering. However, the surge of short texts generated daily on social media platforms results in sparse word cooccurrences, compromising UTRL outcomes. Contextualized models such as recurrent neural network (RNN) and BERT, while impressive, often struggle with predicting the next word due to sparse word sequences in short texts. To address this, we introduce CGraphNet, a contrastive graph context prediction model designed for UTRL. This approach converts short texts into graphs, establishing links between sequentially occurring words. Information from the next word and its neighbors informs the target prediction, a process referred to as graph context prediction, mitigating sparse word cooccurrence issues in brief sentences. To minimize noise, an attention mechanism assigns importance to neighbors, while a contrastive objective encourages more distinctive representations by comparing the target word with its neighbors. Our experiments demonstrate CGraphNet's superior performance over other baselines, particularly in classification and clustering tasks on real-world datasets.

KeywordContrastive Graph Context Prediction Sequential Learning Social Media Short Text Representation Learning Sparsity Problem Text Mining
DOI10.1109/TCSS.2024.3452695
URLView the original
Indexed BySCIE
Language英語English
WOS Research AreaComputer Science
WOS SubjectComputer Science, Cybernetics ; Computer Science, Information Systems
WOS IDWOS:001328984200001
PublisherIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC, 445 HOES LANE, PISCATAWAY, NJ 08855-4141
Scopus ID2-s2.0-85205833043
Fulltext Access
Citation statistics
Document TypeJournal article
CollectionTHE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU)
Faculty of Science and Technology
DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Corresponding AuthorGuo, Jingcai
Affiliation1.Shenzhen University, College of Computer Science and Software Engineering, Shenzhen, 518060, China
2.The Hong Kong Polytechnic University, Department of Computing, Hong Kong, 999077, Hong Kong
3.Shenzhen University, National Engineering Laboratory for Big Data System Computing Technology, Shenzhen, 518060, China
4.Huazhong Agricultural University, College of Informatics, Wuhan, 430070, China
5.Hebei University of Technology, State Key Laboratory of Reliability and Intelligence of Electrical Equipment, Tianjin, 300401, China
6.University of Macau, State Key Laboratory of Internet of Things for Smart City, Department of Computer Information Science, 999078, Macao
Recommended Citation
GB/T 7714
Chen, Junyang,Guo, Jingcai,Li, Xueliang,et al. CGraphNet: Contrastive Graph Context Prediction for Sparse Unlabeled Short Text Representation Learning on Social Media[J]. IEEE Transactions on Computational Social Systems, 2024.
APA Chen, Junyang., Guo, Jingcai., Li, Xueliang., Wang, Huan., Xu, Zhenghua., Gong, Zhiguo., Zhang, Liangjie., & Leung, Victor C.M. (2024). CGraphNet: Contrastive Graph Context Prediction for Sparse Unlabeled Short Text Representation Learning on Social Media. IEEE Transactions on Computational Social Systems.
MLA Chen, Junyang,et al."CGraphNet: Contrastive Graph Context Prediction for Sparse Unlabeled Short Text Representation Learning on Social Media".IEEE Transactions on Computational Social Systems (2024).
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Chen, Junyang]'s Articles
[Guo, Jingcai]'s Articles
[Li, Xueliang]'s Articles
Baidu academic
Similar articles in Baidu academic
[Chen, Junyang]'s Articles
[Guo, Jingcai]'s Articles
[Li, Xueliang]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Chen, Junyang]'s Articles
[Guo, Jingcai]'s Articles
[Li, Xueliang]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.