UM  > Faculty of Science and Technology
Residential Collegefalse
Status已發表Published
Field-aware Variational Autoencoders for Billion-scale User Representation Learning
Ge Fan1; Chaoyun Zhang1; Junyang Chen2; Baopu Li3; Zenglin Xu4; Yingjie Li1; Luyu Peng1; Zhiguo Gong5
2022-05-09
Conference Name38th IEEE International Conference on Data Engineering, ICDE 2022
Source Publication2022 IEEE 38th International Conference on Data Engineering (ICDE)
Volume2022-May
Pages3413-3425
Conference Date09-12 May 2022
Conference PlaceKuala Lumpur, Malaysia
CountryMalaysia
Abstract

User representation learning plays an essential role in Internet applications, such as recommender systems. Though developing a universal embedding for users is demanding, only few previous works are conducted in an unsupervised learning manner. The unsupervised method is however important as most of the user data is collected without specific labels. In this paper, we harness the unsupervised advantages of Variational Autoencoders (VAEs), to learn user representation from large-scale, high-dimensional, and multi-field data. We extend the traditional VAE by developing Field-aware VAE (FVAE) to model each feature field with an independent multinomial distribution. To reduce the complexity in training, we employ dynamic hash tables, a batched softmax function, and a feature sampling strategy to improve the efficiency of our method. We conduct experiments on multiple datasets, showing that the proposed FVAE significantly outperforms baselines on several tasks of data reconstruction and tag prediction. Moreover, we deploy the proposed method in real-world applications and conduct online A/B tests in a look-alike system. Results demonstrate that our method can effectively improve the quality of recommendation. To the best of our knowledge, it is the first time that the VAE-based user representation learning model is applied to real-world recommender systems.

KeywordLookalike Systems Recommender Systems User Representation Learning Variational Autoencoder
DOI10.1109/ICDE53745.2022.00321
Indexed ByCPCI-S
WOS Research AreaComputer Science
WOS SubjectComputer Science, Artificial Intelligence ; Computer Science, Information Systems ; Computer Science, Theory & Methods
WOS IDWOS:000855078403056
Scopus ID2-s2.0-85136123705
Fulltext Access
Citation statistics
Cited Times [WOS]:6   [WOS Record]     [Related Records in WOS]
Document TypeConference paper
CollectionFaculty of Science and Technology
DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Affiliation1.Tencent Inc., Shenzhen, China
2.Shenzhen University, Shenzhen, China
3.Baidu USA, Sunnyvale, USA
4.Harbin Institute of Technology (Shenzhen), Shenzhen, China
5.University of Macau, Macau, China
Recommended Citation
GB/T 7714
Ge Fan,Chaoyun Zhang,Junyang Chen,et al. Field-aware Variational Autoencoders for Billion-scale User Representation Learning[C], 2022, 3413-3425.
APA Ge Fan., Chaoyun Zhang., Junyang Chen., Baopu Li., Zenglin Xu., Yingjie Li., Luyu Peng., & Zhiguo Gong (2022). Field-aware Variational Autoencoders for Billion-scale User Representation Learning. 2022 IEEE 38th International Conference on Data Engineering (ICDE), 2022-May, 3413-3425.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Ge Fan]'s Articles
[Chaoyun Zhang]'s Articles
[Junyang Chen]'s Articles
Baidu academic
Similar articles in Baidu academic
[Ge Fan]'s Articles
[Chaoyun Zhang]'s Articles
[Junyang Chen]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Ge Fan]'s Articles
[Chaoyun Zhang]'s Articles
[Junyang Chen]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.