Residential College | false |
Status | 已發表Published |
Field-aware Variational Autoencoders for Billion-scale User Representation Learning | |
Ge Fan1; Chaoyun Zhang1; Junyang Chen2; Baopu Li3; Zenglin Xu4; Yingjie Li1; Luyu Peng1; Zhiguo Gong5![]() | |
2022-05-09 | |
Conference Name | 38th IEEE International Conference on Data Engineering, ICDE 2022 |
Source Publication | 2022 IEEE 38th International Conference on Data Engineering (ICDE)
![]() |
Volume | 2022-May |
Pages | 3413-3425 |
Conference Date | 09-12 May 2022 |
Conference Place | Kuala Lumpur, Malaysia |
Country | Malaysia |
Abstract | User representation learning plays an essential role in Internet applications, such as recommender systems. Though developing a universal embedding for users is demanding, only few previous works are conducted in an unsupervised learning manner. The unsupervised method is however important as most of the user data is collected without specific labels. In this paper, we harness the unsupervised advantages of Variational Autoencoders (VAEs), to learn user representation from large-scale, high-dimensional, and multi-field data. We extend the traditional VAE by developing Field-aware VAE (FVAE) to model each feature field with an independent multinomial distribution. To reduce the complexity in training, we employ dynamic hash tables, a batched softmax function, and a feature sampling strategy to improve the efficiency of our method. We conduct experiments on multiple datasets, showing that the proposed FVAE significantly outperforms baselines on several tasks of data reconstruction and tag prediction. Moreover, we deploy the proposed method in real-world applications and conduct online A/B tests in a look-alike system. Results demonstrate that our method can effectively improve the quality of recommendation. To the best of our knowledge, it is the first time that the VAE-based user representation learning model is applied to real-world recommender systems. |
Keyword | Lookalike Systems Recommender Systems User Representation Learning Variational Autoencoder |
DOI | 10.1109/ICDE53745.2022.00321 |
Indexed By | CPCI-S |
WOS Research Area | Computer Science |
WOS Subject | Computer Science, Artificial Intelligence ; Computer Science, Information Systems ; Computer Science, Theory & Methods |
WOS ID | WOS:000855078403056 |
Scopus ID | 2-s2.0-85136123705 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | Faculty of Science and Technology DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Affiliation | 1.Tencent Inc., Shenzhen, China 2.Shenzhen University, Shenzhen, China 3.Baidu USA, Sunnyvale, USA 4.Harbin Institute of Technology (Shenzhen), Shenzhen, China 5.University of Macau, Macau, China |
Recommended Citation GB/T 7714 | Ge Fan,Chaoyun Zhang,Junyang Chen,et al. Field-aware Variational Autoencoders for Billion-scale User Representation Learning[C], 2022, 3413-3425. |
APA | Ge Fan., Chaoyun Zhang., Junyang Chen., Baopu Li., Zenglin Xu., Yingjie Li., Luyu Peng., & Zhiguo Gong (2022). Field-aware Variational Autoencoders for Billion-scale User Representation Learning. 2022 IEEE 38th International Conference on Data Engineering (ICDE), 2022-May, 3413-3425. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment