Residential College | false |
Status | 已發表Published |
A Deep Reinforcement Learning Recommender System With Multiple Policies for Recommendations | |
Mingsheng Fu1; Liwei Huang1,2; Ananya Rao3; Athirai A. Irissappane3; Jie Zhang4; Hong Qu1 | |
2022-09-26 | |
Source Publication | IEEE Transactions on Industrial Informatics |
ISSN | 1551-3203 |
Volume | 19Issue:2Pages:2049-2061 |
Abstract | Deep reinforcement learning (DRL) based recommender systems are suitable for user cold-start problems as they can capture user preferences progressively. However, most existing DRL-based recommender systems are suboptimal, since they use the same policy to suit the dynamics of different users. We reformulate recommendation as a multitask Markov Decision Process, where each task represents a set of similar users. Since similar users have closer dynamics, a task-specific policy is more effective than a single universal policy for all users. To make recommendations for cold-start users, we use a default policy to collect some initial interactions to identify the user task, after which a task-specific policy is employed. We use Q-learning to optimize our framework and consider the task uncertainty by the mutual information regarding tasks. Experiments are conducted on three real-world datasets to verify the effectiveness of our proposed framework. |
Keyword | Deep Reinforcement Learning (Drl) Multitask Markov Decision Process (Mdp) Recommender System |
DOI | 10.1109/TII.2022.3209290 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Automation & Control Systems ; Computer Science ; Engineering |
WOS Subject | Automation & Control Systems ; Computer Science, Interdisciplinary Applications ; Engineering, Industrial |
WOS ID | WOS:000926964700089 |
Publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC445 HOES LANE, PISCATAWAY, NJ 08855-4141 |
Scopus ID | 2-s2.0-85139520746 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU) |
Corresponding Author | Liwei Huang |
Affiliation | 1.School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 610054, China 2.State Key Laboratory of IoTSC, University of Macau, Taipa, Macau 999078, China 3.School of Engineering and Technology, University of Washington, Tacoma, WA 98402 USA 4.School of Computer Science and Engineering, Nanyang Technological University, Singapore 639798 |
Corresponding Author Affilication | University of Macau |
Recommended Citation GB/T 7714 | Mingsheng Fu,Liwei Huang,Ananya Rao,et al. A Deep Reinforcement Learning Recommender System With Multiple Policies for Recommendations[J]. IEEE Transactions on Industrial Informatics, 2022, 19(2), 2049-2061. |
APA | Mingsheng Fu., Liwei Huang., Ananya Rao., Athirai A. Irissappane., Jie Zhang., & Hong Qu (2022). A Deep Reinforcement Learning Recommender System With Multiple Policies for Recommendations. IEEE Transactions on Industrial Informatics, 19(2), 2049-2061. |
MLA | Mingsheng Fu,et al."A Deep Reinforcement Learning Recommender System With Multiple Policies for Recommendations".IEEE Transactions on Industrial Informatics 19.2(2022):2049-2061. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment