A Deep Reinforcement Learning Recommender System With Multiple Policies for Recommendations

doi:10.1109/TII.2022.3209290

UM > THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU)

Residential College	false
Status	已發表Published
	A Deep Reinforcement Learning Recommender System With Multiple Policies for Recommendations
	Mingsheng Fu 1; Liwei Huang 1,2; Ananya Rao 3; Athirai A. Irissappane 3; Jie Zhang 4; Hong Qu 1
	2022-09-26
Source Publication	IEEE Transactions on Industrial Informatics
ISSN	1551-3203
Volume	19 Issue:2 Pages:2049-2061
Abstract	Deep reinforcement learning (DRL) based recommender systems are suitable for user cold-start problems as they can capture user preferences progressively. However, most existing DRL-based recommender systems are suboptimal, since they use the same policy to suit the dynamics of different users. We reformulate recommendation as a multitask Markov Decision Process, where each task represents a set of similar users. Since similar users have closer dynamics, a task-specific policy is more effective than a single universal policy for all users. To make recommendations for cold-start users, we use a default policy to collect some initial interactions to identify the user task, after which a task-specific policy is employed. We use Q-learning to optimize our framework and consider the task uncertainty by the mutual information regarding tasks. Experiments are conducted on three real-world datasets to verify the effectiveness of our proposed framework.
Keyword	Deep Reinforcement Learning (Drl) Multitask Markov Decision Process (Mdp) Recommender System
DOI	10.1109/TII.2022.3209290
URL	View the original
Indexed By	SCIE
Language	英語English
WOS Research Area	Automation & Control Systems ; Computer Science ; Engineering
WOS Subject	Automation & Control Systems ; Computer Science, Interdisciplinary Applications ; Engineering, Industrial
WOS ID	WOS:000926964700089
Publisher	IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC445 HOES LANE, PISCATAWAY, NJ 08855-4141
Scopus ID	2-s2.0-85139520746
Fulltext Access	View Full-Text via DOI View Full-Text via Web of Science View Full-Text via Scopus
Citation statistics
Document Type	Journal article
Collection	THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU)
Corresponding Author	Liwei Huang
Affiliation	1.School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 610054, China 2.State Key Laboratory of IoTSC, University of Macau, Taipa, Macau 999078, China 3.School of Engineering and Technology, University of Washington, Tacoma, WA 98402 USA 4.School of Computer Science and Engineering, Nanyang Technological University, Singapore 639798
Corresponding Author Affilication	University of Macau
Recommended Citation GB/T 7714	Mingsheng Fu,Liwei Huang,Ananya Rao,et al. A Deep Reinforcement Learning Recommender System With Multiple Policies for Recommendations[J]. IEEE Transactions on Industrial Informatics, 2022, 19(2), 2049-2061.
APA	Mingsheng Fu., Liwei Huang., Ananya Rao., Athirai A. Irissappane., Jie Zhang., & Hong Qu (2022). A Deep Reinforcement Learning Recommender System With Multiple Policies for Recommendations. IEEE Transactions on Industrial Informatics, 19(2), 2049-2061.
MLA	Mingsheng Fu,et al."A Deep Reinforcement Learning Recommender System With Multiple Policies for Recommendations".IEEE Transactions on Industrial Informatics 19.2(2022):2049-2061.

Files in This Item:
There are no files associated with this item.

If you have any objections to this item, please fill out the form below and the administrator will contact you as soon as possible.
Content:
Email：	*
Affiliation No.
Verification Code:	Refresh

Any comments and suggestions are welcomed.
Title:	*
Content:
Email：	*
Verification Code:	Refresh