TY - GEN
T1 - UserSim
T2 - 30th World Wide Web Conference, WWW 2021
AU - Zhao, Xiangyu
AU - Xia, Long
AU - Zou, Lixin
AU - Liu, Hui
AU - Yin, Dawei
AU - Tang, Jiliang
N1 - Publisher Copyright:
© 2021 ACM.
PY - 2021/6/3
Y1 - 2021/6/3
N2 - With the recent advances in Reinforcement Learning (RL), there have been tremendous interests in employing RL for recommender systems. However, directly training and evaluating a new RL-based recommendation algorithm needs to collect users' real-time feedback in the real system, which is time/effort consuming and could negatively impact users' experiences. Thus, it calls for a user simulator that can mimic real users' behaviors to pre-train and evaluate new recommendation algorithms. Simulating users' behaviors in a dynamic system faces immense challenges - (i) the underlying item distribution is complex, and (ii) historical logs for each user are limited. In this paper, we develop a user simulator based on a Generative Adversarial Network (GAN). To be specific, the generator captures the underlying distribution of users' historical logs and generates realistic logs that can be considered as augmentations of real logs; while the discriminator not only distinguishes real and fake logs but also predicts users' behaviors. The experimental results based on benchmark datasets demonstrate the effectiveness of the proposed simulator.
AB - With the recent advances in Reinforcement Learning (RL), there have been tremendous interests in employing RL for recommender systems. However, directly training and evaluating a new RL-based recommendation algorithm needs to collect users' real-time feedback in the real system, which is time/effort consuming and could negatively impact users' experiences. Thus, it calls for a user simulator that can mimic real users' behaviors to pre-train and evaluate new recommendation algorithms. Simulating users' behaviors in a dynamic system faces immense challenges - (i) the underlying item distribution is complex, and (ii) historical logs for each user are limited. In this paper, we develop a user simulator based on a Generative Adversarial Network (GAN). To be specific, the generator captures the underlying distribution of users' historical logs and generates realistic logs that can be considered as augmentations of real logs; while the discriminator not only distinguishes real and fake logs but also predicts users' behaviors. The experimental results based on benchmark datasets demonstrate the effectiveness of the proposed simulator.
KW - Generative Adversarial Network
KW - Recommender System
KW - Reinforcement Learning
KW - User Simulation
UR - https://www.scopus.com/pages/publications/85107918995
U2 - 10.1145/3442381.3450125
DO - 10.1145/3442381.3450125
M3 - 会议稿件
AN - SCOPUS:85107918995
T3 - The Web Conference 2021 - Proceedings of the World Wide Web Conference, WWW 2021
SP - 3582
EP - 3589
BT - The Web Conference 2021 - Proceedings of the World Wide Web Conference, WWW 2021
PB - Association for Computing Machinery, Inc
Y2 - 19 April 2021 through 23 April 2021
ER -