Learning to collaborate in multi-module recommendation via multi-agent reinforcement learning without communication

22/09/2020

Learning to collaborate in multi-module recommendation via multi-agent reinforcement learning without communication

Xu HE, Bo An, Yanghua Li, Haikai Chen, Rundong Wang, Xinrun Wang, Runsheng Yu, Xin Li, Zhirong Wang

Keywords: Reinforcement learning

Abstract Paper Similar Papers

Abstract: With the rise of online e-commerce platforms, more and more customers prefer to shop online. To sell more products, online platforms introduce various modules to recommend items with different properties such as huge discounts. A web page often consists of different independent modules. The ranking policies of these modules are decided by different teams and optimized individually without cooperation, which might result in competition between modules. Thus, the global policy of the whole page could be sub-optimal. In this paper, we propose a novel multi-agent cooperative reinforcement learning approach with the restriction that different modules cannot communicate. Our contributions are three-fold. Firstly, inspired by a solution concept in game theory named correlated equilibrium, we design a signal network to promote cooperation of all modules by generating signals (vectors) for different modules. Secondly, an entropy-regularized version of the signal network is proposed to coordinate agents’ exploration of the optimal global policy. Furthermore, experiments based on real-world e-commerce data demonstrate that our algorithm obtains superior performance over baselines.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at RECSYS 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

02/02/2021

Traffic Shaping in E-Commercial Search Engine: Multi-Objective Online Welfare Maximization

Liucheng Sun, Chenwei Weng, Chengfu Huo and
Weijun Ren, Guochuan Zhang, Xin Li

Keywords Paper

0

0

0

0

13:50

02/02/2021

Joint Incentive Optimization of Customer and Merchant in Mobile Payment Marketing

Li Yu, Zhengwei Wu, Tianchi Cai and
Ziqi Liu, Zhiqiang Zhang, Lihong Gu, Xiaodong Zeng, Jinjie Gu

Keywords Paper

0

0

0

0

16:46

13/04/2021

Dominate or delete: Decentralized competing bandits in serial dictatorship

Abishek Sankararaman, Soumya Basu, Karthik Abinav Sankararaman

Keywords Paper

0

0

0

0

2:57

19/10/2020

P-companion: A principled framework for diversified complementary product recommendation

Junheng Hao, Tong Zhao, Jin Li and
Xin Luna Dong, Christos Faloutsos, Yizhou Sun, Wei Wang

Keywords Paper

complementary product recommendation, product relationship understanding, recommender system

0

0

0

0

9:12

19/08/2021

Altruism Design in Networked Public Goods Games

Sixie Yu, David Kempe, Yevgeniy Vorobeychik

Keywords Paper

Agent-based and Multi-agent Systems, Algorithmic Game Theory, Noncooperative Games

0

0

0

0

13:51

06/12/2020

Contextual Games: Multi-Agent Learning with Side Information

Pier Giuseppe Sessa, Ilija Bogunovic, Andreas Krause, Maryam Kamgarpour

Keywords Paper

0

0

0

0

3:30

23/08/2020

Maximizing cumulative user engagement in sequential recommendation: An online optimization perspective

Yifei Zhao, Yu-Hang Zhou, Mingdong Ou and
Huan Xu, Nan Li

Keywords Paper

recommendation systems, data mining, online learning

0

0

0

0

3:15

06/12/2021

Differentiable Equilibrium Computation with Decision Diagrams for Stackelberg Models of Combinatorial Congestion Games

Shinsaku Sakaue, Kengo Nakamura

Keywords Paper

optimization

0

0

0

0

15:07

25/07/2020

Controlling fairness and bias in dynamic learning-to-rank

Marco Morik, Ashudeep Singh, Jessica Hong, Thorsten Joachims

Keywords Paper

learning-to-rank, selection bias, exposure, fairness, ranking, bias

0

0

0

0

13:55

02/02/2021

Selfish Creation of Social Networks

Davide Bilò, Tobias Friedrich, Pascal Lenzner and
Stefanie Lowski, Anna Melnichenko

Keywords Paper

0

0

0

0

19:25

03/08/2020

Brief announcement: Deterministic lower bound for dynamic balanced graph partitioning

Maciej Pacut, Mahmoud Parham, Stefan Schmid

Keywords Paper

online algorithms, graph partitioning, self-adjusting networks

0

0

0

0

10:22

19/08/2021

Controlling Fairness and Bias in Dynamic Learning-to-Rank (Extended Abstract)

Marco Morik, Ashudeep Singh, Jessica Hong, Thorsten Joachims

Keywords Paper

Machine Learning, Learning Preferences or Rankings, Fairness, Information Retrieval, Online Learning

0

0

0

0

14:01

02/02/2021

Dec-SGTS: Decentralized Sub-Goal Tree Search for Multi-Agent Coordination

Minglong Li, Zhongxuan Cai, Wenjing Yang and
Lixia Wu, Yinghui Xu, Ji Wang

Keywords Paper

0

0

0

0

14:45

06/12/2021

Learning Equilibria in Matching Markets from Bandit Feedback

Meena Jagadeesan, Alexander Wei, Yixin Wang and
Michael Jordan, Jacob Steinhardt

Keywords Paper

bandits

0

0

0

0

15:04

02/02/2021

Signaling in Bayesian Network Congestion Games: the Subtle Power of Symmetry

Matteo Castiglioni, Andrea Celli, Alberto Marchesi, Nicola Gatti

Keywords Paper

0

0

0

0

15:03

16/11/2020

The Emergence of Adversarial Communication in Multi-Agent Reinforcement Learning

Jan Blumenkamp, Amanda Prorok

Keywords Paper

0

0

0

0

4:51

18/07/2021

Regularized Online Allocation Problems: Fairness and Beyond

Santiago Balseiro, Haihao Lu, Vahab Mirrokni

Keywords Paper

Algorithms, Online Learning Algorithms

0

0

0

0

5:23

06/12/2021

One More Step Towards Reality: Cooperative Bandits with Imperfect Communication

Udari Madhushani, Abhimanyu Dubey, Naomi Leonard, Alex Pentland

Keywords Paper

bandits

0

0

0

0

15:01

26/04/2020

Learning Nearly Decomposable Value Functions Via Communication Minimization

Tonghan Wang, Jianhao Wang, Chongyi Zheng, Chongjie Zhang

Keywords Paper

Multi-agent reinforcement learning, Nearly decomposable value function, Minimized communication

0

0

0

0

5:00

19/10/2020

Decoupled graph convolution network for inferring substitutable and complementary items

Yiding Liu, Yulong Gu, Zhuoye Ding and
Junchao Gao, Ziyi Guo, Yongjun Bao, Weipeng Yan

Keywords Paper

recommender systems, graph convolution network

0

0

0

0

8:21

02/02/2021

Decentralized Policy Gradient Descent Ascent for Safe Multi-Agent Reinforcement Learning

Songtao Lu, Kaiqing Zhang, Tianyi Chen and
Tamer Başar, Lior Horesh

Keywords Paper

0

0

0

0

16:54

03/08/2020

Distributed computation and reconfiguration in actively dynamic networks

Othon Michail, George Skretas, Paul G. Spirakis

Keywords Paper

polylogarithmic time, distributed algorithms, edge complexity, transformation, reconfiguration, dynamic networks

0

0

0

0

24:10

06/12/2021

Multi-Agent Reinforcement Learning in Stochastic Networked Systems

Yiheng Lin, Guannan Qu, Longbo Huang, Adam Wierman

Keywords Paper

reinforcement learning and planning, graph learning

0

0

0

0

11:20

06/12/2021

Learning Collaborative Policies to Solve NP-hard Routing Problems

Minsu Kim, Jinkyoo Park, joungho kim

Keywords Paper

reinforcement learning and planning

0

0

0

0

15:03

25/07/2020

Transfer learning via contextual invariants for one-to-many cross-domain recommendation

Adit Krishnan, Mahashweta Das, Mangesh Bendre and
Hao Yang, Hari Sundaram

Keywords Paper

data sparsity, transfer learning, cross-domain recommendation, contextual invariants, neural layer adaptation

0

0

0

0

19:24

22/09/2020

Exploring clustering of bandits for online recommendation system

Liu Yang, Bo Liu, Leyu Lin and
Feng Xia, Kai Chen, Qiang Yang

Keywords Paper

online learning, cluster-of-bandit, recommendation system

0

0

0

0

2:57

25/07/2020

Evolutionary product description generation: A dynamic fine-tuning approach leveraging user click behavior

Yongzhen Wang, Jian Wang, Heng Huang and
Hongsong Li, Xiaozhong Liu

Keywords Paper

product description generation, neural network, sequence-to-sequence, click-through rate, reinforcement learning

0

0

0

0

14:34

26/10/2020

Contention-Aware Mapping and Scheduling Optimization for NoC-Based MPSoCs

Rongjie Yan, Yupeng Zhou, Anyu Cai and
Changwen Li, Yige Yan, Minghao Yin

Keywords Paper

MPSoCs, mapping and scheduling, multi-objective optimization, genetic algorithms, local search, exact methods

0

0

0

0

9:31

02/02/2021

Evolutionary Game Theory Squared: Evolving Agents in Endogenously Evolving Zero-Sum Games

Stratis Skoulakis, Tanner Fiez, Ryann Sim and
Georgios Piliouras, Lillian Ratliff

Keywords Paper

0

0

0

0

20:14

26/08/2020

Communication-Efficient Distributed Optimization in Networks with Gradient Tracking and Variance Reduction

Boyue Li, Shicong Cen, Yuxin Chen, Yuejie Chi

Keywords Paper

0

0

0

0

13:50

02/02/2021

Who You Would Like to Share With? A Study of Share Recommendation in Social E-commerce

Houye Ji, Junxiong Zhu, Xiao Wang and
Chuan Shi, Bai Wang, Xiaoye Tan, Yanghua Li, Shaojian He

Keywords Paper

0

0

0

0

14:03

13/04/2021

Multitask bandit learning through heterogeneous feedback aggregation

Zhi Wang, Chicheng Zhang, Manish Kumar Singh and
Laurel Riek, Kamalika Chaudhuri

Keywords Paper

0

0

0

0

3:07

25/07/2020

TAGNN: Target attentive graph neural networks for session-based recommendation

Feng Yu, Yanqiao Zhu, Qiang Liu and
Shu Wu, Liang Wang, Tieniu Tan

Keywords Paper

session-based recommendation, target attention, graph neural networks

0

0

0

0

7:31

12/07/2020

Q-value Path Decomposition for Deep Multiagent Reinforcement Learning

Yaodong Yang, Jianye Hao, Guangyong Chen and
Hongyao Tang, Yingfeng Chen, Yujing Hu, Changjie Fan, Zhongyu Wei

Keywords Paper

Planning, Control, and Multiagent Learning

0

0

0

0

6:42

18/07/2021

Multi-layered Network Exploration via Random Walks: From Offline Optimization to Online Learning

Xutong Liu, Jinhang Zuo, Xiaowei Chen and
Wei Chen, John C. S. Lui

Keywords Paper

Optimization, Optimization, Convex Optimization, Reinforcement Learning and Planning, Bandits

0

0

0

0

17:45

25/07/2020

Influence function for unbiased recommendation

Jiangxing Yu, Hong Zhu, Chih-Yao Chang and
Xinhua Feng, Bowen Yuan, Xiuqiang He, Zhenhua Dong

Keywords Paper

recommender system, influence function, counterfactual learning

0

0

0

0

9:43

12/07/2020

A distributional view on multi objective policy optimization

Abbas Abdolmaleki, Sandy Huang, Leonard Hasenclever and
Michael Neunert, Martina Zambelli, Murilo Martins, Francis Song, Nicolas Heess, Raia Hadsell, Martin Riedmiller

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

15:04

06/12/2021

Federated Linear Contextual Bandits

Ruiquan Huang, Weiqiang Wu, Jing Yang, Cong Shen

Keywords Paper

bandits

0

0

0

0

14:40

12/07/2020

Robust Pricing in Dynamic Mechanism Design

Yuan Deng, Sébastien Lahaie, Vahab Mirrokni

Keywords Paper

Learning Theory

0

0

0

0

15:48

23/08/2020

Controllable multi-interest framework for recommendation

Yukuo Cen, Jianwei Zhang, Xu Zou and
Chang Zhou, Hongxia Yang, Jie Tang

Keywords Paper

recommender system, multi-interest framework, sequential recommendation

0

0

0

0

15:59