Unifying Online and Counterfactual Learning to Rank: A Novel Counterfactual Estimator that Effectively Utilizes Online Interventions (Extended Abstract)

19/08/2021

Harrie Oosterhuis, Maarten de Rijke

Keywords: Machine Learning, Recommender Systems, Online Learning, Information Retrieval

Abstract: State-of-the-art Learning to Rank (LTR) methods for optimizing ranking systems based on user interactions are divided into online approaches – that learn by direct interaction – and counterfactual approaches – that learn from historical interactions. We propose a novel intervention-aware estimator to bridge this online/counterfactual division. The estimator corrects for the effect of position bias, trust bias, and item-selection bias by using corrections based on the behavior of the logging policy and on online interventions: changes to the logging policy made during the gathering of click data. Our experimental results show that, unlike existing counterfactual LTR methods, the intervention-aware estimator can greatly benefit from online interventions. To the best of our knowledge, this is the first method that is shown to be highly effective in both online and counterfactual scenarios.
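
As a rough illustration of the idea in the abstract, the following is a minimal sketch, not taken from the paper, of a relevance estimate whose corrections are averaged over every logging policy deployed while the clicks were gathered. It assumes an affine click model, P(click | item shown at rank k) = alpha[k] * relevance + beta[k], where alpha models position bias and beta models trust bias, and items that are not displayed cannot be clicked (item-selection bias). The function name, argument names, and array shapes are hypothetical.

```python
import numpy as np


def intervention_aware_relevance(clicks, display_probs, alpha, beta):
    """Estimate per-item relevance from clicks logged under changing policies.

    clicks:        (T, D) array, clicks[t, d] is the click signal for item d at step t.
    display_probs: (T, D, K) array, probability that the policy deployed at
                   step t shows item d at rank k (assumed known from logging).
    alpha, beta:   (K,) arrays of the assumed affine click model.
    """
    clicks = np.asarray(clicks, dtype=float)
    display_probs = np.asarray(display_probs, dtype=float)

    # Expected multiplicative and additive bias per item, averaged over ALL
    # time steps, so every deployed policy (every intervention) contributes.
    rho = (display_probs @ alpha).mean(axis=0)    # shape (D,)
    omega = (display_probs @ beta).mean(axis=0)   # shape (D,)

    if np.any(rho <= 0):
        raise ValueError("some item was never displayed by any logging policy")

    # Subtract the additive (trust) term, then divide by the multiplicative one.
    return (clicks.mean(axis=0) - omega) / rho
```

Because the weights are averaged over all time steps, an item that one policy never displays still receives a finite correction as long as another deployed policy shows it, which is where an online intervention helps in this sketch.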

The talk and the corresponding paper were published at the IJCAI 2021 virtual conference.

Similar Papers

19/08/2021

User Retention: A Causal Approach with Triple Task Modeling

Yang Zhang, Dong Wang, Qiang Li, Yue Shen, Ziqi Liu, Xiaodong Zeng, Zhiqiang Zhang, Jinjie Gu, Derek F. Wong

Keywords: Machine Learning, Deep Learning, Applications of Supervised Learning, Recommender Systems

13:53

25/07/2020

Policy-aware unbiased learning to rank for top-k rankings

Harrie Oosterhuis, Maarten de Rijke

Keywords: recommendation, selection bias, counterfactual learning to rank, learning to rank, counterfactual learning, top-k ranking

20:00

14/09/2020

Mend The Learning Approach, Not the Data: Insights for Ranking E-Commerce Products

Muhammad Umer Anwaar, Dmytro Rybalko, Martin Kleinsteuber

Keywords: information retrieval, ranking and preference learning, learning to rank, e-commerce search, implicit feedback, counterfactual risk minimization, dataset, mining data logs

11:21

26/04/2020

Black-Box Adversarial Attack with Transferable Model-based Embedding

Zhichao Huang, Tong Zhang

Keywords: adversarial examples, black-box attack, embedding

4:48

25/07/2020

Accelerated convergence for counterfactual learning to rank

Rolf Jagerman, Maarten de Rijke

Keywords: unbiased learning, counterfactual learning, learning to rank

14:21

25/07/2020

Disentangled graph collaborative filtering

Xiang Wang, Hongye Jin, An Zhang, Xiangnan He, Tong Xu, Tat-Seng Chua

Keywords: explainable recommendation, disentangled representation learning, collaborative filtering, graph neural networks

15:17

02/02/2021

A General Offline Reinforcement Learning Framework for Interactive Recommendation

Teng Xiao, Donglin Wang

14:16

22/09/2020

Exploring clustering of bandits for online recommendation system

Liu Yang, Bo Liu, Leyu Lin, Feng Xia, Kai Chen, Qiang Yang

Keywords: online learning, cluster-of-bandit, recommendation system

2:57

19/08/2021

Exploring Periodicity and Interactivity in Multi-Interest Framework for Sequential Recommendation

Gaode Chen, Xinghua Zhang, Yanyan Zhao, Cong Xue, Ji Xiang

Keywords: Data Mining, Recommender Systems, Big Data, Large-Scale Systems

15:20

25/07/2020

A deep recurrent survival model for unbiased ranking

Jiarui Jin, Yuchen Fang, Weinan Zhang, Kan Ren, Guorui Zhou, Jian Xu, Yong Yu, Jun Wang, Xiaoqiang Zhu, Kun Gai

Keywords: cascade model, unbiased learning-to-rank, position bias

11:32

02/06/2020

Entity Summarization with User Feedback

Qingxia Liu, Yue Chen, Gong Cheng, Evgeny Kharlamov, Junyou Li, Yuzhong Qu

21:30

02/02/2021

Learning from eXtreme Bandit Feedback

Romain Lopez, Inderjit S. Dhillon, Michael I. Jordan

19:29

06/12/2021

Understanding Negative Samples in Instance Discriminative Self-supervised Representation Learning

Kento Nozawa, Issei Sato

Keywords: machine learning, representation learning

8:50

18/07/2021

DriftSurf: Stable-State / Reactive-State Learning under Concept Drift

Ashraf Tahmasbi, Ellango Jothimurugesan, Srikanta Tirthapura, Phil Gibbons

Keywords: Algorithms, Online Learning Algorithms

5:07

03/05/2021

OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning

Anurag Ajay, Aviral Kumar, Pulkit Agrawal, Sergey Levine, Ofir Nachum

Keywords: Unsupervised Learning, Offline Reinforcement Learning, Primitive Discovery

5:08

02/02/2021

Graph Heterogeneous Multi-Relational Recommendation

Chong Chen, Weizhi Ma, Min Zhang, Zhaowei Wang, Xiuqiang He, Chenyang Wang, Yiqun Liu, Shaoping Ma

13:49

26/08/2020

Online Binary Space Partitioning Forests

Xuhui Fan, Bin Li, Scott Sisson

15:02

19/10/2020

Improving multi-scenario learning to rank in e-commerce by exploiting task relationships in the label space

Pengcheng Li, Runze Li, Qing Da, An-Xiang Zeng, Lijun Zhang

Keywords: learning to rank, neural networks, e-commerce, multi-task learning

8:45

06/12/2021

Provably Efficient Causal Reinforcement Learning with Confounded Observational Data

Lingxiao Wang, Zhuoran Yang, Zhaoran Wang

Keywords: deep learning, reinforcement learning and planning, causality

14:54

11/08/2020

Classic meets modern: A pragmatic learning-based congestion control for the internet

Soheil Abbasloo, Chen-Yu Yen, H. Jonathan Chao

Keywords: Congestion Control, TCP, Deep Reinforcement Learning

14:09

18/07/2021

Robust Representation Learning via Perceptual Similarity Metrics

Saeid A Taghanaki, Kristy Choi, Amir Hosein Khasahmadi, Anirudh Goyal

Keywords: Deep Learning, Embedding and Representation learning

4:48

07/09/2020

BriNet: Towards Bridging the Intra-class and Inter-class Gaps in One-Shot Segmentation

Xianghui Yang, Bairun Wang, Xinchi Zhou, Kaige Chen, Shuai Yi, Wanli Ouyang, Luping Zhou

Keywords: Few-shot Semantic Segmentation, Few-shot learning, Semantic Segmentation

8:26

06/12/2020

Differentiable Causal Discovery from Interventional Data

Philippe Brouillard, Sébastien Lachapelle, Alexandre Lacoste, Simon Lacoste-Julien, Alexandre Drouin

3:13

23/08/2020

Time-aware user embeddings as a service

Martin Pavlovski, Jelena Gligorijevic, Ivan Stojkovic, Shubham Agrawal, Shabhareesh Komirishetty, Djordje Gligorijevic, Narayan Bhamidipati, Zoran Obradovic

Keywords: sequential models, user representation, neural embeddings

19:42

02/02/2021

Visualization of Supervised and Self-Supervised Neural Networks via Attribution Guided Factorization

Shir Gur, Ameen Ali, Lior Wolf

14:14

19/10/2020

Zero-shot heterogeneous transfer learning from recommender systems to cold-start search retrieval

Tao Wu, Ellie Ka-In Chio, Heng-Tze Cheng, Yu Du, Steffen Rendle, Dima Kuzmin, Ritesh Agarwal, Li Zhang, John Anderson, Sarvjeet Singh, Tushar Chandra, Ed H. Chi, Wen Li, Ankit Kumar, Xiang Ma, Alex Soares, Nitin Jindal, Pei Cao

Keywords: search, recommender systems, zero-shot learning, transfer learning

9:53

06/12/2021

Detecting and Adapting to Irregular Distribution Shifts in Bayesian Online Learning

Aodong Li, Alex Boyd, Padhraic Smyth, Stephan Mandt

Keywords: self-supervised learning, online learning, continual learning

14:58

02/02/2021

A Simple and Effective Self-Supervised Contrastive Learning Framework for Aspect Detection

Tian Shi, Liuqing Li, Ping Wang, Chandan K. Reddy

19:21

18/07/2021

Message Passing Adaptive Resonance Theory for Online Active Semi-supervised Learning

Taehyeong Kim, Injune Hwang, Hyundo Lee, Hyunseo Kim, Won-Seok Choi, Joseph Lim, Byoung-Tak Zhang

Keywords: Algorithms, Active Learning

4:53

25/07/2020

TAGNN: Target attentive graph neural networks for session-based recommendation

Feng Yu, Yanqiao Zhu, Qiang Liu, Shu Wu, Liang Wang, Tieniu Tan

Keywords: session-based recommendation, target attention, graph neural networks

7:31

23/08/2020

Meta-learning on heterogeneous information networks for cold-start recommendation

Yuanfu Lu, Yuan Fang, Chuan Shi

Keywords: heterogeneous information network, meta-learning, cold-start recommendation

15:04

14/09/2020

Neural User Embedding From Browsing Events

Mingxiao An, Sundong Kim

Keywords: user embedding, web browsing, demographic prediction

13:58

22/11/2021

DEX: Domain Embedding Expansion for Generalized Person Re-identification

Phuay Wee Eugene Ang, Shan Lin, Alex Kot

Keywords: person re-identification, domain generalization, data augmentation

2:57

03/05/2021

MetaNorm: Learning to Normalize Few-Shot Batches Across Domains

Yingjun Du, Xiantong Zhen, Ling Shao, Cees G Snoek

Keywords: batch normalization, Meta-learning, few-shot domain generalization

5:48

06/12/2021

Online Adaptation to Label Distribution Shift

Ruihan Wu, Chuan Guo, Yi Su, Kilian Weinberger

Keywords: optimization, machine learning, online learning

9:46

25/07/2020

Item tagging for information retrieval: A tripartite graph neural network based approach

Kelong Mao, Xi Xiao, Jieming Zhu, Biao Lu, Ruiming Tang, Xiuqiang He

Keywords: item tagging, graph neural networks, information retrieval

16:53

02/02/2021

Model Uncertainty Guides Visual Object Tracking

Lijun Zhou, Antoine Ledent, Qintao Hu, Ting Liu, Jianlin Zhang, Marius Kloft

18:06

06/12/2020

Mitigating Forgetting in Online Continual Learning via Instance-Aware Parameterization

Hung-Jen Chen, An-Chieh Cheng, Da-Cheng Juan, Wei Wei, Min Sun

3:23

06/12/2021

The Adaptive Doubly Robust Estimator and a Paradox Concerning Logging Policy

Masahiro Kato, Kenichiro McAlinn, Shota Yasui

Keywords: machine learning, causality

14:41

06/12/2020

Online Influence Maximization under Linear Threshold Model

Shuai Li, Fang Kong, Kejie Tang, Qizhi Li, Wei Chen

3:15