The Adaptive Doubly Robust Estimator and a Paradox Concerning Logging Policy

06/12/2021

The Adaptive Doubly Robust Estimator and a Paradox Concerning Logging Policy

Masahiro Kato, Kenichiro McAlinn, Shota Yasui

Keywords: machine learning, causality

Abstract Paper Similar Papers

Abstract: The doubly robust (DR) estimator, which consists of two nuisance parameters, the conditional mean outcome and the logging policy (the probability of choosing an action), is crucial in causal inference. This paper proposes a DR estimator for dependent samples obtained from adaptive experiments. To obtain an asymptotically normal semiparametric estimator from dependent samples without non-Donsker nuisance estimators, we propose adaptive-fitting as a variant of sample-splitting. We also report an empirical paradox that our proposed DR estimator tends to show better performances compared to other estimators utilizing the true logging policy. While a similar phenomenon is known for estimators with i.i.d. samples, traditional explanations based on asymptotic efficiency cannot elucidate our case with dependent samples. We confirm this hypothesis through simulation studies.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

18/07/2021

Optimal Off-Policy Evaluation from Multiple Logging Policies

Nathan Kallus, Yuta Saito, Masatoshi Uehara

Keywords Paper

Probabilistic Methods, Causal Inference

0

0

0

0

5:24

06/12/2021

Provably Efficient Causal Reinforcement Learning with Confounded Observational Data

Lingxiao Wang, Zhuoran Yang, Zhaoran Wang

Keywords Paper

deep learning, reinforcement learning and planning, causality

0

0

0

0

14:54

23/08/2020

Joint policy-value learning for recommendation

Olivier Jeunen, David Rohde, Flavian Vasile, Martin Bompaire

Keywords Paper

bandit feedback, counterfactual learning, policy learning

0

0

0

0

12:15

02/02/2021

Learning from eXtreme Bandit Feedback

Romain Lopez, Inderjit S. Dhillon, Michael I. Jordan

Keywords Paper

0

0

0

0

19:29

26/08/2020

Asymptotically Efficient Off-Policy Evaluation for Tabular Reinforcement Learning

Ming Yin, Yu-Xiang Wang

Keywords Paper

0

0

0

0

14:17

02/02/2021

A Theory of Independent Mechanisms for Extrapolation in Generative Models

Michel Besserve, Remy Sun, Dominik Janzing, Bernhard Schölkopf

Keywords Paper

0

0

0

0

18:37

06/12/2021

Bayesian Adaptation for Covariate Shift

Aurick Zhou, Sergey Levine

Keywords Paper

deep learning, machine learning, robustness, vision, domain adaptation

0

0

0

0

8:21

18/07/2021

Offline Contextual Bandits with Overparameterized Models

David Brandfonbrener, Will Whitney, Rajesh Ranganath, Joan Bruna

Keywords Paper

Optimization, Non-Convex Optimization, Reinforcement Learning and Planning, Optimization, Stochastic Optimization

0

0

0

1

6:07

06/12/2020

Joints in Random Forests

Alvaro Correia, Robert Peharz, Cassio de Campos

Keywords Paper

0

0

0

0

2:28

06/12/2021

Nonuniform Negative Sampling and Log Odds Correction with Rare Events Data

HaiYing Wang, Aonan Zhang, Chong Wang

Keywords Paper

0

0

0

0

14:58

18/11/2020

Robust deep ordinal regression under label noise

Bhanu Garg, Naresh Manwani

Keywords Paper

0

0

0

0

12:03

06/12/2021

Integrated Latent Heterogeneity and Invariance Learning in Kernel Space

Jiashuo Liu, Zheyuan Hu, Peng Cui and
Bo Li, Zheyan Shen

Keywords Paper

deep learning, reinforcement learning and planning, machine learning

0

0

0

0

11:11

16/11/2020

Discriminatively-Tuned Generative Classifiers for Robust Natural Language Inference

Xiaoan Ding, Tianyu Liu, Baobao Chang and
Zhifang Sui, Kevin Gimpel

Keywords Paper

natural inference, nli tasks, discriminative fine-tuning, discriminative classifiers

0

0

0

0

11:37

06/12/2020

A Class of Algorithms for General Instrumental Variable Models

Niki Kilbertus, Matt Kusner, Ricardo Silva

Keywords Paper

0

0

0

0

3:13

19/08/2021

Unifying Online and Counterfactual Learning to Rank: A Novel Counterfactual Estimator that Effectively Utilizes Online Interventions (Extended Abstract)

Harrie Oosterhuis, Maarten de Rijke

Keywords Paper

Machine Learning, Recommender Systems, Online Learning, Information Retrieval

0

0

0

0

15:01

13/04/2021

Causal autoregressive flows

Ilyes Khemakhem, Ricardo Monti, Robert Leech, Aapo Hyvarinen

Keywords Paper

0

0

0

0

2:54

06/12/2021

Bayesian decision-making under misspecified priors with applications to meta-learning

Max Simchowitz, Christopher Tosh, Akshay Krishnamurthy and
Daniel Hsu, Thodoris Lykouris, Miro Dudik, Robert E Schapire

Keywords Paper

meta learning, bandits

0

0

0

0

14:58

06/12/2021

Hybrid Regret Bounds for Combinatorial Semi-Bandits and Adversarial Linear Bandits

Shinji Ito

Keywords Paper

bandits

0

0

0

0

10:49

12/07/2020

Unbiased Risk Estimators Can Mislead: A Case Study of Learning with Complementary Labels

Yu-Ting Chou, Gang Niu, Hsuan-Tien Lin, Masashi Sugiyama

Keywords Paper

Unsupervised and Semi-Supervised Learning

0

0

0

0

14:39

12/07/2020

Unlabelled Data Improves Bayesian Uncertainty Calibration under Covariate Shift

Alexander Chan, Ahmed Alaa, Zhaozhi Qian, Mihaela van der Schaar

Keywords Paper

Trustworthy Machine Learning

0

0

0

0

14:59

06/12/2021

Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networks

Rong Zhu, Mattia Rigotti

Keywords Paper

theory, deep learning, reinforcement learning and planning, bandits

0

0

0

0

8:45

03/08/2020

Regret Analysis of Bandit Problems with Causal Background Knowledge

Yangyi Lu, Amirhossein Meisami, Ambuj Tewari, William Yan

Keywords Paper

0

0

0

0

7:32

06/12/2021

Beware of the Simulated DAG! Causal Discovery Benchmarks May Be Easy to Game

Alexander Reisach, Christof Seiler, Sebastian Weichwald

Keywords Paper

optimization, graph learning, causality

0

0

0

0

14:13

26/04/2020

Simple and Effective Regularization Methods for Training on Noisily Labeled Data with Generalization Guarantee

Wei Hu, Zhiyuan Li, Dingli Yu

Keywords Paper

deep learning theory, regularization, noisy labels

0

0

0

0

5:13

26/04/2020

Can gradient clipping mitigate label noise?

Aditya Krishna Menon, Ankit Singh Rawat, Sashank J. Reddi, Sanjiv Kumar

Keywords Paper

0

0

0

0

4:56

02/02/2021

Adversarial Robustness through Disentangled Representations

Shuo Yang, Tianyu Guo, Yunhe Wang, Chang Xu

Keywords Paper

0

0

0

0

15:00

06/12/2020

Reconsidering Generative Objectives For Counterfactual Reasoning

Danni Lu, Chenyang Tao, Junya Chen and
Fan Li, Feng Guo, Lawrence Carin

Keywords Paper

0

0

0

0

3:22

06/12/2021

Counterfactual Maximum Likelihood Estimation for Training Deep Networks

Xinyi Wang, Wenhu Chen, Michael Saxon, William Yang Wang

Keywords Paper

deep learning, domain adaptation, causality, language

0

0

0

0

14:07

06/12/2021

The balancing principle for parameter choice in distance-regularized domain adaptation

Werner Zellinger, Natalia Shepeleva, Marius-Constantin Dinu and
Hamid Eghbal-zadeh, Hoan Duc Nguyen, Bernhard Nessler, Sergei Pereverzyev, Bernhard A. Moser

Keywords Paper

domain adaptation

0

0

0

0

12:47

03/05/2021

Influence Estimation for Generative Adversarial Networks

Naoyuki Terashita, Hiroki Ohashi, Yuichi Nonaka, Takashi Kanemaru

Keywords Paper

influence, data cleansing, generative adversarial networks

0

0

1

1

10:18

12/07/2020

Improving Robustness of Deep-Learning-Based Image Reconstruction

Ankit Raj, Yoram Bresler, Bo Li

Keywords Paper

Trustworthy Machine Learning

0

0

0

0

15:12

03/08/2020

Joint Stochastic Approximation and Its Application to Learning Discrete Latent Variable Models

Zhijian Ou, Yunfu Song

Keywords Paper

0

0

0

0

8:24

18/07/2021

Analyzing the tree-layer structure of Deep Forests

Ludovic Arnould, Claire Boyer, Erwan Scornet

Keywords Paper

Theory, Deep learning Theory

0

0

0

0

5:21

18/07/2021

Wasserstein Distributional Normalization For Robust Distributional Certification of Noisy Labeled Data

Sung Woo Park, Junseok Kwon

Keywords Paper

Deep Learning, Generative Models, Algorithms, Representation Learning; Optimization, Submodular Optimization, Probabilistic Methods, Robust statistics

0

0

0

0

5:20

18/07/2021

Proximal Causal Learning with Kernels: Two-Stage Estimation and Moment Restriction

Afsaneh Mastouri, Yuchen Zhu, Limor Gultchin and
Anna Korba, Ricardo Silva, Matt J. Kusner, Arthur Gretton, Krikamol Muandet

Keywords Paper

Algorithms, Kernel Methods

0

0

0

0

5:10

06/12/2021

Understanding Negative Samples in Instance Discriminative Self-supervised Representation Learning

Kento Nozawa, Issei Sato

Keywords Paper

machine learning, representation learning

0

0

0

0

8:50

12/07/2020

Bayesian Differential Privacy for Machine Learning

Aleksei Triastcyn, Boi Faltings

Keywords Paper

Privacy-preserving Statistics and Machine Learning

0

0

0

0

15:10

19/08/2021

Closing the BIG-LID: An Effective Local Intrinsic Dimensionality Defense for Nonlinear Regression Poisoning

Sandamal Weerasinghe, Tamas Abraham, Tansu Alpcan and
Sarah M. Erfani, Christopher Leckie, Benjamin I. P. Rubinstein

Keywords Paper

Machine Learning, Adversarial Machine Learning, Security and Privacy, Anomaly/Outlier Detection

0

0

0

0

17:05

25/07/2020

Accelerated convergence for counterfactual learning to rank

Rolf Jagerman, Maarten Rijke

Keywords Paper

unbiased learning, counterfactual learning, learning to rank

0

0

0

0

14:21

06/12/2021

NeuroMLR: Robust & Reliable Route Recommendation on Road Networks

Jayant Jain, Vrittika Bagadia, Sahil Manchanda, Sayan Ranu

Keywords Paper

deep learning, generative model, graph learning

0

0

0

0

6:33