Local Differential Privacy for Regret Minimization in Reinforcement Learning

Abstract: Reinforcement learning algorithms are widely used in domains where a personalized service is desirable. In these domains, user data often contains sensitive information that must be protected from third parties. Motivated by this, we study privacy in finite-horizon Markov Decision Processes (MDPs) by requiring information to be obfuscated on the user side. We formulate this notion of privacy for RL using the local differential privacy (LDP) framework. We establish a lower bound for regret minimization in finite-horizon MDPs with LDP guarantees, showing that guaranteeing privacy has a multiplicative effect on the regret. Thus, while LDP is an appealing notion of privacy, it makes the learning problem significantly harder. Finally, we present an optimistic algorithm that satisfies $\varepsilon$-LDP requirements and achieves $\sqrt{K}/\varepsilon$ regret in any finite-horizon MDP after $K$ episodes, matching the lower bound's dependence on the number of episodes $K$.
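
To make "obfuscated on the user side" concrete, here is a minimal sketch of the standard Laplace mechanism applied to a single reward in [0, 1] before it leaves the user's device. It is not the algorithm from the paper, which privatizes whole trajectories; the function name `privatize_reward` and the choice to privatize one scalar are assumptions made for illustration only.

```python
import random


def privatize_reward(reward, epsilon, rng):
    """Return an epsilon-LDP release of one reward assumed to lie in [0, 1].

    Hypothetical helper for illustration. The reward is clipped to [0, 1],
    so two possible inputs differ by at most 1 and Laplace noise with
    scale 1 / epsilon suffices for this single value.
    """
    if epsilon <= 0:
        raise ValueError("epsilon must be positive")
    clipped = min(max(reward, 0.0), 1.0)
    # The difference of two i.i.d. exponentials with rate epsilon is a
    # Laplace variable with location 0 and scale 1 / epsilon.
    noise = rng.expovariate(epsilon) - rng.expovariate(epsilon)
    return clipped + noise


if __name__ == "__main__":
    rng = random.Random(0)
    true_reward = 0.7
    for eps in (0.1, 1.0, 10.0):
        reports = [privatize_reward(true_reward, eps, rng) for _ in range(5000)]
        mean = sum(reports) / len(reports)
        print(f"epsilon={eps}: mean of 5000 private reports = {mean:.3f}")
```

The noise scale grows as $1/\varepsilon$, so a learner that only sees such reports needs more samples to reach the same accuracy as $\varepsilon$ shrinks. This gives some intuition for the $1/\varepsilon$ factor in the regret bound, although the sketch does not reproduce the paper's analysis.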

12/07/2020

Evrard Garcelon, Vianney Perchet, Ciara Pike-Burke, Matteo Pirotta

Similar Papers

Reinforcement Learning with Differential Privacy

Giuseppe Vietri, Borja de Balle Pigem, Steven Wu, Akshay Krishnamurthy

Reinforcement Learning - General

Generalized Linear Bandits with Local Differential Privacy

Yuxuan Han, Zhipeng Liang, Yang Wang, Jiheng Zhang

optimization, bandits, privacy

Context Aware Local Differential Privacy

Jayadev Acharya, Kallista Bonawitz, Peter Kairouz, Daniel Ramage, Ziteng Sun

Privacy-preserving Statistics and Machine Learning

Learning discrete distributions: user vs item-level privacy

Yuhan Liu, Ananda Theertha Suresh, Felix Xinnan Yu, Sanjiv Kumar, Michael D Riley

Differentially private monotone submodular maximization under matroid and knapsack constraints

Omid Sadeghi, Maryam Fazel

Local Differential Privacy for Bayesian Optimization

Xingyu Zhou, Jian Tan

Smoothed Analysis of Online and Differentially Private Learning

Nika Haghtalab, Tim Roughgarden, Abhishek Shetty

Algorithms -> Multitask and Transfer Learning

Learning Model-Based Privacy Protection under Budget Constraints

Junyuan Hong, Haotao Wang, Zhangyang Wang, Jiayu Zhou

Antipodes of Label Differential Privacy: PATE and ALIBI

Mani Malek Esmaeili, Ilya Mironov, Karthik Prasad, Igor Shilov, Florian Tramer

machine learning, privacy, semi-supervised learning

ADePT: Auto-encoder based differentially private text transformation

Satyapriya Krishna, Rahul Gupta, Christophe Dupuy

An Uncertainty Principle is a Price of Privacy-Preserving Microdata

John Abowd, Robert Ashmead, Ryan Cumings-Menon, Simson Garfinkel, Daniel Kifer, Philip Leclerc, William Sexton, Ashley Simpson, Christine Task, Pavel Zhuravlev

privacy

H-FL: A Hierarchical Communication-Efficient and Privacy-Protected Architecture for Federated Learning

He Yang

Agent-based and Multi-agent Systems, Coordination and Cooperation, Security and Privacy

LDP-FL: Practical Private Aggregation in Federated Learning with Local Differential Privacy

Lichao Sun, Jianwei Qian, Xun Chen

Data Mining, Federated Learning, Privacy Preserving Data Mining, Multi-agent Learning, Trustable Learning

Parameter-free HE-friendly Logistic Regression

Junyoung Byun, Woojin Lee, Jaewook Lee

machine learning, privacy

(Locally) Differentially Private Combinatorial Semi-Bandits

Xiaoyu Chen, Kai Zheng, Zixin Zhou, Yunchang Yang, Wei Chen, Liwei Wang

Privacy-preserving Statistics and Machine Learning

Do not Let Privacy Overbill Utility: Gradient Embedding Perturbation for Private Learning

Da Yu, Huishuai Zhang, Wei Chen, Tie-Yan Liu

gradient redundancy, differentially private deep learning, privacy preserving machine learning

High-Dimensional Sparse Linear Bandits

Botao Hao, Tor Lattimore, Mengdi Wang

PCKV: Locally Differentially Private Correlated Key-Value Data Collection with Optimized Utility

Xiaolan Gu, Ming Li, Yueqiang Cheng, Li Xiong, Yang Cao

Evade Deep Image Retrieval by Stashing Private Images in the Hash Space

Yanru Xiao, Cong Wang, Xing Gao

deep learning to hash, adversarial learning, privacy preservation

FedRec++: Lossless Federated Recommendation with Explicit Feedback

Feng Liang, Weike Pan, Zhong Ming

Shuffled model of differential privacy in federated learning

Antonious Girgis, Deepesh Data, Suhas Diggavi, Peter Kairouz, Ananda Theertha Suresh

Sharp Composition Bounds for Gaussian Differential Privacy via Edgeworth Expansion
