Differentially Private n-gram Extraction

06/12/2021

Differentially Private n-gram Extraction

Kunho Kim, Sivakanth Gopi, Janardhan Kulkarni, Sergey Yekhanin

Keywords: privacy

Abstract Paper Similar Papers

Abstract: We revisit the problem of $n$-gram extraction in the differential privacy setting. In this problem, given a corpus of private text data, the goal is to release as many $n$-grams as possible while preserving user level privacy. Extracting $n$-grams is a fundamental subroutine in many NLP applications such as sentence completion, auto response generation for emails, etc. The problem also arises in other applications such as sequence mining, trajectory analysis, etc., and is a generalization of recently studied differentially private set union (DPSU) by Gopi et al. (2020). In this paper, we develop a new differentially private algorithm for this problem which, in our experiments, significantly outperforms the state-of-the-art. Our improvements stem from combining recent advances in DPSU, privacy accounting, and new heuristics for pruning in the tree-based approach initiated by Chen et al. (2012).

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2020

CryptoNAS: Private Inference on a ReLU Budget

Zahra Ghodsi, Akshaj Kumar Veldanda, Brandon Reagen, Siddharth Garg

Keywords Paper

0

0

0

0

3:05

06/12/2021

Iterative Methods for Private Synthetic Data: Unifying Framework and New Methods

Terrance Liu, Giuseppe Vietri, Steven Wu

Keywords Paper

deep learning, optimization, machine learning, generative model, privacy

0

0

0

0

12:27

12/07/2020

Differentially Private Set Union

Pankaj Gulhane, Sivakanth Gopi, Janardhan Kulkarni and
Judy Hanwen Shen, Milad Shokouhi, Sergey Yekhanin

Keywords Paper

Privacy-preserving Statistics and Machine Learning

0

0

0

0

13:17

03/05/2021

R-GAP: Recursive Gradient Attack on Privacy

Junyi Zhu, Matthew Blaschko

Keywords Paper

collaborative learning, privacy leakage from gradients, federated learning

0

0

0

0

5:04

06/12/2021

Circa: Stochastic ReLUs for Private Deep Learning

Zahra Ghodsi, Nandan Kumar Jha, Brandon Reagen, Siddharth Garg

Keywords Paper

deep learning, optimization, machine learning, graph learning, privacy

0

0

0

0

12:55

12/07/2020

Instance-hiding Schemes for Private Distributed Learning

Yangsibo Huang, Zhao Song, Sanjeev Arora, Kai Li

Keywords Paper

Privacy-preserving Statistics and Machine Learning

0

0

0

0

13:29

18/07/2021

DeepReDuce: ReLU Reduction for Fast Private Inference

Nandan Kumar Jha, Zahra Ghodsi, Siddharth Garg, Brandon Reagen

Keywords Paper

Social Aspects of Machine Learning, Privacy, Anonymity, and Security

0

0

0

0

5:01

12/07/2020

Sharp Composition Bounds for Gaussian Differential Privacy via Edgeworth Expansion

Qinqing Zheng, Jinshuo Dong, Qi Long, Weijie Su

Keywords Paper

Privacy-preserving Statistics and Machine Learning

0

0

0

0

12:45

06/12/2021

Renyi Differential Privacy of The Subsampled Shuffle Model In Distributed Learning

Antonious Girgis, Deepesh Data, Suhas Diggavi

Keywords Paper

optimization, privacy, federated learning

0

0

0

1

15:50

26/08/2020

Learning Rate Adaptation for Differentially Private Learning

Antti Koskela, Antti Honkela

Keywords Paper

0

0

0

0

13:08

26/04/2020

Differentially Private Meta-Learning

Jeffrey Li, Mikhail Khodak, Sebastian Caldas, Ameet Talwalkar

Keywords Paper

Differential Privacy, Meta-Learning, Federated Learning

0

0

0

0

5:00

06/12/2020

Auditing Differentially Private Machine Learning: How Private is Private SGD?

Matthew Jagielski, Jonathan Ullman, Alina Oprea

Keywords Paper

0

0

0

0

3:28

18/07/2021

Differentially Private Sliced Wasserstein Distance

alain rakotomamonjy, Ralaivola Liva

Keywords Paper

Reinforcement Learning and Planning, Reinforcement Learning, Reinforcement Learning and Planning, Multi-Agent RL, Social Aspects of Machine Learning, Privacy, Anonymity, and Security

0

0

0

0

14:59

06/12/2020

Synthetic Data Generators -- Sequential and Private

Olivier Bousquet, Roi Livni, Shay Moran

Keywords Paper

Algorithms -> Stochastic Methods; Deep Learning -> Optimization for Deep Networks, Optimization -> Stochastic Optimization

0

0

0

0

3:15

14/06/2020

Evade Deep Image Retrieval by Stashing Private Images in the Hash Space

Yanru Xiao, Cong Wang, Xing Gao

Keywords Paper

deep learning to hash, adversarial learning, privacy preservation

0

0

0

0

1:01

23/06/2021

Vectorized Secure Evaluation of Decision Forests

Raghav Malik, Vidush Singhal, Benjamin Gottfried, Milind Kulkarni

Keywords Paper

Homomorphic Encryption, Decision Forests, Vectorization

0

0

0

0

20:43

02/02/2021

Learning Model-Based Privacy Protection under Budget Constraints

Junyuan Hong, Haotao Wang, Zhangyang Wang, Jiayu Zhou

Keywords Paper

0

0

0

0

20:02

03/08/2020

Differentially Private Top-k Selection via Stability on Unknown Domain

Ricardo Silva Carvalho, Ke Wang, Lovedeep Gondara, Chunyan Miao

Keywords Paper

0

0

0

0

7:41

19/08/2021

Federated Learning with Sparsification-Amplified Privacy and Adaptive Optimization

Rui Hu, Yanmin Gong, Yuanxiong Guo

Keywords Paper

Data Mining, Federated Learning, Security and Privacy, Privacy Preserving Data Mining

0

0

0

0

15:44

12/07/2020

Scalable Differential Privacy with Certified Robustness in Adversarial Learning

Hai Phan, My T. Thai, Han Hu and
Ruoming Jin, Tong Sun, Dejing Dou

Keywords Paper

Privacy-preserving Statistics and Machine Learning

0

0

0

0

14:47

26/08/2020

Local Differential Privacy for Sampling

Hisham Husain, Borja Balle, Zac Cranko, Richard Nock

Keywords Paper

0

0

0

0

12:55

18/07/2021

Differentially Private Correlation Clustering

Mark Bun, Marek Elias, Janardhan Kulkarni

Keywords Paper

Deep Learning, Embedding Approaches, Algorithms, Representation Learning; Algorithms, Structured Prediction; Applications, Computational Biology and Bioinform, Social Aspects of Machine Learning, Privacy, Anonymity, and Security

0

0

0

0

5:14

18/07/2021

Private Alternating Least Squares: Practical Private Matrix Completion with Tighter Rates

Steve Chien, Prateek Jain, Walid Krichene and
Steffen Rendle, Shuang Song, Abhradeep Guha Thakurta, Li Zhang

Keywords Paper

Theory, Deep Learning; Deep Learning, CNN Architectures; Theory, Spaces of Functions and Kernels, Social Aspects of Machine Learning, Privacy, Anonymity, and Security

0

0

0

0

24:19

13/04/2021

DP-MERF: Differentially private mean embeddings with RandomFeatures for practical privacy-preserving data generation

Frederik Harder, Kamil Adamczewski, Mijung Park

Keywords Paper

0

0

0

0

3:02

06/12/2021

Deep Learning with Label Differential Privacy

Badih Ghazi, Noah Golowich, Ravi Kumar and
Pasin Manurangsi, Chiyuan Zhang

Keywords Paper

deep learning, robustness, self-supervised learning, privacy

0

0

0

0

14:29

06/12/2021

Antipodes of Label Differential Privacy: PATE and ALIBI

Mani Malek Esmaeili, Ilya Mironov, Karthik Prasad and
Igor Shilov, Florian Tramer

Keywords Paper

machine learning, privacy, semi-supervised learning

0

0

0

0

14:17

09/07/2020

Closure Properties for Private Classification and Online Prediction

Noga Alon, Amos Beimel, Shay Moran, Uri Stemmer

Keywords Paper

Privacy, fairness, Online learning

0

0

0

0

13:47

12/07/2020

(Locally) Differentially Private Combinatorial Semi-Bandits

Xiaoyu Chen, Kai Zheng, Zixin Zhou and
Yunchang Yang, Wei Chen, Liwei Wang

Keywords Paper

Privacy-preserving Statistics and Machine Learning

0

0

0

0

12:39

06/12/2021

PartialFed: Cross-Domain Personalized Federated Learning via Partial Initialization

Benyuan Sun, Hongxing Huo, YI YANG, Bo Bai

Keywords Paper

machine learning, privacy, federated learning

0

0

0

0

10:35

18/07/2021

Differentially Private Quantiles

Jennifer Gillenwater, Matthew Joseph, Alex Kulesza

Keywords Paper

, Theory, Learning Theory, Social Aspects of Machine Learning, Privacy, Anonymity, and Security

0

0

0

0

5:13

12/07/2020

Optimal Differential Privacy Composition for Exponential Mechanisms

Jinshuo Dong, David Durfee, Ryan Rogers

Keywords Paper

Privacy-preserving Statistics and Machine Learning

0

0

0

0

15:54

13/04/2021

Shuffled model of differential privacy in federated learning

Antonious Girgis, Deepesh Data, Suhas Diggavi and
Peter Kairouz, Ananda Theertha Suresh

Keywords Paper

0

0

0

0

3:08

06/12/2020

Differentially Private Clustering: Tight Approximation Ratios

Badih Ghazi, Ravi Kumar, Pasin Manurangsi

Keywords Paper

0

0

0

0

3:02

26/08/2020

Private k-Means Clustering with Stability Assumptions

Moshe Shechner, Or Sheffet, Uri Stemmer

Keywords Paper

0

0

0

0

14:45

26/08/2020

Federated Heavy Hitters Discovery with Differential Privacy

Wennan Zhu, Peter Kairouz, Brendan McMahan and
Haicheng Sun, Wei Li

Keywords Paper

0

0

0

0

14:08

23/08/2020

TIPRDC: Task-independent privacy-respecting data crowdsourcing framework for deep learning with anonymized intermediate representations

Ang Li, Yixiao Duan, Huanrui Yang and
Yiran Chen, Jianlei Yang

Keywords Paper

anonymized intermediate representations, privacy-respecting data crowdsourcing, deep learning

0

0

0

0

16:04

09/07/2020

Locally Private Hypothesis Selection

Sivakanth Gopi, Gautam Kamath, Janardhan D Kulkarni and
Aleksandar Nikolov, Steven Wu, Huanyu Zhang

Keywords Paper

Privacy, fairness, Distribution learning/testing

0

0

0

0

14:58

06/12/2021

Learning with User-Level Privacy

Daniel Levy, Ziteng Sun, Kareem Amin and
Satyen Kale, Alex Kulesza, Mehryar Mohri, Ananda Theertha Suresh

Keywords Paper

optimization, privacy

0

0

0

0

14:54

19/08/2021

LDP-FL: Practical Private Aggregation in Federated Learning with Local Differential Privacy

Lichao Sun, Jianwei Qian, Xun Chen

Keywords Paper

Data Mining, Federated Learning, Privacy Preserving Data Mining, Multi-agent Learning, Trustable Learning

0

0

0

0

14:59

06/12/2020

Smoothed Analysis of Online and Differentially Private Learning

Nika Haghtalab, Tim Roughgarden, Abhishek Shetty

Keywords Paper

, Algorithms -> Multitask and Transfer Learning

0

0

0

0

3:23