Personal Information Leakage Detection in Conversations

16/11/2020

Qiongkai Xu, Lizhen Qu, Zeyu Gao, Gholamreza Haffari

Keywords: leakage information, detection task, alignment problem, dataset persona-leakage

Abstract: The global market size of conversational assistants (chatbots) is expected to grow to USD 9.4 billion by 2024, according to MarketsandMarkets. Despite the wide use of chatbots, leakage of personal information through chatbots poses serious privacy concerns for their users. In this work, we propose to protect personal information by warning users of detected suspicious sentences generated by conversational assistants. The detection task is formulated as an alignment optimization problem, and a new dataset, PERSONA-LEAKAGE, is collected for evaluation. We propose two novel constrained alignment models, which consistently outperform baseline methods on PERSONA-LEAKAGE. Moreover, we analyze the behavior of recently proposed personalized chit-chat dialogue systems. The empirical results show that those systems suffer more from personal information disclosure than the widely used Seq2Seq model and the language model. In those cases, a significant number of information-leaking utterances can be detected by our models with high precision.

The talk and the respective paper were published at the EMNLP 2020 virtual conference.

Similar Papers

14/06/2020

Evade Deep Image Retrieval by Stashing Private Images in the Hash Space

Yanru Xiao, Cong Wang, Xing Gao

Keywords: deep learning to hash, adversarial learning, privacy preservation

06/12/2021

PartialFed: Cross-Domain Personalized Federated Learning via Partial Initialization

Benyuan Sun, Hongxing Huo, Yi Yang, Bo Bai

Keywords: machine learning, privacy, federated learning

11/08/2020

Padding Ain't Enough: Assessing the Privacy Guarantees of Encrypted DNS

Jonas Bushart, Christian Rossow

25/04/2020

Camera Adversaria

Kieran Browne, Ben Swift, Terhi Nurmikko-Fuller

Keywords: surveillance capitalism, adversarial examples, critical design

19/04/2021

“are you kidding me?”: Detecting unpalatable questions on Reddit

Sunyam Bagga, Andrew Piper, Derek Ruths

25/04/2020

Will Deleting History Make Alexa More Trustworthy?

Eugene Cho, S. Sundar, Saeed Abdullah, Nasim Motalebi

Keywords: customization, privacy concern, power usage, security, smart speaker(s), voice assistant(s)

25/04/2020

Taking Data Out of Context to Hyper-Personalize Ads

Julia Hanson, Miranda Wei, Sophie Veys, Matthew Kugler, Lior Strahilevitz, Blase Ur

Keywords: hyper-personalization, targeted advertising, creepy, user study

23/08/2020

Towards building an intelligent chatbot for customer service: Learning to respond at the appropriate time

Che Liu, Junfeng Jiang, Chao Xiong, Yi Yang, Jieping Ye

Keywords: customer service, triggering model, chatbot, self-supervised learning

25/07/2020

GoChat: Goal-oriented chatbots with hierarchical reinforcement learning

Jianfeng Liu, Feiyang Pan, Ling Luo

Keywords: dialogue system, reinforcement learning, goal-oriented chatbot

19/08/2021

InverseNet: Augmenting Model Extraction Attacks with Training Data Inversion

Xueluan Gong, Yanjiao Chen, Wenbin Yang, Guanghao Mei, Qian Wang

Keywords: Machine Learning, Adversarial Machine Learning, Deep Learning, Security and Privacy

04/07/2020

You Impress Me: Dialogue Generation via Mutual Persona Perception

Qian Liu, Yihong Chen, Bei Chen, Jian-Guang Lou, Zixuan Chen, Bin Zhou, Dongmei Zhang

Keywords: Dialogue Generation, mimicking responses, cognitive science, understanding

14/06/2020

Celeb-DF: A Large-Scale Challenging Dataset for DeepFake Forensics

Yuezun Li, Xin Yang, Pu Sun, Honggang Qi, Siwei Lyu

Keywords: deepfake dataset, deepfake detection, face synthesis, multimedia forensics

12/08/2020

SkillExplorer: Understanding the Behavior of Skills in Large Scale

Zhixiu Guo, Zijin Lin, Pan Li, Kai Chen

16/11/2020

Spot The Bot: A Robust and Efficient Framework for the Evaluation of Conversational Dialogue Systems

Jan Deriu, Don Tuggener, Pius von Däniken, Jon Ander Campos, Alvaro Rodrigo, Thiziri Belkacem, Aitor Soroa, Eneko Agirre, Mark Cieliebak

Keywords: evaluation methods, conversational systems, chat bots, spot bot

02/02/2021

Unsupervised Summarization for Chat Logs with Topic-Oriented Ranking and Context-Aware Auto-Encoders

Yicheng Zou, Jun Lin, Lujun Zhao, Yangyang Kang, Zhuoren Jiang, Changlong Sun, Qi Zhang, Xuanjing Huang, Xiaozhong Liu

06/12/2020

Glyph: Fast and Accurately Training Deep Neural Networks on Encrypted Data

Qian Lou, Bo Feng, Geoffrey Charles Fox, Lei Jiang

07/06/2021

You Don't Know How I Feel: Insider-Outsider Perspective Gaps in Cyberbullying Risk Detection

Seunghyun Kim, Afsaneh Razi, Gianluca Stringhini, Pamela J. Wisniewski, Munmun De Choudhury

Keywords: Qualitative and quantitative studies of social media, Human computer interaction, social media tools, navigation and visualization, Subjectivity in textual data, sentiment analysis, polarity/opinion identification and extraction, linguistic analyses of soc

12/08/2020

Preech: A System for Privacy-Preserving Speech Transcription

Shimaa Ahmed, Amrita Roy Chowdhury, Kassem Fawaz, Parmesh Ramanathan

12/07/2020

(Locally) Differentially Private Combinatorial Semi-Bandits

Xiaoyu Chen, Kai Zheng, Zixin Zhou, Yunchang Yang, Wei Chen, Liwei Wang

Keywords: Privacy-preserving Statistics and Machine Learning

12/08/2020

Detecting Stuffing of a User’s Credentials at Her Own Accounts

Ke Coby Wang, Michael K. Reiter

12/08/2020

Devil’s Whisper: A General Approach for Physical Adversarial Attacks against Commercial Black-box Speech Recognition Devices

Yuxuan Chen, Xuejing Yuan, Jiangshan Zhang, Yue Zhao, Shengzhi Zhang, Kai Chen, XiaoFeng Wang

16/11/2020

Detecting Cross-Modal Inconsistency to Defend Against Neural Fake News

Reuben Tan, Bryan Plummer, Kate Saenko

Keywords: large-scale disinformation, detecting inconsistencies, defense mechanism, adversaries

16/11/2020

Human-centric dialog training via offline reinforcement learning

Natasha Jaques, Judy Hanwen Shen, Asma Ghandeharioun, Craig Ferguson, Agata Lapedriza, Noah Jones, Shixiang Gu, Rosalind Picard

Keywords: dialog model, rl policy, rl, language models

25/04/2020

Does Context in Privacy Communication Really Matter? A Survey on Consumer Concerns and Preferences

Nico Ebert, Kurt Ackermann, Peter Heinrich

Keywords: privacy, policy, user preferences, privacy concerns

01/07/2020

A Case Study of User Communication Styles with Customer Service Agents versus Intelligent Virtual Agents

Timothy Hewitt, Ian Beaver

05/12/2020

Rumor detection on Twitter using multiloss hierarchical BiLSTM with an attenuation factor

Yudianto Sujana, Jiawen Li, Hung-Yu Kao

03/05/2021

Private Image Reconstruction from System Side Channels Using Generative Models

Yuanyuan Yuan, Shuai Wang, Junping Zhang

Keywords: side channel analysis

12/08/2020

The Tools and Tactics Used in Intimate Partner Surveillance: An Analysis of Online Infidelity Forums

Emily Tseng, Rosanna Bellini, Nora McDonald, Matan Danos, Rachel Greenstadt, Damon McCoy, Nicola Dell, Thomas Ristenpart

25/04/2020

Social Boundaries for Personal Agents in the Interpersonal Space of the Home

Michal Luria, Rebecca Zheng, Bennett Huffman, Shuangni Huang, John Zimmerman, Jodi Forlizzi

Keywords: voice activated personal assistants, interaction design, speed dating, conversational agents, social robots, embodied agents

22/09/2020

Global and local differential privacy for collaborative bandits

Huazheng Wang, Qian Zhao, Qingyun Wu, Shubham Chopra, Abhinav Khaitan, Hongning Wang

Keywords: contextual bandits, Differential privacy, collaborative learning

07/06/2020

Beyond Positive Emotion: Deconstructing Happy Moments Based on Writing Prompts

Kokil Jaidka, Niyati Chhaya, Saran Mumick, Matthew Killingsworth, Alon Halevy, Lyle Ungar

Keywords: behaviors, detection, interactions, languages, search, sentiment

19/08/2021

Dialogue Disentanglement in Software Engineering: How Far are We?

Ziyou Jiang, Lin Shi, Celia Chen, Jun Hu, Qing Wang

Keywords: Natural Language Processing, Dialogue, NLP Applications and Tools, Resources and Evaluation

06/12/2021

Differentially Private n-gram Extraction

Kunho Kim, Sivakanth Gopi, Janardhan Kulkarni, Sergey Yekhanin

Keywords: privacy

19/10/2020

Active query of private demographic data for learning fair models

Yijun Liu, Chao Lan

Keywords: fairness, demographic privacy, decoupled fair model, active learning

14/09/2020

Mend The Learning Approach, Not the Data: Insights for Ranking E-Commerce Products

Muhammad Umer Anwaar, Dmytro Rybalko, Martin Kleinsteuber

Keywords: information retrieval, ranking and preference learning, learning to rank, e-commerce search, implicit feedback, counterfactual risk minimization, dataset, mining data logs

19/04/2021

ADePT: Auto-encoder based differentially private text transformation

Satyapriya Krishna, Rahul Gupta, Christophe Dupuy

26/08/2020

Federated Heavy Hitters Discovery with Differential Privacy

Wennan Zhu, Peter Kairouz, Brendan McMahan, Haicheng Sun, Wei Li

07/06/2020

Two Computational Models for Analyzing Political Attention in Social Media

Libby Hemphill, Angela M. Schöpke-Gonzalez

Keywords: attention, classifiers, political, political rhetoric, services, tools, topic, tweets, twitter

06/12/2021

Parameter-free HE-friendly Logistic Regression

Junyoung Byun, Woojin Lee, Jaewook Lee

Keywords: machine learning, privacy

02/02/2021

Disentangled Representation Learning in Heterogeneous Information Network for Large-scale Android Malware Detection in the COVID-19 Era and Beyond

Shifu Hou, Yujie Fan, Mingxuan Ju, Yanfang Ye, Wenqiang Wan, Kui Wang, Yinming Mei, Qi Xiong, Fudong Shao