De-Anonymizing Text by Fingerprinting Language Generation

Abstract: Components of machine learning systems are not (yet) perceived as security hotspots. Secure coding practices, such as ensuring that no execution paths depend on confidential inputs, have not yet been adopted by ML developers. We initiate the study of code security of ML systems by investigating how nucleus sampling---a popular approach for generating text, used for applications such as auto-completion---unwittingly leaks texts typed by users. Our main result is that the series of nucleus sizes for many natural English word sequences is a unique fingerprint. We then show how an attacker can infer typed text by measuring these fingerprints via a suitable side channel (e.g., cache access times), explain how this attack could help de-anonymize anonymous texts, and discuss defenses.
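To make the idea of a nucleus-size fingerprint concrete, here is a minimal sketch in Python. It is not the authors' code: the function names, the toy next-token distributions in `TOY_STEPS`, and the threshold 0.9 are all assumptions chosen for illustration. A real language model would produce distributions over tens of thousands of tokens.

```python
import random
from typing import Dict, List, Sequence, Tuple


def nucleus(probs: Dict[str, float], top_p: float) -> List[Tuple[str, float]]:
    """Smallest set of highest-probability tokens whose total mass reaches top_p."""
    ranked = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)
    chosen: List[Tuple[str, float]] = []
    mass = 0.0
    for token, p in ranked:
        chosen.append((token, p))
        mass += p
        if mass >= top_p:
            break
    return chosen


def sample_from_nucleus(probs: Dict[str, float], top_p: float, rng: random.Random) -> str:
    """Draw one token from the nucleus, weighted by probability."""
    tokens, weights = zip(*nucleus(probs, top_p))
    return rng.choices(tokens, weights=weights, k=1)[0]


def fingerprint(steps: Sequence[Dict[str, float]], top_p: float) -> List[int]:
    """Series of nucleus sizes, one per generation step."""
    return [len(nucleus(d, top_p)) for d in steps]


# Hypothetical next-token distributions for three consecutive steps.
TOY_STEPS = [
    {"the": 0.5, "a": 0.25, "an": 0.125, "this": 0.125},
    {"x": 0.95, "y": 0.05},
    {"a": 0.5, "b": 0.45, "c": 0.05},
]

if __name__ == "__main__":
    print(fingerprint(TOY_STEPS, 0.9))  # [4, 1, 2]
```

The number of loop iterations in `nucleus` depends on the input text. That is the kind of input-dependent execution path the abstract warns about: an observer who can estimate the iteration count at each step (for example through timing) learns the series of sizes without seeing the text itself.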

12/08/2020

Zhen Sun, Roei Schuster, Vitaly Shmatikov

Similar Papers

Human Distinguishable Visual Key Fingerprints

Mozhgan Azimpourkivi, Umut Topkara, Bogdan Carbunar

Detecting Stuffing of a User’s Credentials at Her Own Accounts

Ke Coby Wang, Michael K. Reiter

Padding Ain't Enough: Assessing the Privacy Guarantees of Encrypted DNS

Jonas Bushart, Christian Rossow

CrypTen: Secure Multi-Party Computation Meets Machine Learning

Brian Knott, Shobha Venkataraman, Awni Hannun, Shubho Sengupta, Mark Ibrahim, Laurens van der Maaten

Keywords: deep learning, machine learning, vision

Instance-hiding Schemes for Private Distributed Learning

Yangsibo Huang, Zhao Song, Sanjeev Arora, Kai Li

Keywords: Privacy-preserving Statistics and Machine Learning

TeeRex: Discovery and Exploitation of Memory Corruption Vulnerabilities in SGX Enclaves

Tobias Cloosters, Michael Rodler, Lucas Davi

CopyCat: Controlled Instruction-Level Attacks on Enclaves

Daniel Moghimi, Jo Van Bulck, Nadia Heninger, Frank Piessens, Berk Sunar

An Off-Chip Attack on Hardware Enclaves via the Memory Bus

Dayeol Lee, Dongha Jung, Ian T. Fang, Chia-Che Tsai, Raluca Ada Popa

Embedding java classes with Code2vec: Improvements from variable obfuscation

Rhys Compton, Eibe Frank, Panos Patros, Abigail Koay

Keywords: code2vec, machine learning, code obfuscation, source code, neural networks

A Language for Probabilistically Oblivious Computation

David Darais, Ian Sweet, Chang Liu, Michael Hicks

Keywords: Noninterference, Probability, Type Systems, Oblivious Computation

DeepHammer: Depleting the Intelligence of Deep Neural Networks through Targeted Chain of Bit Flips

Fan Yao, Adnan Siraj Rakin, Deliang Fan

SEAL: Attack Mitigation for Encrypted Databases via Adjustable Leakage

Ioannis Demertzis, Dimitrios Papadopoulos, Charalampos Papamanthou, Saurabh Shintre

Dataset Inference: Ownership Resolution in Machine Learning

Pratyush Maini, Mohammad Yaghini, Nicolas Papernot

Keywords: MLaaS, model extraction, model ownership

Matrix Sketching for Secure Collaborative Machine Learning

Mengjiao Zhang, Shusen Wang

Keywords: Social Aspects of Machine Learning, Privacy, Anonymity, and Security

Pointer Life Cycle Types for Lock-Free Data Structures with Memory Reclamation

Roland Meyer, Sebastian Wolff

Keywords: type inference, garbage collection, verification, type systems, lock-free data structures, linearizability, safe memory reclamation

TPM-FAIL: TPM meets Timing and Lattice Attacks

Daniel Moghimi, Berk Sunar, Thomas Eisenbarth, Nadia Heninger

Remote Side-Channel Attacks on Anonymous Transactions

Florian Tramer, Dan Boneh, Kenny Paterson

Input-Aware Dynamic Backdoor Attack

Tuan Anh Nguyen, Anh Tran

Big Numbers - Big Troubles: Systematically Analyzing Nonce Leakage in (EC)DSA Implementations

Samuel Weiser, David Schrammel, Lukas Bodner, Raphael Spreitzer

Viaduct: An Extensible, Optimizing Compiler for Secure Distributed Programs

Coşku Acay, Rolph Recto, Joshua Gancher, Andrew C. Myers, Elaine Shi

Keywords: information flow, multiparty computation, zero knowledge

Modeling Deep Learning Based Privacy Attacks on Physical Mail

Bingyao Huang, Ruyi Lian, Dimitris Samaras, Haibin Ling

A Probabilistic Separation Logic

Gilles Barthe, Justin Hsu, Kevin Liao

Keywords: verified cryptography, probabilistic independence, separation logic

Passport-aware Normalization for Deep Model Protection

Jie Zhang, Dongdong Chen, Jing Liao, Weiming Zhang, Gang Hua, Nenghai Yu

Jiawang Bai, Baoyuan Wu, Yong Zhang, Yiming Li, Zhifeng Li, Shu-Tao Xia

Dongsoo Lee, Se Jung Kwon, Byeongwook Kim, Yongkweon Jeon, Baeseong Park, Jeongin Yun

Emma Dauterman, Eric Feng, Ellen Luo, Raluca Ada Popa, Ion Stoica

Hao Chen, Ilaria Chillotti, Yihe Dong, Oxana Poburinnaya, Ilya Razenshteyn, M. Sadegh Riazi

Cesar Pereida García, Sohaib ul Hassan, Nicola Tuveri, Iaroslav Gridin, Alejandro Cabrera Aldaya, Billy Bob Brumley

Sunjay Cauligi, Craig Disselkoen, Klaus Gleissenthall, Dean Tullsen, Deian Stefan, Tamara Rezk, Gilles Barthe