Large Scale Author Obfuscation Using Siamese Variational Auto-Encoder: The SiamAO System

08/12/2020

Large Scale Author Obfuscation Using Siamese Variational Auto-Encoder: The SiamAO System

Chakaveh Saedi, Mark Dras

Keywords:

Abstract Paper Similar Papers

Abstract: Author obfuscation is the task of masking the author of a piece of text, with applications in privacy. Recent advances in deep neural networks have boosted author identification performance making author obfuscation more challenging. Existing approaches to author obfuscation are largely heuristic. Obfuscation can, however, be thought of as the construction of adversarial examples to attack author identification, suggesting that the deep learning architectures used for adversarial attacks could have application here. Current architectures are proposed to construct adversarial examples against classification-based models, which in author identification would exclude the high-performing similarity-based models employed when facing large number of authorial classes. In this paper, we propose the first deep learning architecture for constructing adversarial examples against similarity-based learners, and explore its application to author obfuscation. We analyse the output from both success in obfuscation and language acceptability, as well as comparing the performance with some common baselines, and showing promising results in finding a balance between safety and soundness of the perturbed texts.

The video of this talk cannot be embedded. You can watch it here:

https://underline.io/lecture/9115-large-scale-author-obfuscation-using-siamese-variational-auto-encoder-the-siamao-system

(Link will open in new window)

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at COLING Workshops 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

19/04/2021

Adversarial stylometry in the wild: Transferable lexical substitution attacks on author profiling

Chris Emmery, Ákos Kádár, Grzegorz Chrupała

Keywords Paper

0

0

0

0

11:46

08/12/2020

Dual Attention Model for Citation Recommendation

Yang Zhang, Qiang Ma

Keywords Paper

0

0

0

0

15:10

19/04/2021

Mode effects’ challenge to authorship attribution

Haining Wang, Allen Riddell, Patrick Juola

Keywords Paper

0

0

0

0

10:25

22/11/2021

SemGIF: A Semantics Guided Incremental Few-shot Learning Framework with Generative Replay

S Divakar Bhat, Biplab Banerjee, Subhasis Chaudhuri

Keywords Paper

Incremental few shot learning, Few shot learning, Incremental learning, feature augmentation, cross dataset, heterogenous

0

0

0

0

3:05

06/12/2021

SSUL: Semantic Segmentation with Unknown Label for Exemplar-based Class-Incremental Learning

Sungmin Cha, beomyoung kim, YoungJoon Yoo, Taesup Moon

Keywords Paper

machine learning, vision

0

0

0

0

14:05

01/07/2020

Linguistic Features for Readability Assessment

Tovly Deutsch, Masoud Jasbi, Stuart Shieber

Keywords Paper

0

0

0

0

12:06

16/11/2020

Learning from Context or Names? An Empirical Study on Neural Relation Extraction

Hao Peng, Tianyu Gao, Xu Han and
Yankai Lin, Peng Li, Zhiyuan Liu, Maosong Sun, Jie Zhou

Keywords Paper

relation benchmarks, re scenarios, neural models, re models

0

0

0

0

11:56

06/12/2021

TopicNet: Semantic Graph-Guided Topic Discovery

Zhibin Duan, Yi.shi Xu, Bo Chen and
dongsheng wang, Chaojie Wang, Mingyuan Zhou

Keywords Paper

optimization, generative model, graph learning

0

0

0

0

10:15

19/08/2021

Guided Attention Network for Concept Extraction

Songtao Fang, Zhenya Huang, Ming He and
Shiwei Tong, Xiaoqing Huang, Ye Liu, Jie Huang, Qi Liu

Keywords Paper

Data Mining, Information Retrieval, Mining Text, Web, Social Media

0

0

0

0

14:26

14/09/2020

Diversity-Based Generalization for Unsupervised Text Classification under Domain Shift

Jitin Krishnan, Hemant Purohit, Huzefa Rangwala

Keywords Paper

text classification, unsupervised domain adaptation, natural language processing, neural networks

0

0

0

0

16:13

04/07/2020

A Girl Has A Name: Detecting Authorship Obfuscation

Asad Mahmood, Zubair Shafiq, Padmini Srinivasan

Keywords Paper

Detecting Obfuscation, Authorship attribution, Authorship obfuscation, stylometric analysis

0

0

0

0

11:25

06/12/2020

Unsupervised Semantic Aggregation and Deformable Template Matching for Semi-Supervised Learning

Tao Han, Junyu Gao, Yuan Yuan, Qi Wang

Keywords Paper

0

0

0

0

3:22

04/07/2020

Improving Disentangled Text Representation Learning with Information-Theoretic Guidance

Pengyu Cheng, Martin Renqiang Min, Dinghan Shen and
Christopher Malon, Yizhe Zhang, Yitong Li, Lawrence Carin

Keywords Paper

Learning language, NLP tasks, conditional generation, style transfer

0

0

0

0

9:56

04/07/2020

From Zero to Hero: Human-In-The-Loop Entity Linking in Low Resource Domains

Jan-Christoph Klie, Richard Eckart de Castilho, Iryna Gurevych

Keywords Paper

Human-In-The-Loop Linking, Entity linking, disambiguating mentions, annotation process

0

0

0

0

12:26

08/12/2020

Variational Autoencoder with Embedded Student-t Mixture Model for Authorship Attribution

Benedikt Boenninghoff, Steffen Zeiler, Robert Nickel, Dorothea Kolossa

Keywords Paper

0

0

0

0

14:13

04/07/2020

Efficient Strategies for Hierarchical Text Classification: External Knowledge and Auxiliary Tasks

Kervy Rivas Rojas, Gina Bustamante, Arturo Oncevay, Marco Antonio Sobrevilla Cabezudo

Keywords Paper

Hierarchical Classification, External Tasks, sequence-to-sequence problem, auxiliary bottom-up-classification

0

0

0

0

5:44

06/12/2021

On Memorization in Probabilistic Deep Generative Models

Gerrit van den Burg, Chris Williams

Keywords Paper

deep learning, self-supervised learning, generative model

0

0

0

0

12:04

04/07/2020

Embedding-based Scientific Literature Discovery in a Text Editor Application

Onur Gökçe, Jonathan Prada, Nikola I. Nikolov and
Nianlong Gu, Richard H.R. Hahnloser

Keywords Paper

Embedding-based Discovery, Text Application, literature discovery, bibliography management

1

0

1

0

9:52

02/02/2021

Bigram and Unigram Based Text Attack via Adaptive Monotonic Heuristic Search

Xinghao Yang, Weifeng Liu, James Bailey and
Dacheng Tao, Wei Liu

Keywords Paper

0

0

0

0

17:17

12/07/2020

Scalable Differential Privacy with Certified Robustness in Adversarial Learning

Hai Phan, My T. Thai, Han Hu and
Ruoming Jin, Tong Sun, Dejing Dou

Keywords Paper

Privacy-preserving Statistics and Machine Learning

0

0

0

0

14:47

04/07/2020

Information-Theoretic Probing for Linguistic Structure

Tiago Pimentel, Josef Valvoda, Rowan Hall Maudslay and
Ran Zmigrod, Adina Williams, Ryan Cotterell

Keywords Paper

Information-Theoretic Probing, NLP tasks, linguistic task, probing

0

0

0

0

10:30

03/05/2021

Auxiliary Task Update Decomposition: The Good, the Bad and the Neutral

Lucio Dery, Yann Dauphin, David Grangier

Keywords Paper

multitask learning, deeplearning, pre-training, gradient decomposition

0

0

0

0

5:22

12/07/2020

Robustness to Programmable String Transformations via Augmented Abstract Training

Yuhao Zhang, Aws Albarghouthi, Loris D'Antoni

Keywords Paper

Adversarial Examples

0

0

0

0

14:49

06/12/2020

Adversarial Self-Supervised Contrastive Learning

Minseon Kim, Jihoon Tack, Sung Ju Hwang

Keywords Paper

0

0

0

0

3:19

06/12/2021

A Framework to Learn with Interpretation

Jayneel Parekh, Pavlo Mozharovskyi, Florence d'Alché-Buc

Keywords Paper

deep learning, interpretability

0

0

0

0

14:05

26/04/2020

The Curious Case of Neural Text Degeneration

Ari Holtzman, Jan Buys, Li Du and
Maxwell Forbes, Yejin Choi

Keywords Paper

generation, text, NLG, NLP, natural language, natural language generation, language model, neural, neural language model

0

0

0

0

4:57

08/12/2020

A Deep Metric Learning Method for Biomedical Passage Retrieval

Andrés Rosso-Mateus, Fabio A. González, Manuel Montes-y-Gómez

Keywords Paper

0

0

0

0

14:58

19/08/2021

Disentangled Face Attribute Editing via Instance-Aware Latent Space Search

Yuxuan Han, Jiaolong Yang, Ying Fu

Keywords Paper

Computer Vision, 2D and 3D Computer Vision, Explainable/Interpretable Machine Learning

0

0

0

0

12:51

19/04/2021

Changing the mind of transformers for topically-controllable language generation

Haw-Shiuan Chang, Jiaming Yuan, Mohit Iyyer, Andrew McCallum

Keywords Paper

0

0

0

0

11:47

04/07/2020

Cross-Lingual Unsupervised Sentiment Classification with Multi-View Transfer Learning

Hongliang Fei, Ping Li

Keywords Paper

Cross-Lingual Classification, sentiment classification, unsupervised system, classification

0

0

0

0

12:23

16/11/2020

Multi-Dimensional Gender Bias Classification

Emily Dinan, Angela Fan, Ledell Wu and
Jason Weston, Douwe Kiela, Adina Williams

Keywords Paper

detecting bias, machine models, nlp models, fine-grained framework

0

0

0

0

12:02

18/07/2021

Self-Damaging Contrastive Learning

Ziyu Jiang, Tianlong Chen, Bobak Mortazavi, Zhangyang Wang

Keywords Paper

Algorithms, Unsupervised Learning

0

0

0

1

5:10

03/05/2021

Monte-Carlo Planning and Learning with Language Action Value Estimates

Youngsoo Jang, Seokin Seo, Jongmin Lee, Kee-Eung Kim

Keywords Paper

reinforcement learning, interactive fiction, Monte-Carlo tree search, natural language processing

0

0

0

0

4:57

08/12/2020

Classifier Probes May Just Learn from Linear Context Features

Jenny Kunz, Marco Kuhlmann

Keywords Paper

0

0

0

0

14:33

04/07/2020

Contextualized Weak Supervision for Text Classification

Dheeraj Mekala, Jingbo Shang

Keywords Paper

Text Classification, Weakly classification, string matching, Contextualized Supervision

0

0

0

0

11:26

12/07/2020

Neural Topic Modeling with Continual Lifelong Learning

Pankaj Gupta, Yatin Chaudhary, Thomas Runkler, Hinrich Schuetze

Keywords Paper

Applications - Language, Speech and Dialog

0

0

0

0

15:57

04/07/2020

Word-level Textual Adversarial Attacking as Combinatorial Optimization

Yuan Zang, Fanchao Qi, Chenghao Yang and
Zhiyuan Liu, Meng Zhang, Qun Liu, Maosong Sun

Keywords Paper

Textual attacking, Word-level attacking, combinatorial problem, Word-level Attacking

0

0

0

0

9:34

14/06/2020

ScrabbleGAN: Semi-Supervised Varying Length Handwritten Text Generation

Sharon Fogel, Hadar Averbuch-Elor, Sarel Cohen and
Shai Mazor, Roee Litman

Keywords Paper

gan, semi-supervised, domain-adaptation, handwriting, generative, unlabeled, transfer learning, ocr, text, augmentation

0

0

0

0

1:01

12/07/2020

Educating Text Autoencoders: Latent Representation Guidance via Denoising

Tianxiao Shen, Jonas Mueller, Regina Barzilay, Tommi Jaakkola

Keywords Paper

Deep Learning - Generative Models and Autoencoders

0

0

0

0

17:06

06/12/2021

Antipodes of Label Differential Privacy: PATE and ALIBI

Mani Malek Esmaeili, Ilya Mironov, Karthik Prasad and
Igor Shilov, Florian Tramer

Keywords Paper

machine learning, privacy, semi-supervised learning

0

0

0

0

14:17