04/07/2020

Unsupervised Domain Clusters in Pretrained Language Models

Roee Aharoni, Yoav Goldberg

Keywords: NLP, data-driven domains, neural machine translation, unsupervised clusters

Abstract: The notion of "in-domain data" in NLP is often over-simplistic and vague, as textual data varies in many nuanced linguistic aspects such as topic, style, or level of formality. In addition, domain labels are often unavailable, making it challenging to build domain-specific systems. We show that massive pre-trained language models implicitly learn sentence representations that cluster by domain without supervision, suggesting a simple data-driven definition of domains in textual data. We harness this property and propose domain data selection methods based on such models, which require only a small set of in-domain monolingual data. We evaluate our data selection methods for neural machine translation across five diverse domains, where they outperform an established approach as measured both by BLEU and by precision and recall with respect to an oracle selection.
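As a rough illustration of the idea described in the abstract (not the authors' exact pipeline), the sketch below embeds sentences with a pretrained language model, clusters the embeddings without supervision, and ranks candidate sentences by similarity to the centroid of a small in-domain seed set. The model choice (bert-base-uncased), average pooling, the GMM clusterer, and the centroid-based scoring are all illustrative assumptions, not confirmed details of the paper.

# A minimal sketch, assuming average-pooled BERT embeddings, a GMM for
# unsupervised domain clusters, and cosine-to-centroid scoring for selection.
import numpy as np
import torch
from transformers import AutoModel, AutoTokenizer
from sklearn.mixture import GaussianMixture
from sklearn.metrics.pairwise import cosine_similarity

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")
model.eval()

def embed(sentences):
    """Average-pooled last-layer hidden states as sentence vectors."""
    batch = tokenizer(sentences, padding=True, truncation=True,
                      return_tensors="pt")
    with torch.no_grad():
        hidden = model(**batch).last_hidden_state      # (B, T, H)
    mask = batch["attention_mask"].unsqueeze(-1)       # ignore padding tokens
    return ((hidden * mask).sum(1) / mask.sum(1)).numpy()

# Unsupervised domain clusters: fit a GMM with k = assumed number of domains.
corpus = ["The patient was administered 50mg of the drug.",
          "The court ruled in favor of the plaintiff.",
          "Dosage should be adjusted for renal impairment.",
          "The defendant appealed the verdict."]
vecs = embed(corpus)
clusters = GaussianMixture(n_components=2, random_state=0).fit_predict(vecs)

# Data selection: score candidates by cosine similarity to the centroid of a
# small in-domain monolingual seed set (a simple stand-in for the paper's
# selection methods).
seed_centroid = embed(["Take two tablets twice daily."]).mean(0, keepdims=True)
scores = cosine_similarity(vecs, seed_centroid).ravel()
for score, sent in sorted(zip(scores, corpus), reverse=True):
    print(f"{score:.3f}  {sent}")

Under these assumptions, medical-sounding sentences should land in one cluster and score highest against the medical seed centroid, which is the behavior the abstract's data-driven definition of domains predicts.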

The talk and the paper were published at the ACL 2020 virtual conference.

