On Importance Sampling-Based Evaluation of Latent Language Models

04/07/2020

On Importance Sampling-Based Evaluation of Latent Language Models

Robert L Logan IV, Matt Gardner, Sameer Singh

Keywords: Importance Models, likelihood-based evaluation, Language models, importance sampling

Abstract Paper Similar Papers

Abstract: Language models that use additional latent structures (e.g., syntax trees, coreference chains, knowledge graph links) provide several advantages over traditional language models. However, likelihood-based evaluation of these models is often intractable as it requires marginalizing over the latent space. Existing works avoid this issue by using importance sampling. Although this approach has asymptotic guarantees, analysis is rarely conducted on the effect of decisions such as sample size and choice of proposal distribution on the reported estimates. In this paper, we carry out this analysis for three models: RNNG, EntityNLM, and KGLM. In addition, we elucidate subtle differences in how importance sampling is applied in these works that can have substantial effects on the final estimates, as well as provide theoretical results which reinforce the validity of this technique.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ACL 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

19/04/2021

Randomized deep structured prediction for discourse-level processing

Manuel Widmoser, Maria Leonor Pacheco, Jean Honorio, Dan Goldwasser

Keywords Paper

0

0

0

0

9:44

16/11/2020

Modeling Content Importance for Summarization with Pre-trained Language Models

Liqiang Xiao, Lu Wang, Hao He, Yaohui Jin

Keywords Paper

modeling importance, summarization, statistical methods, information theory

0

0

0

0

8:50

02/02/2021

MASKER: Masked Keyword Regularization for Reliable Text Classification

Seung Jun Moon, Sangwoo Mo, Kimin Lee and
Jaeho Lee, Jinwoo Shin

Keywords Paper

0

0

0

0

15:05

02/02/2021

Improving Commonsense Causal Reasoning by Adversarial Training and Data Augmentation

Ieva Staliūnaitė, Philip John Gorinski, Ignacio Iacobacci

Keywords Paper

0

0

0

0

16:40

06/12/2020

Further Analysis of Outlier Detection with Deep Generative Models

Ziyu Wang, Bin Dai, David P Wipf, Jun Zhu

Keywords Paper

0

0

0

0

2:57

16/11/2020

MODE-LSTM: A Parameter-efficient Recurrent Network with Multi-Scale for Sentence Classification

Qianli Ma, Zhenxi Lin, Jiangyue Yan and
Zipeng Chen, Liuhong Yu

Keywords Paper

sentence classification, extracting features, generalization, cnn models

0

0

0

0

10:35

01/07/2020

Syntactic Parsing in Humans and Machines

Paola Merlo

Keywords Paper

0

0

0

0

44:12

16/11/2020

A Rigorous Study on Named Entity Recognition: Can Fine-tuning Pretrained Model Lead to the Promised Land?

Hongyu Lin, Yaojie Lu, Jialong Tang and
Xianpei Han, Le Sun, Zhicheng Wei, Nicholas Jing Yuan

Keywords Paper

randomization test, fine-tuning model, ner, creditable approaches

0

0

0

0

10:12

26/08/2020

Variational Autoencoders for Sparse and Overdispersed Discrete Data

He Zhao, Piyush Rai, Lan Du and
Wray Buntine, Dinh Phung, Mingyuan Zhou

Keywords Paper

0

0

0

0

14:28

03/08/2020

MASSIVE: Tractable and Robust Bayesian Learning of Many-Dimensional Instrumental Variable Models

Ioan Gabriel Bucur, Tom Claassen, Tom Heskes

Keywords Paper

0

0

0

0

8:04

04/07/2020

Conditional Augmentation for Aspect Term Extraction via Masked Sequence-to-Sequence Generation

Kun Li, Chengbo Chen, Xiaojun Quan and
Qing Ling, Yan Song

Keywords Paper

Conditional Augmentation, Aspect Extraction, sentiment analysis, data augmentation

0

0

0

0

11:30

26/04/2020

Towards Hierarchical Importance Attribution: Explaining Compositional Semantics for Neural Sequence Models

Xisen Jin, Zhongyu Wei, Junyi Du and
Xiangyang Xue, Xiang Ren

Keywords Paper

natural language processing, interpretability

0

0

0

0

4:58

04/07/2020

Perturbation Based Learning for Structured NLP tasks with Application to Dependency Parsing

Amichay Doitch, Ram Yazdi, Tamir Hazan, Roi Reichart

Keywords Paper

Structured tasks, Dependency Parsing, NLP, sampling

0

0

0

0

10:53

03/05/2021

Neural Topic Model via Optimal Transport

He Zhao, Dinh Phung, Viet Huynh and
Trung Le, Wray Buntine

Keywords Paper

optimal transport, document analysis, topic modelling

0

0

0

1

9:29

04/07/2020

Generalized Entropy Regularization or: There's Nothing Special about Label Smoothing

Clara Meister, Elizabeth Salesky, Ryan Cotterell

Keywords Paper

label smoothing, language tasks, Generalized Regularization, Label Smoothing

0

0

0

0

12:03

06/12/2020

Promoting Stochasticity for Expressive Policies via a Simple and Efficient Regularization Method

Qi Zhou, Yufei Kuang, Zherui Qiu and
Houqiang Li, Jie Wang

Keywords Paper

0

0

0

0

3:10

04/07/2020

Effective Estimation of Deep Generative Language Models

Tom Pelsmaeker, Wilker Aziz

Keywords Paper

Estimation Models, parameterisation models, posterior collapse, language modelling

0

0

0

0

12:19

16/11/2020

An Unsupervised Sentence Embedding Method by Mutual Information Maximization

Yan Zhang, Ruidan He, Zuozhu Liu and
Kwan Hui Lim, Lidong Bing

Keywords Paper

sentence-pair tasks, clustering, semantic search, downstream tasks

0

0

0

0

12:22

03/05/2021

Variational Information Bottleneck for Effective Low-Resource Fine-Tuning

Rabeeh Karimi Mahabadi, Yonatan Belinkov, James Henderson

Keywords Paper

variational information bottleneck, biases, robust, over-fitting, large-scale pre-trained language models, NLP, Transfer learning

0

0

0

0

5:07

25/07/2020

Copula guided neural topic modelling for short texts

Lihui Lin, Hongyu Jiang, Yanghui Rao

Keywords Paper

short text modelling, Archimedean copulas, neural topic modelling, auto-encoding variational Bayes

0

0

0

0

8:46

16/11/2020

A Simple Yet Strong Pipeline for HotpotQA

Dirk Groeneveld, Tushar Khot, Mausam, Ashish Sabharwal

Keywords Paper

multi-hop answering, named recognition, graph-based reasoning, question decomposition

0

0

0

0

6:14

08/12/2020

Classifier Probes May Just Learn from Linear Context Features

Jenny Kunz, Marco Kuhlmann

Keywords Paper

0

0

0

0

14:33

13/04/2021

Improving adversarial robustness via unlabeled out-of-domain data

Zhun Deng, Linjun Zhang, Amirata Ghorbani, James Zou

Keywords Paper

0

0

0

0

3:01

06/12/2020

Minimax Value Interval for Off-Policy Evaluation and Policy Optimization

Nan Jiang, Jiawei Huang

Keywords Paper

Algorithms -> Classification, Algorithms -> Semi-Supervised Learning

0

0

0

0

2:56

12/07/2020

Learning and Simulation in Generative Structured World Models

Zhixuan Lin, Yi-Fu Wu, Skand Peri and
Bofeng Fu, Jindong Jiang, Sungjin Ahn

Keywords Paper

Deep Learning - Generative Models and Autoencoders

0

0

0

0

11:56

19/10/2020

Distant supervision in BERT-based adhoc document retrieval

Koustav Rudra, Avishek Anand

Keywords Paper

distant supervision, adhoc retrieval, document ranking

0

0

0

0

6:49

26/08/2020

Independent Subspace Analysis for Unsupervised Learning of Disentangled Representations

Jan Stuehmer, Richard Turner, Sebastian Nowozin

Keywords Paper

0

0

0

0

11:43

06/12/2020

Learning Rich Rankings

Arjun Seshadri, Stephen Ragain, Johan Ugander

Keywords Paper

0

0

0

0

3:19

06/12/2021

Local Explanation of Dialogue Response Generation

Yi-Lin Tuan, Connor Pryor, Wenhu Chen and
Lise Getoor, William Yang Wang

Keywords Paper

machine learning

0

0

0

0

13:14

12/09/2020

Plausible Reasoning about EL-Ontologies using Concept Interpolation

Yazmín Ibáñez-García, Víctor Gutiérrez-Basulto, Steven Schockaert

Keywords Paper

Description logics-General, Commonsense reasoning-General, Knowledge representation languages-General, Concept formation, similarity-based reasoning-General

0

0

0

0

15:50

04/07/2020

Don’t Say That! Making Inconsistent Dialogue Unlikely with Unlikelihood Training

Margaret Li, Stephen Roller, Ilia Kulikov and
Sean Welleck, Y-Lan Boureau, Kyunghyun Cho, Jason Weston

Keywords Paper

dialogue tasks, Unlikelihood Training, Generative models, maximum training

0

0

0

0

11:26

06/12/2021

Refining Language Models with Compositional Explanations

Huihan Yao, Ying Chen, Qinyuan Ye and
Xisen Jin, Xiang Ren

Keywords Paper

machine learning, fairness, language

0

0

0

0

13:17

19/04/2021

Disambiguatory signals are stronger in word-initial positions

Tiago Pimentel, Ryan Cotterell, Brian Roark

Keywords Paper

0

0

0

0

11:35

16/11/2020

Uncertainty-Aware Semantic Augmentation for Neural Machine Translation

Xiangpeng Wei, Heng Yu, Yue Hu and
Rongxiang Weng, Luxi Xing, Weihua Luo

Keywords Paper

sequence-to-sequence task, nmt, inference, translation tasks

0

0

0

0

11:11

26/08/2020

Prior-aware Composition Inference for Spectral Topic Models

Moontae Lee, David Bindel, David Mimno

Keywords Paper

0

0

0

0

14:46

02/02/2021

Have We Solved The Hard Problem? It’s Not Easy! Contextual Lexical Contrast as a Means to Probe Neural Coherence

Wenqiang Lei, Yisong Miao, Runpeng Xie and
Bonnie Webber, Meichun Liu, Tat-Seng Chua, Nancy F. Chen

Keywords Paper

0

0

0

0

18:55

19/08/2021

Method of Moments for Topic Models with Mixed Discrete and Continuous Features

Joachim Giesen, Paul Kahlmeyer, Sören Laue and
Matthias Mitterreiter, Frank Nussbaum, Christoph Staudt, Sina Zarrieß

Keywords Paper

Machine Learning, Learning Generative Models, Probabilistic Machine Learning, Unsupervised Learning

0

0

0

0

15:24

04/07/2020

Document-Level Event Role Filler Extraction using Multi-Granularity Contextualized Encoding

Xinya Du, Claire Cardie

Keywords Paper

Document-Level Extraction, event extraction, extraction decisions, Multi-Granularity Encoding

0

0

0

0

10:27

16/11/2020

Interpretable Multi-dataset Evaluation for Named Entity Recognition

Jinlan Fu, Pengfei Liu, Graham Neubig

Keywords Paper

natural tasks, interpretable evaluation, named task, analysis tool

0

0

0

0

11:11

06/12/2021

Locally Valid and Discriminative Prediction Intervals for Deep Learning Models

Zhen Lin, Shubhendu Trivedi, Jimeng Sun

Keywords Paper

deep learning

0

0

0

0

12:05