An Empirical Investigation of Contextualized Number Prediction

16/11/2020

An Empirical Investigation of Contextualized Number Prediction

Taylor Berg-Kirkpatrick, Daniel Spokoyny

Keywords: contextualized prediction, prediction, detecting, numerical pre-diction

Abstract Paper Similar Papers

Abstract: We conduct a large scale empirical investigation of contextualized number prediction in running text. Specifically, we consider two tasks: (1)masked number prediction-- predict-ing a missing numerical value within a sentence, and (2)numerical anomaly detection--detecting an errorful numeric value within a sentence. We experiment with novel combinations of contextual encoders and output distributions over the real number line. Specifically, we introduce a suite of output distribution parameterizations that incorporate latent variables to add expressivity and better fit the natural distribution of numeric values in running text, and combine them with both recur-rent and transformer-based encoder architectures. We evaluate these models on two numeric datasets in the financial and scientific domain. Our findings show that output distributions that incorporate discrete latent variables and allow for multiple modes outperform simple flow-based counterparts on all datasets, yielding more accurate numerical pre-diction and anomaly detection. We also show that our models effectively utilize textual con-text and benefit from general-purpose unsupervised pretraining.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at EMNLP 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

08/07/2020

Space-efficient Query Evaluation over Probabilistic Event Streams

Rajeev Alur, Yu Chen, Kishor Jothimurugan, Sanjeev Khanna

Keywords Paper

Query processing over streams, Streaming algorithms, Probabilistic streams

0

0

0

0

22:51

16/11/2020

Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution Data

Lingkai Kong, Haoming Jiang, Yuchen Zhuang and
Jie Lyu, Tuo Zhao, Chao Zhang

Keywords Paper

augmented training, in-distribution calibration, text classification, expectation error

0

0

0

0

11:47

03/05/2021

A Mathematical Exploration of Why Language Models Help Solve Downstream Tasks

Nikunj Saunshi, Sadhika Malladi, Sanjeev Arora

Keywords Paper

representation learning, self-supervised learning, language models, theory, transfer learning, natural language processing, unsupervised learning

0

0

0

0

5:16

08/12/2020

Mitigating Silence in Compliance Terminology during Parsing of Utterances

Esme Manandise, Conrad de Peuter

Keywords Paper

0

0

0

0

17:48

02/02/2021

MERL: Multimodal Event Representation Learning in Heterogeneous Embedding Spaces

Linhai Zhang, Deyu Zhou, Yulan He, Zeng Yang

Keywords Paper

0

0

0

0

13:57

26/04/2020

Residual Energy-Based Models for Text Generation

Yuntian Deng, Anton Bakhtin, Myle Ott and
Arthur Szlam, Marc'Aurelio Ranzato

Keywords Paper

energy-based models, text generation

0

0

0

0

4:59

12/07/2020

Predictive Multiplicity in Classification

Charles Marx, Flavio Calmon, Berk Ustun

Keywords Paper

Accountability, Transparency and Interpretability

0

0

0

0

10:19

16/11/2020

Methods for Numeracy-Preserving Word Embeddings

Dhanasekar Sundararaman, Shijing Si, Vivek Subramanian and
Guoyin Wang, Devamanyu Hazarika, Lawrence Carin

Keywords Paper

numerical reasoning, question answering, list maximum, decoding

0

0

0

0

12:18

20/08/2020

Raising Expectations: Automating Expected Cost Analysis with Types

Di Wang, David M. Kahn, Jan Hoffmann

Keywords Paper

resource-aware type system, expected execution cost, analysis of probabilistic programs

0

0

0

0

15:02

02/02/2021

Multi-Dimensional Explanation of Target Variables from Documents

Diego Antognini, Claudiu Musat, Boi Faltings

Keywords Paper

0

0

0

0

19:03

22/06/2020

Exploiting Semantic Relations for Fine-grained Entity Typing

Hongliang Dai, Yangqiu Song, Xin Li

Keywords Paper

Fine-grained Entity Typing, Hypernym Extraction, Semantic Role Labeling

0

0

0

0

4:45

18/07/2021

Explaining Time Series Predictions with Dynamic Masks

Jonathan Crabbé, Mihaela van der Schaar

Keywords Paper

Social Aspects of Machine Learning, Fairness, Accountability, and Transparency

0

0

0

0

5:17

06/12/2021

Local Explanation of Dialogue Response Generation

Yi-Lin Tuan, Connor Pryor, Wenhu Chen and
Lise Getoor, William Yang Wang

Keywords Paper

machine learning

0

0

0

0

13:14

19/08/2021

Method of Moments for Topic Models with Mixed Discrete and Continuous Features

Joachim Giesen, Paul Kahlmeyer, Sören Laue and
Matthias Mitterreiter, Frank Nussbaum, Christoph Staudt, Sina Zarrieß

Keywords Paper

Machine Learning, Learning Generative Models, Probabilistic Machine Learning, Unsupervised Learning

0

0

0

0

15:24

26/04/2020

A Probabilistic Formulation of Unsupervised Text Style Transfer

Junxian He, Xinyi Wang, Graham Neubig, Taylor Berg-Kirkpatrick

Keywords Paper

unsupervised text style transfer, deep latent sequence model

0

0

0

0

5:02

06/12/2021

Higher Order Kernel Mean Embeddings to Capture Filtrations of Stochastic Processes

Cristopher Salvi, Maud Lemercier, Chong Liu and
Blanka Horvath, Theodoros Damoulas, Terry Lyons

Keywords Paper

machine learning, graph learning, causality

0

0

0

0

15:02

12/07/2020

Automatic Reparameterisation of Probabilistic Programs

Maria Gorinova, Dave Moore, Matthew Hoffman

Keywords Paper

Probabilistic Inference - Models and Probabilistic Programming

0

0

0

0

15:40

04/07/2020

Predictive Biases in Natural Language Processing Models: A Conceptual Framework and Overview

Deven Santosh Shah, H. Andrew Schwartz, Dirk Hovy

Keywords Paper

NLP, Natural Models, Conceptual Framework, mitigation techniques

0

0

0

0

11:52

08/12/2020

Predicting Modality in Financial Dialogue

Kilian Theil, Heiner Stuckenschmidt

Keywords Paper

0

0

0

0

8:42

08/12/2020

SpanAlign: Sentence Alignment Method based on Cross-Language Span Prediction and ILP

Katsuki Chousa, Masaaki Nagata, Masaaki Nishino

Keywords Paper

0

0

0

0

14:39

04/07/2020

Neural Mixed Counting Models for Dispersed Topic Discovery

Jiemin Wu, Yanghui Rao, Zusheng Zhang and
Haoran Xie, Qing Li, Fu Lee Wang, Ziye Chen

Keywords Paper

Dispersed Discovery, mining topics, Neural Models, Mixed models

0

0

0

0

10:29

08/12/2020

A Neural Model for Aggregating Coreference Annotation in Crowdsourcing

Maolin Li, Hiroya Takamura, Sophia Ananiadou

Keywords Paper

0

0

0

0

13:34

26/08/2020

DYNOTEARS: Structure Learning from Time-Series Data

Roxana Pamfil, Nisara Sriwattanaworachai, Shaan Desai and
Philip Pilgerstorfer, Konstantinos Georgatzis, Paul Beaumont, Bryon Aragam

Keywords Paper

0

0

0

0

14:45

06/12/2021

Using Random Effects to Account for High-Cardinality Categorical Features and Repeated Measures in Deep Neural Networks

Giora Simchoni, Saharon Rosset

Keywords Paper

deep learning, machine learning, vision

0

0

0

0

13:33

14/06/2020

A Unified Optimization Framework for Low-Rank Inducing Penalties

Marcus Valtonen Örnhag, Carl Olsson

Keywords Paper

low rank approximation, convex relaxation, non-rigid structure from motion, shrinking bias

0

0

0

0

1:01

04/07/2020

Conditional Augmentation for Aspect Term Extraction via Masked Sequence-to-Sequence Generation

Kun Li, Chengbo Chen, Xiaojun Quan and
Qing Ling, Yan Song

Keywords Paper

Conditional Augmentation, Aspect Extraction, sentiment analysis, data augmentation

0

0

0

0

11:30

13/04/2021

A variational inference approach to learning multivariate wold processes

Jalal Etesami, William Trouleau, Negar Kiyavash and
Matthias Grossglauser, Patrick Thiran

Keywords Paper

0

0

0

0

2:46

05/12/2020

Comparing probabilistic, distributional and transformer-based models on logical metonymy interpretation

Giulia Rambelli, Emmanuele Chersoni, Alessandro Lenci and
Philippe Blache, Chu-Ren Huang

Keywords Paper

0

0

0

0

13:45

26/08/2020

Model-Agnostic Counterfactual Explanations for Consequential Decisions

Amir-Hossein Karimi, Gilles Barthe, Borja Balle, Isabel Valera

Keywords Paper

0

0

0

0

16:25

19/04/2021

Randomized deep structured prediction for discourse-level processing

Manuel Widmoser, Maria Leonor Pacheco, Jean Honorio, Dan Goldwasser

Keywords Paper

0

0

0

0

9:44

20/08/2020

Denotational Recurrence Extraction for Amortized Analysis

Joseph W. Cutler, Dan Licata, Norman Danner

Keywords Paper

amortized analysis, recurrence extraction, denotational semantics, resource analysis, higher order recurrences, cost semantics

0

0

0

0

14:53

19/08/2021

What Changed? Interpretable Model Comparison

Rahul Nair, Massimiliano Mattetti, Elizabeth Daly and
Dennis Wei, Oznur Alkan, Yunfeng Zhang

Keywords Paper

Machine Learning, Explainable/Interpretable Machine Learning, Explainability

0

0

0

0

13:12

06/12/2021

Diversity Enhanced Active Learning with Strictly Proper Scoring Rules

Wei Tan, Lan Du, Wray Buntine

Keywords Paper

machine learning, active learning

0

0

0

0

13:21

19/01/2020

Trace Types and Denotational Semantics for Sound Programmable Inference in Probabilistic Languages

Alexander K. Lew, Marco Cusumano-Towner, Benjamin Sherman and
Michael Carbin, Vikash Mansinghka

Keywords Paper

Probabilistic programming, programmable inference, type systems

0

0

0

0

19:55

04/07/2020

Enabling Language Models to Fill in the Blanks

Chris Donahue, Mina Lee, Percy Liang

Keywords Paper

text infilling, predicting text, writing tools, language modeling

0

0

0

0

7:01

06/12/2021

Divergence Frontiers for Generative Models: Sample Complexity, Quantization Effects, and Frontier Integrals

Lang Liu, Krishna Pillutla, Sean Welleck and
Sewoong Oh, Yejin Choi, Zaid Harchaoui

Keywords Paper

theory, vision, generative model, language

0

0

0

0

8:52

06/12/2021

Control Variates for Slate Off-Policy Evaluation

Nikos Vlassis, Ashok Chandrashekar, Fernando Amat, Nathan Kallus

Keywords Paper

optimization, bandits

0

0

0

0

12:25

04/07/2020

Discrete Optimization for Unsupervised Sentence Summarization with Word-Level Extraction

Raphael Schumann, Lili Mou, Yao Lu and
Olga Vechtomova, Katja Markert

Keywords Paper

Unsupervised Summarization, Word-Level Extraction, Automatic summarization, Discrete Optimization

0

0

0

0

10:39

25/07/2020

Evaluation of cross domain text summarization

Liam Scanlon, Shiwei Zhang, Xiuzhen Zhang, Mark Sanderson

Keywords Paper

headline generation, text summarization, evaluation

0

0

0

0

9:45

14/09/2020

Generating Financial Reports from Macro News via Multiple edits Neural Networks

Wenxin Hu, Yunpeng Ren, Qianhai Financial Holdings Co. and
Ltd., Xiaofeng Zhang

Keywords Paper

financial data mining, text generation model, natural language generation

0

0

0

0

14:01