MAUVE: Measuring the Gap Between Neural Text and Human Text using Divergence Frontiers

Abstract: As major progress is made in open-ended text generation, measuring how close machine-generated text is to human language remains a critical open problem. We introduce Mauve, a comparison measure for open-ended text generation, which directly compares the learnt distribution from a text generation model to the distribution of human-written text using divergence frontiers. Mauve scales up to modern text generation models by computing information divergences in a quantized embedding space. Through an extensive empirical study on three open-ended generation tasks, we find that Mauve identifies known properties of generated text, scales naturally with model size, and correlates with human judgments, with fewer restrictions than existing distributional evaluation metrics.

08/12/2020

representation learning, self-supervised learning, language models, theory, transfer learning, natural language processing, unsupervised learning

5:16

16/11/2020

Tom B Brown, Ben Mann, Nick Ryder and
Melanie Subbiah, Jared D Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-Voss, Gretchen M Krueger, Tom Henighan, Rewon Child, Aditya Ramesh, Daniel Ziegler, Jeffrey Wu, Clemens Winter, Chris Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, Scott Gray, Benjamin Chess, Jack Clark, Christopher Berner, Sam McCandlish, Alec Radford, Ilya Sutskever, Dario Amodei

Qi Zhu, Chenyu Gao, Peng Wang, Qi Wu

bezier curve, scene text, end-to-end, detection, recognition, arbitrarily shaped, one stage, align, sampling, deep neural network

5:01

19/04/2021

MAUVE: Measuring the Gap Between Neural Text and Human Text using Divergence Frontiers

Krishna Pillutla, Swabha Swayamdipta, Rowan Zellers, John Thickstun, Sean Welleck, Yejin Choi, Zaid Harchaoui

Comments

Similar Papers

Automatic Interlinear Glossing for Under-Resourced Languages Leveraging Translations

Xingyuan Zhao, Satoru Ozaki, Antonios Anastasopoulos and Graham Neubig, Lori Levin

Keywords Abstract Paper

NLQuAD: A non-factoid long question answering data set

Amir Soleimani, Christof Monz, Marcel Worring

Keywords Abstract Paper

Sparse Text Generation

Pedro Henrique Martins, Zita Marinho, André F. T. Martins

Keywords Abstract Paper

story completion, dialogue generation, text generators, language models

Centering-based Neural Coherence Modeling with Hierarchical Discourse Segments

Sungho Jeon, Michael Strube

Keywords Abstract Paper

automated scoring, neural models, coherence model, linguistic coherence

Neural Machine Translation with Universal Visual Representation

Zhuosheng Zhang, Kehai Chen, Rui Wang and Masao Utiyama, Eiichiro Sumita, Zuchao Li, Hai Zhao

Keywords Abstract Paper

Neural Machine Translation, Visual Representation, Multimodal Machine Translation, Language Representation

Zero-Shot Crosslingual Sentence Simplification

Jonathan Mallinson, Rico Sennrich, Mirella Lapata

Keywords Abstract Paper

sentence simplification, translation, simplification, encoder-decoder models

Simultaneous Machine Translation with Visual Context

Ozan Caglayan, Julia Ive, Veneta Haralampieva and Pranava Madhyastha, Loïc Barrault, Lucia Specia

Keywords Abstract Paper

simt, multimodal approaches, simt frameworks, visually-grounded models

Residual Energy-Based Models for Text Generation

Yuntian Deng, Anton Bakhtin, Myle Ott and Arthur Szlam, Marc'Aurelio Ranzato

Keywords Abstract Paper

energy-based models, text generation

Understanding Points of Correspondence between Sentences for Abstractive Summarization

Logan Lebanoff, John Muchovej, Franck Dernoncourt and Doo Soon Kim, Lidan Wang, Walter Chang, Fei Liu

Keywords Abstract Paper

Abstractive Summarization, coreference resolution, summarization, cohesive devices

Have Your Text and Use It Too! End-to-End Neural Data-to-Text Generation with Semantic Fidelity

Hamza Harkous, Isabel Groves, Amir Saffari

Keywords Abstract Paper

Incremental Neural Lexical Coherence Modeling

Sungho Jeon, Michael Strube

Keywords Abstract Paper

Using Context in Neural Machine Translation Training Objectives

Danielle Saunders, Felix Stahlberg, Bill Byrne

Keywords Abstract Paper

Neural training, NMT training, document-level training, NMT objective

Towards automatically generating Questions under Discussion to link information and discourse structure

Kordula De Kuthy, Madeeswaran Kannan, Haemanth Santhi Ponnusamy, Detmar Meurers

Keywords Abstract Paper

Exemplification Modeling: Can You Give Me an Example, Please?

Edoardo Barba, Luigi Procopio, Caterina Lacerra and Tommaso Pasini, Roberto Navigli

Keywords Abstract Paper

Natural Language Processing, Natural Language Semantics, Resources and Evaluation

Fine-Grained Analysis of Cross-Linguistic Syntactic Divergences

Dmitry Nikolaev, Ofir Arviv, Taelin Karidi and Neta Kenneth, Veronika Mitnik, Lilja Maria Saeboe, Omri Abend

Keywords Abstract Paper

Fine-Grained Divergences, cross-lingual transfer, full automation, cross-lingual parser

LAReQA: Language-Agnostic Answer Retrieval from a Multilingual Pool

Uma Roy, Noah Constant, Rami Al-Rfou and Aditya Barua, Aaron Phillips, Yinfei Yang

Keywords Abstract Paper

language-agnostic retrieval, cross-lingual tasks, cross-lingual retrieval, alignment

Revisiting Mahalanobis Distance for Transformer-Based Out-of-Domain Detection

Alexander Podolskiy, Dmitry Lipin, Andrey Bout and Ekaterina Artemova, Irina Piontkovskaya

Keywords Abstract Paper

Human Attention Maps for Text Classification: Do Humans and Neural Networks Focus on the Same Words?

Cansu Sen, Thomas Hartvigsen, Biao Yin and Xiangnan Kong, Elke Rundensteiner

Keywords Abstract Paper

Text Classification, quantitative mechanisms, text task, large-scale study

Facet-Aware Evaluation for Extractive Summarization

Yuning Mao, Liyuan Liu, Qi Zhu and Xiang Ren, Jiawei Han

Keywords Abstract Paper

Facet-Aware Evaluation, Extractive Summarization, fine-grained evaluation, comparative analysis

Enabling Language Models to Fill in the Blanks

Chris Donahue, Mina Lee, Percy Liang

Keywords Abstract Paper

text infilling, predicting text, writing tools, language modeling

Query by Strings and Return Ranking Word Regions with Only One Look

Peng Zhao, Wenyuan Xue, Qingyong Li, Siqi Cai

Xingyuan Zhao, Satoru Ozaki, Antonios Anastasopoulos and
Graham Neubig, Lori Levin

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Zhuosheng Zhang, Kehai Chen, Rui Wang and
Masao Utiyama, Eiichiro Sumita, Zuchao Li, Hai Zhao

Keywords Paper

Keywords Paper

Ozan Caglayan, Julia Ive, Veneta Haralampieva and
Pranava Madhyastha, Loïc Barrault, Lucia Specia

Keywords Paper

Yuntian Deng, Anton Bakhtin, Myle Ott and
Arthur Szlam, Marc'Aurelio Ranzato

Keywords Paper

Logan Lebanoff, John Muchovej, Franck Dernoncourt and
Doo Soon Kim, Lidan Wang, Walter Chang, Fei Liu

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Edoardo Barba, Luigi Procopio, Caterina Lacerra and
Tommaso Pasini, Roberto Navigli

Keywords Paper

Dmitry Nikolaev, Ofir Arviv, Taelin Karidi and
Neta Kenneth, Veronika Mitnik, Lilja Maria Saeboe, Omri Abend

Keywords Paper

Uma Roy, Noah Constant, Rami Al-Rfou and
Aditya Barua, Aaron Phillips, Yinfei Yang

Keywords Paper

Alexander Podolskiy, Dmitry Lipin, Andrey Bout and
Ekaterina Artemova, Irina Piontkovskaya

Keywords Paper

Cansu Sen, Thomas Hartvigsen, Biao Yin and
Xiangnan Kong, Elke Rundensteiner

Keywords Paper

Yuning Mao, Liyuan Liu, Qi Zhu and
Xiang Ren, Jiawei Han

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Ruipeng Jia, Yanan Cao, Hengzhu Tang and
Fang Fang, Cong Cao, Shi Wang

Keywords Paper

Keywords Paper

Keywords Paper

Raphael Schumann, Lili Mou, Yao Lu and
Olga Vechtomova, Katja Markert

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Jian Liu, Yubo Chen, Kang Liu and
Wei Bi, Xiaojiang Liu

Keywords Paper

Keywords Paper

Nihal Potdar, Anderson Raymundo Avila, Chao Xing and
Dong Wang, Yiran Cao, Xiao Chen

Keywords Paper

Keywords Paper

Keywords Paper

Yuliang Liu, Hao Chen, Chunhua Shen and
Tong He, Lianwen Jin, Liangwei Wang

Keywords Paper

Anna Breit, Artem Revenko, Kiamehr Rezaee and
Mohammad Taher Pilehvar, Jose Camacho-Collados

Keywords Paper