16 November 2020

STORIUM: A Dataset and Evaluation Platform for Machine-in-the-Loop Story Generation

Nader Akoury, Shufan Wang, Josh Whiting, Stephen Hood, Nanyun Peng, Mohit Iyyer

Keywords: story generation, text evaluation, story models, STORIUM

Abstract: Systems for story generation are asked to produce plausible and enjoyable stories given an input context. This task is underspecified, as a vast number of diverse stories can originate from a single input. The large output space makes it difficult to build and evaluate story generation models, as (1) existing datasets lack rich enough contexts to meaningfully guide models, and (2) existing evaluations (both crowdsourced and automatic) are unreliable for assessing long-form creative text. To address these issues, we introduce a dataset and evaluation platform built from STORIUM, an online collaborative storytelling community. Our author-generated dataset contains 6K lengthy stories (125M tokens) with fine-grained natural language annotations (e.g., character goals and attributes) interspersed throughout each narrative, forming a robust source for guiding models. We evaluate language models fine-tuned on our dataset by integrating them onto STORIUM, where real authors can query a model for suggested story continuations and then edit them. Automatic metrics computed over these edits correlate well with both user ratings of generated stories and qualitative feedback from semi-structured user interviews. We release both the STORIUM dataset and evaluation platform to spur more principled research into story generation.
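
The abstract does not specify how the automatic metrics over author edits are computed. As a minimal sketch (not the paper's implementation), one plausible edit-based metric is the fraction of model-generated tokens an author keeps after editing a suggested continuation; the function name `preserved_token_fraction` and the naive whitespace tokenization below are illustrative assumptions.

```python
# Hypothetical sketch of an edit-based metric: the fraction of
# model-generated tokens that survive an author's edit.
# Illustrative only; not the metric defined in the paper.
from difflib import SequenceMatcher

def preserved_token_fraction(generated: str, edited: str) -> float:
    """Fraction of generated tokens retained in the edited text,
    measured via longest-matching-block alignment (difflib)."""
    gen_tokens = generated.split()   # naive whitespace tokenization (assumption)
    edit_tokens = edited.split()
    if not gen_tokens:
        return 0.0
    matcher = SequenceMatcher(a=gen_tokens, b=edit_tokens, autojunk=False)
    kept = sum(block.size for block in matcher.get_matching_blocks())
    return kept / len(gen_tokens)

# Example: an author lightly edits a suggested continuation.
generated = "The knight rode north , seeking the ruined tower ."
edited = "The knight rode north at dawn , seeking the tower ."
print(f"{preserved_token_fraction(generated, edited):.2f}")  # 0.90
```

A metric like this is cheap to compute at scale over platform logs; the paper's claim is that such edit-derived scores track both user ratings and interview feedback, which is what makes the platform useful for evaluation.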

The talk and the corresponding paper were presented at the EMNLP 2020 virtual conference.

