Towards Understanding Sample Variance in Visually Grounded Language Generation: Evaluations and Observations

16/11/2020

Towards Understanding Sample Variance in Visually Grounded Language Generation: Evaluations and Observations

Wanrong Zhu, Xin Wang, Pradyumna Narayana, Kazoo Sone, Sugato Basu, William Yang Wang

Keywords: visually generation, vision-and-language tasks, cider, utilities

Abstract Paper Similar Papers

Abstract: A major challenge in visually grounded language generation is to build robust benchmark datasets and models that can generalize well in real-world settings. To do this, it is critical to ensure that our evaluation protocols are correct, and benchmarks are reliable. In this work, we set forth to design a set of experiments to understand an important but often ignored problem in visually grounded language generation: given that humans have different utilities and visual attention, how will the sample variance in multi-reference datasets affect the models′ performance? Empirically, we study several multi-reference datasets and corresponding vision-and-language tasks. We show that it is of paramount importance to report variance in experiments; that human-generated references could vary drastically in different datasets/tasks, revealing the nature of each task; that metric-wise, CIDEr has shown systematically larger variances than others. Our evaluations on reference-per-instance shed light on the design of reliable datasets in the future.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at EMNLP 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

16/11/2020

Interpretable Multi-dataset Evaluation for Named Entity Recognition

Jinlan Fu, Pengfei Liu, Graham Neubig

Keywords Paper

natural tasks, interpretable evaluation, named task, analysis tool

0

0

0

0

11:11

06/12/2021

Reliable Post hoc Explanations: Modeling Uncertainty in Explainability

Dylan Slack, Anna Hilgard, Sameer Singh, Himabindu Lakkaraju

Keywords Paper

robustness, interpretability

0

0

0

0

15:06

04/07/2020

Reverse Engineering Configurations of Neural Text Generation Models

Yi Tay, Dara Bahri, Che Zheng and
Clifford Brunk, Donald Metzler, Andrew Tomkins

Keywords Paper

Reverse Models, neural modeling, Neural Models, generative models

0

0

0

0

6:16

12/07/2020

Predictive Multiplicity in Classification

Charles Marx, Flavio Calmon, Berk Ustun

Keywords Paper

Accountability, Transparency and Interpretability

0

0

0

0

10:19

26/04/2020

Measuring the Reliability of Reinforcement Learning Algorithms

Stephanie C.Y. Chan, Samuel Fishman, Anoop Korattikara and
John Canny, Sergio Guadarrama

Keywords Paper

reinforcement learning, metrics, statistics, reliability

0

0

0

0

5:32

02/02/2021

Appearance-Motion Memory Consistency Network for Video Anomaly Detection

Ruichu Cai, Hao Zhang, Wen Liu and
Shenghua Gao, Zhifeng Hao

Keywords Paper

0

0

0

0

19:08

12/07/2020

From ImageNet to Image Classification: Contextualizing Progress on Benchmarks

Dimitris Tsipras, Shibani Santurkar, Logan Engstrom and
Andrew Ilyas, Aleksander Madry

Keywords Paper

Deep Learning - General

0

0

0

0

14:33

16/11/2020

Unsupervised Quality Estimation for Neural Machine Translation

Marina Fomicheva, Shuo Sun, Lisa Yankovskaya and
Frédéric Blain, Francisco Guzmán, Mark Fishel, Nikolaos Aletras, Vishrav Chaudhary, Lucia Specia

Keywords Paper

machine mt, real-world applications, qe, uncertainty quantification

0

0

1

0

12:19

04/07/2020

Improving Image Captioning Evaluation by Considering Inter References Variance

Yanzhi Yi, Hangyu Deng, Jinglu Hu

Keywords Paper

Image Evaluation, Evaluating captions, system-level tasks, BERTScore

0

0

0

0

11:31

16/11/2020

Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics

Swabha Swayamdipta, Roy Schwartz, Nicholas Lourie and
Yizhong Wang, Hannaneh Hajishirzi, Noah A. Smith, Yejin Choi

Keywords Paper

nlp research, out-of-distribution generalization, model optimization, data maps

0

0

0

0

12:03

26/08/2020

Variational Autoencoders for Sparse and Overdispersed Discrete Data

He Zhao, Piyush Rai, Lan Du and
Wray Buntine, Dinh Phung, Mingyuan Zhou

Keywords Paper

0

0

0

0

14:28

18/07/2021

Mandoline: Model Evaluation under Distribution Shift

Mayee Chen, Karan Goel, Nimit Sohoni and
Fait Poms, Kayvon Fatahalian, Christopher Re

Keywords Paper

Algorithms, Others

0

0

0

1

5:49

16/11/2020

An Exploration of Arbitrary-Order Sequence Labeling via Energy-Based Inference Networks

Lifu Tu, Tianyu Liu, Kevin Gimpel

Keywords Paper

natural processing, sequence labeling, semantic labeling, parsing

0

0

0

0

10:07

16/11/2020

Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution Data

Lingkai Kong, Haoming Jiang, Yuchen Zhuang and
Jie Lyu, Tuo Zhao, Chao Zhang

Keywords Paper

augmented training, in-distribution calibration, text classification, expectation error

0

0

0

0

11:47

02/02/2021

Active Bayesian Assessment of Black-Box Classifiers

Disi Ji, Robert L. Logan, Padhraic Smyth, Mark Steyvers

Keywords Paper

0

0

0

0

14:47

05/04/2021

SUOD: Accelerating Large-Scale Unsupervised Heterogeneous Outlier Detection

Yue Zhao, Xiyang Hu, Cheng Cheng and
Cong Wang, Changlin Wan, Wen Wang, Jianing Yang, Haoping Bai, Zheng Li, Cao Xiao, Yunlong Wang, Zhi Qiao, Jimeng Sun, Leman Akoglu

Keywords Paper

Algorithms -> Adversarial Learning, Algorithms -> Image Segmentation; Algorithms -> Semi-Supervised Learning; Applications -> Computer Vision; Applications -> Imag

0

0

0

0

4:53

05/04/2021

SUOD: Accelerating Large-Scale Unsupervised Heterogeneous Outlier Detection

Yue Zhao, Xiyang Hu, Cheng Cheng and
Cong Wang, Changlin Wan, Wen Wang, Jianing Yang, Haoping Bai, Zheng Li, Cao Xiao, Yunlong Wang, Zhi Qiao, Jimeng Sun, Leman Akoglu

Keywords Paper

Algorithms -> Adversarial Learning, Algorithms -> Image Segmentation; Algorithms -> Semi-Supervised Learning; Applications -> Computer Vision; Applications -> Imag

0

0

0

0

18:47

05/01/2021

ChartOCR: Data Extraction From Charts Images via a Deep Hybrid Framework

Junyu Luo, Zekun Li, Jinpeng Wang, Chin-Yew Lin

Keywords Paper

0

0

0

0

4:58

02/02/2021

High Dimensional Level Set Estimation with Bayesian Neural Network

Huong Ha, Sunil Gupta, Santu Rana, Svetha Venkatesh

Keywords Paper

0

0

0

0

19:14

19/08/2021

An Information-Theoretic Approach on Causal Structure Learning for Heterogeneous Data Characteristics of Real-World Scenarios

Johannes Huegle

Keywords Paper

Machine Learning, Learning Graphical Models, Probabilistic Machine Learning, Statistical Methods and Machine Learning, Action, Change and Causality

0

0

0

0

12:06

06/12/2021

On Interaction Between Augmentations and Corruptions in Natural Corruption Robustness

Eric Mintun, Alexander Kirillov, Saining Xie

Keywords Paper

deep learning, robustness, vision

0

0

0

0

12:36

14/06/2020

Learning Saliency Propagation for Semi-Supervised Instance Segmentation

Yanzhao Zhou, Xin Wang, Jianbin Jiao and
Trevor Darrell, Fisher Yu

Keywords Paper

semi-supervised, instance segmentation, saliency, propagation, message passing, multiple instance learning, partial-supervised, generalization

0

0

0

0

1:01

14/06/2020

SAM: The Sensitivity of Attribution Methods to Hyperparameters

Naman Bansal, Chirag Agarwal, Anh Nguyen

Keywords Paper

xai, explainable, attribution, sensitivity, robustness, explanation, hyperparameters

0

0

0

0

8:50

02/02/2021

Learning Prediction Intervals for Model Performance

Benjamin Elder, Matthew Arnold, Anupama Murthi, Jiří Navrátil

Keywords Paper

0

0

0

0

20:12

25/07/2020

How to measure the reproducibility of system-oriented IR experiments

Timo Breuer, Nicola Ferro, Norbert Fuhr and
Maria Maistro, Tetsuya Sakai, Philipp Schaer, Ian Soboroff

Keywords Paper

replicability, measure, reproducibility

0

0

0

0

15:24

16/11/2020

A Diagnostic Study of Explainability Techniques for Text Classification

Pepa Atanasova, Jakob Grue Simonsen, Christina Lioma, Isabelle Augenstein

Keywords Paper

downstream tasks, machine learning, explainability techniques, diverse techniques

0

0

0

0

11:24

02/02/2021

Region-aware Global Context Modeling for Automatic Nerve Segmentation from Ultrasound Images

Huisi Wu, Jiasheng Liu, Wei Wang and
Zhenkun Wen, Jing Qin

Keywords Paper

0

0

0

0

15:15

03/05/2021

Trusted Multi-View Classification

Zongbo Han, Changqing Zhang, Huazhu FU, Joey T Zhou

Keywords Paper

Uncertainty Machine Learning, Multi-View Learning, Multi-Modal Learning

0

0

0

0

4:33

03/05/2021

Heteroskedastic and Imbalanced Deep Learning with Adaptive Regularization

Kaidi Cao, Yining Chen, Junwei Lu and
Nikos Arechiga, Adrien Gaidon, Tengyu Ma

Keywords Paper

imbalanced learning, noise robust learning, deep learning

0

0

0

0

5:14

02/02/2021

Semantics Altering Modifications for Evaluating Comprehension in Machine Reading

Viktor Schlegel, Goran Nenadic, Riza Batista-Navarro

Keywords Paper

0

0

0

0

18:42

18/07/2021

Delving into Deep Imbalanced Regression

Yuzhe Yang, Kaiwen Zha, YINGCONG CHEN and
Hao Wang, Dina Katabi

Keywords Paper

Applications

0

0

0

0

16:37

06/12/2021

Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation

Nicklas Hansen, Hao Su, Xiaolong Wang

Keywords Paper

reinforcement learning and planning, transformers

0

0

0

0

8:43

14/06/2020

Dynamic Refinement Network for Oriented and Densely Packed Object Detection

Xingjia Pan, Yuqiang Ren, Kekai Sheng and
Weiming Dong, Haolei Yuan, Xiaowei Guo, Chongyang Ma, Changsheng Xu

Keywords Paper

object detection, oriented, densely packed, sku110k, feature selection, dynamic, anchor-free

0

0

0

0

5:01

12/07/2020

Automatic Reparameterisation of Probabilistic Programs

Maria Gorinova, Dave Moore, Matthew Hoffman

Keywords Paper

Probabilistic Inference - Models and Probabilistic Programming

0

0

0

0

15:40

06/12/2021

Nested Counterfactual Identification from Arbitrary Surrogate Experiments

Juan Correa, Sanghack Lee, Elias Bareinboim

Keywords Paper

graph learning, causality, fairness

0

0

0

0

13:34

30/11/2020

MLIFeat: Multi-level information fusion based deep local features

Yuyang Zhang Institute of Automation, Chinese Academy of Sciences, University of Chinese Academy of Sciences and
Jinge Wang, Shibiao Xu, Xiao Liu, Xiaopeng Zhang

Keywords Paper

0

0

0

0

5:28

06/12/2020

From Predictions to Decisions: Using Lookahead Regularization

Nir Rosenfeld, Sophie Hilgard, Sai Ravindranath, David Parkes

Keywords Paper

0

0

0

0

3:10

25/07/2020

Neural hierarchical factorization machines for user’s event sequence analysis

Dongbo Xi, Fuzhen Zhuang, Bowen Song and
Yongchun Zhu, Shuai Chen, Dan Hong, Tao Chen, Xi Gu, Qing He

Keywords Paper

sequence representation, event sequence analysis, neural hierarchical factorization machines, event representation

0

0

1

0

7:50

26/04/2020

Input Complexity and Out-of-distribution Detection with Likelihood-based Generative Models

Joan Serrà, David Álvarez, Vicenç Gómez and
Olga Slizovskaia, José F. Núñez, Jordi Luque

Keywords Paper

OOD, generative models, likelihood

0

0

0

0

5:26

14/09/2020

A context-based approach to detect abnormal human behaviors in ambient intelligent systems

Roghayeh Mojarad, Ferhat Attal, Abdelghani Chibani, Yacine Amirat

Keywords Paper

context-aware approach, human behavior analysis, abnormal human behavior detection, answer set programming

0

0

0

0

13:24