18/07/2021

Calibrate Before Use: Improving Few-shot Performance of Language Models

Tony Z. Zhao, Eric Wallace, Shi Feng, Dan Klein, Sameer Singh

Keywords: Applications, Natural Language Processing

Abstract: GPT-3 can perform numerous tasks when provided a natural language prompt that contains a few training examples. We show that this type of few-shot learning can be unstable: the choice of prompt format, training examples, and even the order of the examples can cause accuracy to vary from near chance to near state-of-the-art. We demonstrate that this instability arises from the bias of language models towards predicting certain answers, e.g., those that are placed near the end of the prompt or are common in the pre-training data. To mitigate this, we first estimate the model's bias towards each answer by asking for its prediction when given a training prompt and a content-free test input such as "N/A". We then fit calibration parameters that cause the prediction for this input to be uniform across answers. On a diverse set of tasks, this contextual calibration procedure substantially improves GPT-3 and GPT-2's accuracy (up to 30.0% absolute) across different choices of the prompt, while also making learning considerably more stable.
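Below is a minimal sketch of the contextual calibration procedure described in the abstract. It assumes a hypothetical helper label_probs(prompt) that returns the language model's probability for each answer label given a full prompt (few-shot examples plus a test input); the helper name, the concatenation of prompt and input, and the particular set of content-free strings are illustrative assumptions, not the authors' released code.

import numpy as np

def contextual_calibration_weights(label_probs, few_shot_prompt,
                                   content_free_inputs=("N/A",)):
    # Estimate the model's per-label bias by querying it with content-free
    # test inputs appended to the training prompt, then build a diagonal
    # calibration matrix W = diag(p_cf)^-1 (with bias b = 0) so that the
    # calibrated prediction for a content-free input is uniform.
    p_cf = np.mean([label_probs(few_shot_prompt + x) for x in content_free_inputs],
                   axis=0)
    p_cf = p_cf / p_cf.sum()          # normalize over the answer labels
    return np.diag(1.0 / p_cf)        # W such that W @ p_cf is uniform

def calibrated_prediction(label_probs, few_shot_prompt, test_input, W):
    # Apply the fitted calibration to a real test input.
    p = label_probs(few_shot_prompt + test_input)
    q = W @ p                         # reweight each label by 1 / p_cf
    return int(np.argmax(q / q.sum()))

Because W is diagonal and b = 0, the calibrated argmax is simply each label's probability divided by its content-free probability, so the procedure requires no held-out labeled data beyond the prompt itself.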

Talk and the respective paper are published at the ICML 2021 virtual conference.

Comments

  • poohlio 2 years ago

    Q: How are you sure that "N/A" is the unbiased example you need to find the balance between the labels?

