
Semantic Label Smoothing for Sequence to Sequence Problems

Michal Lukasik, Himanshu Jain, Aditya Menon, Seungyeon Kim, Srinadh Bhojanapalli, Felix Yu, Sanjiv Kumar

Keywords: classification, label de-noising, seq2seq settings, machine translation

Abstract: Label smoothing has been shown to be an effective regularization strategy in classification that prevents overfitting and helps with label de-noising. However, extending such methods directly to seq2seq settings, such as Machine Translation, is challenging: the large target output space of such problems makes it intractable to apply label smoothing over all possible outputs. Most existing approaches for seq2seq settings either do token-level smoothing, or smooth over sequences generated by randomly substituting tokens in the target sequence. Unlike these works, in this paper we propose a technique that smooths over well-formed, relevant sequences that not only have sufficient n-gram overlap with the target sequence, but are also semantically similar. Our method shows a consistent and significant improvement over state-of-the-art techniques on different datasets.
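As a rough sketch of the idea (not the authors' implementation), the Python snippet below contrasts standard token-level label smoothing with a hypothetical sequence-level variant that redistributes the smoothing mass over candidate sequences in proportion to their n-gram overlap with the target. The helper names (smoothed_targets, ngram_overlap, sequence_smoothed_loss) and the candidate set are illustrative assumptions, and the paper's additional semantic-similarity filtering of candidates is not reproduced here.

import numpy as np

def smoothed_targets(vocab_size, gold_id, alpha=0.1):
    # Standard token-level label smoothing: the gold token keeps
    # 1 - alpha and the remaining mass is spread uniformly over
    # the rest of the vocabulary.
    q = np.full(vocab_size, alpha / (vocab_size - 1))
    q[gold_id] = 1.0 - alpha
    return q

def ngram_overlap(ref, hyp, n=2):
    # Fraction of hypothesis n-grams also present in the reference;
    # a crude stand-in for the paper's n-gram overlap criterion.
    ngrams = lambda s: {tuple(s[i:i + n]) for i in range(len(s) - n + 1)}
    r, h = ngrams(ref), ngrams(hyp)
    return len(r & h) / max(len(h), 1)

def sequence_smoothed_loss(log_probs, target, candidates, alpha=0.1, n=2):
    # Hypothetical sequence-level smoothing: 1 - alpha goes to the gold
    # sequence; alpha is split across candidates in proportion to their
    # n-gram overlap with the target. log_probs maps a sequence (tuple
    # of token ids) to the model's log-probability of that sequence.
    overlaps = np.array([ngram_overlap(target, c, n) for c in candidates])
    weights = alpha * overlaps / overlaps.sum() if overlaps.sum() > 0 else overlaps
    loss = -(1.0 - alpha) * log_probs[tuple(target)]
    for w, c in zip(weights, candidates):
        loss -= w * log_probs[tuple(c)]
    return loss

# Toy usage: one gold sequence and two candidates; only the candidate
# with bigram overlap against the target receives smoothing mass.
target = [1, 2, 3, 4]
candidates = [[1, 2, 3, 5], [9, 8, 7, 6]]
log_probs = {(1, 2, 3, 4): -1.0, (1, 2, 3, 5): -1.5, (9, 8, 7, 6): -4.0}
print(sequence_smoothed_loss(log_probs, target, candidates))  # 1.05

The uniform per-token smoothing in smoothed_targets is the baseline the method improves on; in the paper, candidates must also pass a semantic-similarity check before receiving smoothing mass.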

The talk and the respective paper were published at the EMNLP 2020 virtual conference.

