Effectively pretraining a speech translation decoder with Machine Translation data

16/11/2020

Effectively pretraining a speech translation decoder with Machine Translation data

Ashkan Alinejad, Anoop Sarkar

Keywords: automatic task, neural task, speech translation, end-to-end approach

Abstract Paper Similar Papers

Abstract: Directly translating from speech to text using an end-to-end approach is still challenging for many language pairs due to insufficient data. Although pretraining the encoder parameters using the Automatic Speech Recognition (ASR) task improves the results in low resource settings, attempting to use pretrained parameters from the Neural Machine Translation (NMT) task has been largely unsuccessful in previous works. In this paper, we will show that by using an adversarial regularizer, we can bring the encoder representations of the ASR and NMT tasks closer even though they are in different modalities, and how this helps us effectively use a pretrained NMT decoder for speech translation.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at EMNLP 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

18/07/2021

Fused Acoustic and Text Encoding for Multimodal Bilingual Pretraining and Speech Translation

Renjie Zheng, Junkun Chen, Mingbo Ma, Liang Huang

Keywords Paper

Applications, Natural Language Processing

0

0

0

0

5:19

16/11/2020

Consistent Transcription and Translation of Speech

Matthias Sperber, Hendra Setiawan, Christian Gollan and
Udhay Nallasamy, Matthias Paulik

Keywords Paper

speech translation, jointly speech, joint task, speech step

0

0

0

0

11:52

04/07/2020

Meta-Transfer Learning for Code-Switched Speech Recognition

Genta Indra Winata, Samuel Cahyawijaya, Zhaojiang Lin and
Zihan Liu, Peng Xu, Pascale Fung

Keywords Paper

Code-Switched Recognition, speech recognition, speech tasks, language tasks

0

0

0

0

6:07

03/05/2021

Bidirectional Variational Inference for Non-Autoregressive Text-to-Speech

Yoonhyung Lee, Joongbo Shin, Kyomin Jung

Keywords Paper

VAE, non-autoregressive, speech synthesis, text-to-speech

0

0

0

0

5:40

19/04/2021

Continuous learning in neural machine translation using bilingual dictionaries

Jan Niehues

Keywords Paper

0

0

0

0

11:48

19/04/2021

A phonetic model of non-native spoken word processing

Yevgen Matusevych, Herman Kamper, Thomas Schatz and
Naomi Feldman, Sharon Goldwater

Keywords Paper

0

0

0

0

11:58

04/07/2020

Unsupervised Paraphasia Classification in Aphasic Speech

Sharan Pai, Nikhil Sachdeva, Prince Sachdeva, Rajiv Ratn Shah

Keywords Paper

Unsupervised Classification, speech disorder, naming detection, treatment

0

0

0

0

10:02

01/07/2020

End-to-End Speech Translation with Adversarial Training

Xuancai Li, Chen Kehai, Tiejun Zhao, Muyun Yang

Keywords Paper

0

0

0

0

8:53

05/12/2020

An exploratory study on multilingual quality estimation

Shuo Sun, Marina Fomicheva, Frédéric Blain and
Vishrav Chaudhary, Ahmed El-Kishky, Adithya Renduchintala, Francisco Guzmán, Lucia Specia

Keywords Paper

0

0

0

0

14:31

19/04/2021

Semantic parsing of disfluent speech

Priyanka Sen, Isabel Groves

Keywords Paper

0

0

0

0

7:14

05/12/2020

A general framework for adaptation of neural machine translation to simultaneous translation

Yun Chen, Liangyou Li, Xin Jiang and
Xiao Chen, Qun Liu

Keywords Paper

0

0

0

0

14:22

04/07/2020

MMPE: A Multi-Modal Interface for Post-Editing Machine Translation

Nico Herbig, Tim Düwel, Santanu Pal and
Kalliopi Meladaki, Mahsa Monshizadeh, Antonio Krüger, Josef van Genabith

Keywords Paper

Post-Editing Translation, machine translation, MT, translators

0

0

0

0

11:41

06/12/2021

Speech-T: Transducer for Text to Speech and Beyond

Jiawei Chen, Xu Tan, Yichong Leng and
Jin Xu, Guihua Wen, Tao Qin, Tie-Yan Liu

Keywords Paper

transformers

0

0

0

0

8:38

04/07/2020

Gender in Danger? Evaluating Speech Translation Technology on the MuST-SHE Corpus

Luisa Bentivogli, Beatrice Savoldi, Matteo Negri and
Mattia A. Di Gangi, Roldano Cattoni, Marco Turchi

Keywords Paper

Speech Technology, Translating, machines, machine translation

0

0

0

0

11:02

04/07/2020

Learning Spoken Language Representations with Neural Lattice Language Modeling

Chao-Wei Huang, Yun-Nung Chen

Keywords Paper

NLP tasks, spoken tasks, intent detection, Spoken Representations

0

0

0

0

6:39

06/12/2021

Unsupervised Speech Recognition

Alexei Baevski, Wei-Ning Hsu, Alexis CONNEAU, Michael Auli

Keywords Paper

deep learning, adversarial robustness and security, self-supervised learning, generative model

0

0

0

0

19:16

04/07/2020

LINSPECTOR: Multilingual Probing Tasks for Word Representations

Gözde Gül Sahin, Clara Vania, Ilia Kuznetsov, Iryna Gurevych

Keywords Paper

Word Representations, NLP, classification tasks, probing tasks

0

0

0

0

11:51

01/07/2020

Is 42 the Answer to Everything in Subtitling-oriented Speech Translation?

Alina Karakanta, Matteo Negri, Marco Turchi

Keywords Paper

0

0

0

0

18:10

08/12/2020

Emergent Communication Pretraining for Few-Shot Machine Translation

Yaoyiran Li, Edoardo Maria Ponti, Ivan Vulić, Anna Korhonen

Keywords Paper

0

0

0

0

14:42

08/12/2020

Evaluating Cross-Lingual Transfer Learning Approaches in Multilingual Conversational Agent Models

Lizhen Tan, Olga Golovneva

Keywords Paper

0

0

0

0

9:23

19/04/2021

Better neural machine translation by extracting linguistic information from BERT

Hassan S. Shavarani, Anoop Sarkar

Keywords Paper

0

0

0

0

12:15

06/12/2021

NORESQA: A Framework for Speech Quality Assessment using Non-Matching References

Pranay Manocha, Buye Xu, Anurag Kumar

Keywords Paper

deep learning, robustness, self-supervised learning

0

0

0

0

14:30

04/07/2020

Tagged Back-translation Revisited: Why Does It Really Work?

Benjamin Marie, Raphael Rubino, Atsushi Fujita

Keywords Paper

Tagged Revisited, neural systems, NMT systems, back-translations

0

0

0

0

6:54

04/07/2020

Curriculum Pre-training for End-to-End Speech Translation

Chengyi Wang, Yu Wu, Shujie Liu and
Ming Zhou, Zhenglu Yang

Keywords Paper

Curriculum Pre-training, End-to-End Translation, speech recognition, transcription learning

0

0

0

0

11:10

04/07/2020

Unsupervised Multimodal Neural Machine Translation with Pseudo Visual Pivoting

Po-Yao Huang, Junjie Hu, Xiaojun Chang, Alexander Hauptmann

Keywords Paper

Unsupervised Translation, Unsupervised MT, MT, alignment

0

0

0

0

12:17

12/07/2020

XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalisation

Junjie Hu, Sebastian Ruder, Aditya Siddhant and
Graham Neubig, Orhan Firat, Melvin Johnson

Keywords Paper

Applications - Language, Speech and Dialog

0

0

0

0

15:36

16/11/2020

Language Model Prior for Low-Resource Neural Machine Translation

Christos Baziotis, Barry Haddow, Alexandra Birch

Keywords Paper

neural translation, neural tm, knowledge distillation, training time

0

0

0

0

11:16

04/07/2020

Learning an Unreferenced Metric for Online Dialogue Evaluation

Koustuv Sinha, Prasanna Parthasarathi, Jasmine Wang and
Ryan Lowe, William L. Hamilton, Joelle Pineau

Keywords Paper

Online Evaluation, inference, online setting, Unreferenced Metric

0

0

0

0

6:58

04/07/2020

Language-aware Interlingua for Multilingual Neural Machine Translation

Changfeng Zhu, Heng Yu, Shanbo Cheng, Weihua Luo

Keywords Paper

Multilingual Translation, low-resource scenarios, Language-aware Interlingua, NMT

0

0

0

0

6:09

08/12/2020

ContraCAT: Contrastive Coreference Analytical Templates for Machine Translation

Dario Stojanovski, Benno Krojer, Denis Peskov, Alexander Fraser

Keywords Paper

0

0

0

0

14:09

19/04/2021

WER-BERT: Automatic WER estimation with BERT in a balanced ordinal classification paradigm

Akshay Krishna Sheshadri, Anvesh Rao Vijjini, Sukhdeep Kharbanda

Keywords Paper

0

0

0

0

11:45

01/07/2020

Robust Neural Machine Translation with ASR Errors

Haiyang Xue, Yang Feng, Shuhao Gu, Wei Chen

Keywords Paper

0

0

0

0

8:15

01/07/2020

Encodings of Source Syntax: Similarities in NMT Representations Across Target Languages

Tyler A. Chang, Anna Rafferty

Keywords Paper

0

0

0

0

4:00

08/12/2020

Text Classification by Contrastive Learning and Cross-lingual Data Augmentation for Alzheimer’s Disease Detection

Zhiqiang Guo, Zhaoci Liu, Zhenhua Ling and
Shijin Wang, Lingjing Jin, Yunxia Li

Keywords Paper

0

0

0

0

13:12

01/07/2020

Adapting End-to-End Speech Recognition for Readable Subtitles

Danni Liu, Jan Niehues, Gerasimos Spanakis

Keywords Paper

0

0

0

0

22:16

18/07/2021

Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation

Dongchan Min, Dong Bok Lee, Eunho Yang, Sung Ju Hwang

Keywords Paper

Applications, Audio and Speech Processing

0

0

0

0

5:17

16/11/2020

Towards Enhancing Faithfulness for Neural Machine Translation

Rongxiang Weng, Heng Yu, Xiangpeng Wei, Weihua Luo

Keywords Paper

neural nmt, neural, nmt, training strategy

0

0

0

1

11:32

25/04/2020

WithYou: Automated Adaptive Speech Tutoring With Context-Dependent Speech Recognition

Xinlei Zhang, Takashi Miyaki, Jun Rekimoto

Keywords Paper

computer assisted language learning (call), speaking, shadowing, speech recognition, intelligent tutoring system, language learning

0

0

0

0

14:41

19/04/2021

Enriching non-autoregressive transformer with syntactic and semantic structures for neural machine translation

Ye Liu, Yao Wan, Jianguo Zhang and
Wenting Zhao, Philip Yu

Keywords Paper

0

0

0

0

10:18

08/12/2020

TableGPT: Few-shot Table-to-Text Generation with Table Structure Reconstruction and Content Matching

Heng Gong, Yawei Sun, Xiaocheng Feng and
Bing Qin, Wei Bi, Xiaojiang Liu, Ting Liu

Keywords Paper

0

0

0

0

8:45