Dynamic Sentence Boundary Detection for Simultaneous Translation

01/07/2020

Dynamic Sentence Boundary Detection for Simultaneous Translation

Ruiqing Zhang, Chuanqiang Zhang

Keywords:

Abstract Paper Similar Papers

Abstract: Simultaneous Translation is a great challenge in which translation starts before the source sentence finished. Most studies take transcription as input and focus on balancing translation quality and latency for each sentence. However, most ASR systems can not provide accurate sentence boundaries in realtime. Thus it is a key problem to segment sentences for the word streaming before translation. In this paper, we propose a novel method for sentence boundary detection that takes it as a multi-class classification task under the end-to-end pre-training framework. Experiments show significant improvements both in terms of translation quality and latency.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ACL Workshops virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

04/07/2020

Curriculum Pre-training for End-to-End Speech Translation

Chengyi Wang, Yu Wu, Shujie Liu and
Ming Zhou, Zhenglu Yang

Keywords Paper

Curriculum Pre-training, End-to-End Translation, speech recognition, transcription learning

0

0

0

0

11:10

16/11/2020

Multi-Stage Pre-training for Low-Resource Domain Adaptation

Rong Zhang, Revanth Gangi Reddy, Md Arafat Sultan and
Vittorio Castelli, Anthony Ferritto, Radu Florian, Efsun Sarioglu Kayi, Salim Roukos, Avi Sil, Todd Ward

Keywords Paper

nlp tasks, fine-tuning, auxiliary tasks, lm transfer

0

0

0

0

6:56

04/07/2020

A Simple and Effective Unified Encoder for Document-Level Machine Translation

Shuming Ma, Dongdong Zhang, Ming Zhou

Keywords Paper

Document-Level Translation, Unified Encoder, encoders, pre-training models

0

0

0

0

7:04

04/07/2020

Meta-Transfer Learning for Code-Switched Speech Recognition

Genta Indra Winata, Samuel Cahyawijaya, Zhaojiang Lin and
Zihan Liu, Peng Xu, Pascale Fung

Keywords Paper

Code-Switched Recognition, speech recognition, speech tasks, language tasks

0

0

0

0

6:07

19/08/2021

Automatically Paraphrasing via Sentence Reconstruction and Round-trip Translation

Zilu Guo, Zhongqiang Huang, Kenny Q. Zhu and
Guandan Chen, Kaibo Zhang, Boxing Chen, Fei Huang

Keywords Paper

Natural Language Processing, Machine Translation, Natural Language Generation, NLP Applications and Tools

0

0

0

0

13:53

05/12/2020

A general framework for adaptation of neural machine translation to simultaneous translation

Yun Chen, Liangyou Li, Xin Jiang and
Xiao Chen, Qun Liu

Keywords Paper

0

0

0

0

14:22

04/07/2020

SimulSpeech: End-to-End Simultaneous Speech to Text Translation

Yi Ren, Jinglin Liu, Xu Tan and
Chen Zhang, Tao Qin, Zhou Zhao, Tie-Yan Liu

Keywords Paper

simultaneous translation, simultaneous recognition, ASR, NMT

0

0

0

0

5:51

02/02/2021

Synchronous Interactive Decoding for Multilingual Neural Machine Translation

Hao He, Qian Wang, Zhipeng Yu and
Yang Zhao, Jiajun Zhang, Chengqing Zong

Keywords Paper

0

0

0

0

14:32

16/11/2020

Learning Adaptive Segmentation Policy for Simultaneous Translation

Ruiqing Zhang, Chuanqiang Zhang, Zhongjun He and
Hua Wu, Haifeng Wang

Keywords Paper

simultaneous translation, translation, segmentation, chinese-english translation

0

0

0

0

11:43

04/07/2020

Jointly Learning to Align and Summarize for Neural Cross-Lingual Summarization

Yue Cao, Hui Liu, Xiaojun Wan

Keywords Paper

Neural Summarization, Cross-lingual summarization, cross-lingual training, pipeline methods

0

0

0

0

9:30

08/12/2020

Automatic Interlinear Glossing for Under-Resourced Languages Leveraging Translations

Xingyuan Zhao, Satoru Ozaki, Antonios Anastasopoulos and
Graham Neubig, Lori Levin

Keywords Paper

0

0

0

0

13:52

02/02/2021

Consecutive Decoding for Speech-to-text Translation

Qianqian Dong, Mingxuan Wang, Hao Zhou and
Shuang Xu, Bo Xu, Lei Li

Keywords Paper

0

0

0

0

14:20

19/08/2021

Cross-Domain Slot Filling as Machine Reading Comprehension

Mengshi Yu, Jian Liu, Yufeng Chen and
Jinan Xu, Yujie Zhang

Keywords Paper

Natural Language Processing, Dialogue, Information Extraction

0

0

0

0

11:09

04/07/2020

Unsupervised Word Translation with Adversarial Autoencoder

Tasnim Mohiuddin, Shafiq Joty

Keywords Paper

Unsupervised Translation, machine translation, transfer learning, word task

0

0

0

0

14:56

16/11/2020

Structured Pruning of Large Language Models

Ziheng Wang, Jeremy Wohlwend, Tao Lei

Keywords Paper

natural tasks, model compression, language tasks, pruning embeddings

0

0

0

0

11:04

16/11/2020

Small but Mighty: New Benchmarks for Split and Rephrase

Li Zhang, Huaiyu Zhu, Siddhartha Brahma, Yunyao Li

Keywords Paper

text task, fine-grained evaluation, automatic process, rule-based model

0

0

0

0

6:58

16/11/2020

Partially-Aligned Data-to-Text Generation with Distant Supervision

Zihao Fu, Bei Shi, Wai Lam and
Lidong Bing, Zhiyuan Liu

Keywords Paper

data-to-text task, generation task, dataset problem, over-generation problem

0

0

0

0

11:58

04/07/2020

Towards Faithful Neural Table-to-Text Generation with Content-Matching Constraints

Zhenyi Wang, Xiaoyang Wang, Bang An and
Dong Yu, Changyou Chen

Keywords Paper

Faithful Generation, Text generation, table-to-text problem, Transformer-based framework

0

0

0

1

10:15

19/04/2021

Understanding pre-editing for black-box neural machine translation

Rei Miyata, Atsushi Fujita

Keywords Paper

0

0

0

0

11:44

16/11/2020

Dynamic Context Selection for Document-level Neural Machine Translation via Reinforcement Learning

Xiaomian Kang, Yang Zhao, Jiajun Zhang, Chengqing Zong

Keywords Paper

document-level translation, translations, document-level model, selection module

0

0

0

0

11:36

04/07/2020

Using Context in Neural Machine Translation Training Objectives

Danielle Saunders, Felix Stahlberg, Bill Byrne

Keywords Paper

Neural training, NMT training, document-level training, NMT objective

0

0

0

0

6:48

04/07/2020

A Systematic Study of Inner-Attention-Based Sentence Representations in Multilingual Neural Machine Translation

Raúl Vázquez, Alessandro Raganato, Mathias Creutz, Jörg Tiedemann

Keywords Paper

Multilingual Translation, Neural translation, transfer learning, translation

0

0

0

0

14:05

26/04/2020

Neural Machine Translation with Universal Visual Representation

Zhuosheng Zhang, Kehai Chen, Rui Wang and
Masao Utiyama, Eiichiro Sumita, Zuchao Li, Hai Zhao

Keywords Paper

Neural Machine Translation, Visual Representation, Multimodal Machine Translation, Language Representation

0

0

0

0

4:50

02/02/2021

Fake it Till You Make it: Self-Supervised Semantic Shifts for Monolingual Word Embedding Tasks

Maurício Gruppi, Pin-Yu Chen, Sibel Adali

Keywords Paper

0

0

0

0

19:35

01/07/2020

Expand and Filter: CUNI and LMU Systems for the WNGT 2020 Duolingo Shared Task

Jindřich Libovický, Zdeněk Kasner, Jindřich Helcl, Ondřej Dušek

Keywords Paper

0

0

0

0

4:59

19/04/2021

Active learning for sequence tagging with deep pre-trained models and Bayesian uncertainty estimates

Artem Shelmanov, Dmitri Puzyrev, Lyubov Kupriyanova and
Denis Belyakov, Daniil Larionov, Nikita Khromov, Olga Kozlova, Ekaterina Artemova, Dmitry V. Dylov, Alexander Panchenko

Keywords Paper

0

0

0

0

11:47

16/11/2020

Self-Supervised Meta-Learning for Few-Shot Natural Language Classification Tasks

Trapit Bansal, Rishikesh Jha, Tsendsuren Munkhdalai, Andrew McCallum

Keywords Paper

nlp applications, fine-tuning, meta-learning problem, supervised tasks

0

0

0

0

11:49

05/12/2020

MaP: A matrix-based prediction approach to improve span extraction in machine reading comprehension

Huaishao Luo, Yu Shi, Ming Gong and
Linjun Shou, Tianrui Li

Keywords Paper

0

0

0

0

7:16

01/07/2020

Re-translation versus Streaming for Simultaneous Translation

Naveen Arivazhagan, Colin Cherry, Wolfgang Macherey, George Foster

Keywords Paper

0

0

0

0

23:21

04/07/2020

Emerging Cross-lingual Structure in Pretrained Language Models

Alexis Conneau, Shijie Wu, Haoran Li and
Luke Zettlemoyer, Veselin Stoyanov

Keywords Paper

multilingual modeling, cross-lingual transfer, transfer, Cross-lingual Models

0

0

0

0

11:49

02/02/2021

Empirical Regularization for Synthetic Sentence Pairs in Unsupervised Neural Machine Translation

Xi Ai, Bin Fang

Keywords Paper

0

0

0

0

15:07

16/11/2020

Pre-training for Abstractive Document Summarization by Reinstating Source Text

Yanyan Zou, Xingxing Zhang, Wei Lu and
Furu Wei, Ming Zhou

Keywords Paper

abstractive summarization, sequence-to-sequence problem, sentence reordering, next generation

0

0

0

0

10:25

04/07/2020

Speech Translation and the End-to-End Promise: Taking Stock of Where We Are

Matthias Sperber, Matthias Paulik

Keywords Paper

Speech Translation, speech recognition, machine translation, data scarcity

0

0

0

0

11:01

02/02/2021

Style-transfer and Paraphrase: Looking for a Sensible Semantic Similarity Metric

Ivan P. Yamshchikov, Viacheslav Shibaev, Nikolay Khlebnikov, Alexey Tikhonov

Keywords Paper

0

0

0

0

19:29

16/11/2020

Translation Artifacts in Cross-lingual Transfer Learning

Mikel Artetxe, Gorka Labaka, Eneko Agirre

Keywords Paper

human translation, cross-lingual learning, natural inference, machine translation

0

0

0

0

11:30

04/07/2020

Opportunistic Decoding with Timely Correction for Simultaneous Translation

Renjie Zheng, Mingbo Ma, Baigong Zheng and
Kaibo Liu, Liang Huang

Keywords Paper

Simultaneous Translation, Chinese-to-English translation, Opportunistic Decoding, Timely Correction

0

0

0

0

6:43

16/11/2020

Cross-Thought for Sentence Encoder Pre-training

Shuohang Wang, Yuwei Fang, Siqi Sun and
Zhe Gan, Yu Cheng, Jingjing Liu, Jing Jiang

Keywords Paper

pre-training encoder, large-scale tasks, question answering, predicting words

0

0

0

0

12:06

16/11/2020

Simulated multiple reference training improves low-resource machine translation

Huda Khayrallah, Brian Thompson, Matt Post, Philipp Koehn

Keywords Paper

machine mt, mt, simulated training, simulated

0

0

0

0

6:56

02/02/2021

A Theoretical Analysis of the Repetition Problem in Text Generation

Zihao Fu, Wai Lam, Anthony Man-Cho So, Bei Shi

Keywords Paper

0

0

0

0

18:44

06/12/2021

NxMTransformer: Semi-Structured Sparsification for Natural Language Understanding via ADMM

Connor Holmes, Minjia Zhang, Yuxiong He, Bo Wu

Keywords Paper

optimization, transformers, language

0

0

0

0

10:53