How Accents Confound: Probing for Accent Information in End-to-End Speech Recognition Systems

04/07/2020

How Accents Confound: Probing for Accent Information in End-to-End Speech Recognition Systems

Archiki Prasad, Preethi Jyothi

Keywords: Probing, Accent Information, End-to-End Systems, end-to-end system

Abstract Paper Similar Papers

Abstract: In this work, we present a detailed analysis of how accent information is reflected in the internal representation of speech in an end-to-end automatic speech recognition (ASR) system. We use a state-of-the-art end-to-end ASR system, comprising convolutional and recurrent layers, that is trained on a large amount of US-accented English speech and evaluate the model on speech samples from seven different English accents. We examine the effects of accent on the internal representation using three main probing techniques: a) Gradient-based explanation methods, b) Information-theoretic measures, and c) Outputs of accent and phone classifiers. We find different accents exhibiting similar trends irrespective of the probing technique used. We also find that most accent information is encoded within the first recurrent layer, which is suggestive of how one could adapt such an end-to-end model to learn representations that are invariant to accents.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ACL 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

19/04/2021

WER-BERT: Automatic WER estimation with BERT in a balanced ordinal classification paradigm

Akshay Krishna Sheshadri, Anvesh Rao Vijjini, Sukhdeep Kharbanda

Keywords Paper

0

0

0

0

11:45

04/07/2020

Towards end-2-end learning for predicting behavior codes from spoken utterances in psychotherapy conversations

Karan Singla, Zhuohao Chen, David Atkins, Shrikanth Narayanan

Keywords Paper

predicting codes, Spoken tasks, voice detection, speaker diarization

0

0

0

0

7:16

08/12/2020

A Sentiment-annotated Dataset of English Causal Connectives

Marta Andersson, Murathan Kurfalı, Robert Östling

Keywords Paper

0

0

0

0

15:35

04/07/2020

Investigating the effect of auxiliary objectives for the automated grading of learner English speech transcriptions

Hannah Craighead, Andrew Caines, Paula Buttery, Helen Yannakoudakis

Keywords Paper

automated transcriptions, automatically speech, multi-task learning, inductive transfer

0

0

0

0

11:37

16/11/2020

The role of context in neural pitch accent detection in English

Elizabeth Nielsen, Mark Steedman, Sharon Goldwater

Keywords Paper

pitch detection, cnn-based model, phenomena, contrast

0

0

0

0

6:41

16/11/2020

BLiMP: The Benchmark of Linguistic Minimal Pairs for English

Alex Warstadt, Alicia Parrish, Haokun Liu and
Anhad Monananey, Wei Peng, Sheng-Fu Wang, Samuel Bowman

Keywords Paper

linguistic, blimp, lms, linguist-crafted templates

0

0

0

0

11:42

08/12/2020

SpanAlign: Sentence Alignment Method based on Cross-Language Span Prediction and ILP

Katsuki Chousa, Masaaki Nagata, Masaaki Nishino

Keywords Paper

0

0

0

0

14:39

18/07/2021

UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data

Chengyi Wang, Yu Wu, Yao Qian and
Kenichi Kumatani, Shujie Liu, Furu Wei, Michael Zeng, Xuedong Huang

Keywords Paper

Applications, Speech Recognition

0

0

0

0

5:19

02/02/2021

Listen, Understand and Translate: Triple Supervision Decouples End-to-end Speech-to-text Translation

Qianqian Dong, Rong Ye, Mingxuan Wang and
Hao Zhou, Shuang Xu, Bo Xu, Lei Li

Keywords Paper

0

0

0

0

14:09

01/07/2020

Contextual and Non-Contextual Word Embeddings: an in-depth Linguistic Investigation

Alessio Miaschi, Felice Dell’Orletta

Keywords Paper

0

0

0

0

4:42

26/04/2020

Multilingual Alignment of Contextual Word Representations

Steven Cao, Nikita Kitaev, Dan Klein

Keywords Paper

multilingual, natural language processing, embedding alignment, BERT, word embeddings, transfer

0

0

0

0

4:55

08/12/2020

Text Classification by Contrastive Learning and Cross-lingual Data Augmentation for Alzheimer’s Disease Detection

Zhiqiang Guo, Zhaoci Liu, Zhenhua Ling and
Shijin Wang, Lingjing Jin, Yunxia Li

Keywords Paper

0

0

0

0

13:12

04/07/2020

Cross-Linguistic Syntactic Evaluation of Word Prediction Models

Aaron Mueller, Garrett Nicolai, Panayiota Petrou-Zeniou and
Natalia Talmina, Tal Linzen

Keywords Paper

Cross-Linguistic Syntax, Syntax, Cross-Linguistic Models, neural models

0

0

0

0

10:48

19/04/2021

Segmenting subtitles for correcting ASR segmentation errors

David Wan, Chris Kedzie, Faisal Ladhak and
Elsbeth Turcan, Petra Galuscakova, Elena Zotkina, Zhengping Jiang, Peter Bell, Kathleen McKeown

Keywords Paper

0

0

0

0

11:49

02/02/2021

Interactive Speech and Noise Modeling for Speech Enhancement

Chengyu Zheng, Xiulian Peng, Yuan Zhang and
Sriram Srinivasan, Yan Lu

Keywords Paper

0

0

0

0

14:47

19/04/2021

Deep subjecthood: Higher-order grammatical features in multilingual BERT

Isabel Papadimitriou, Ethan A. Chi, Richard Futrell, Kyle Mahowald

Keywords Paper

0

0

0

0

11:56

08/12/2020

Understanding the effects of word-level linguistic annotations in under-resourced neural machine translation

Víctor M. Sánchez-Cartagena, Juan Antonio Pérez-Ortiz, Felipe Sánchez-Martínez

Keywords Paper

0

0

0

0

14:59

16/11/2020

Compositional and Lexical Semantics in RoBERTa, BERT and DistilBERT: A Case Study on CoQA

Ieva Staliūnaitė, Ignacio Iacobacci

Keywords Paper

nlp tasks, conversational task, semantic labeling, contextualized embeddings

0

0

0

0

11:23

02/11/2020

Ensemble of sequence matching networks for dynamic sound event localization, detection, and tracking

Thi Ngoc Tho Nguyen, Douglas L. Jones, Woon Seng Gan

Keywords Paper

0

0

0

0

11:06

04/07/2020

On the Robustness of Language Encoders against Grammatical Errors

Fan Yin, Quanyu Long, Tao Meng, Kai-Wei Chang

Keywords Paper

downstream applications, linguistic task, Language Encoders, pre-trained encoders

0

0

0

0

11:09

18/07/2021

Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation

Dongchan Min, Dong Bok Lee, Eunho Yang, Sung Ju Hwang

Keywords Paper

Applications, Audio and Speech Processing

0

0

0

0

5:17

04/07/2020

A Multitask Learning Approach for Diacritic Restoration

Sawsan Alqahtani, Ajay Mishra, Mona Diab

Keywords Paper

Diacritic Restoration, computational processing, restoring diacritics, NLP problems

0

0

0

0

14:12

02/02/2021

Revisiting Mahalanobis Distance for Transformer-Based Out-of-Domain Detection

Alexander Podolskiy, Dmitry Lipin, Andrey Bout and
Ekaterina Artemova, Irina Piontkovskaya

Keywords Paper

0

0

0

0

16:08

06/12/2020

Language Through a Prism: A Spectral Approach for Multiscale Language Representations

Alex Tamkin, Dan Jurafsky, Noah Goodman

Keywords Paper

0

0

0

0

3:34

19/04/2021

Language models for lexical inference in context

Martin Schmitt, Hinrich Schütze

Keywords Paper

0

0

0

0

11:39

19/04/2021

Streaming models for joint speech recognition and translation

Orion Weller, Matthias Sperber, Christian Gollan, Joris Kluivers

Keywords Paper

0

0

0

0

5:11

08/12/2020

A Human Evaluation of AMR-to-English Generation Systems

Emma Manning, Shira Wein, Nathan Schneider

Keywords Paper

0

0

0

0

15:12

02/02/2021

Fake it Till You Make it: Self-Supervised Semantic Shifts for Monolingual Word Embedding Tasks

Maurício Gruppi, Pin-Yu Chen, Sibel Adali

Keywords Paper

0

0

0

0

19:35

19/08/2021

Exemplification Modeling: Can You Give Me an Example, Please?

Edoardo Barba, Luigi Procopio, Caterina Lacerra and
Tommaso Pasini, Roberto Navigli

Keywords Paper

Natural Language Processing, Natural Language Semantics, Resources and Evaluation

0

0

0

0

14:47

08/12/2020

DiaSense at SemEval-2020 Task 1: Modeling Sense Change via Pre-trained BERT Embeddings

Christin Beck

Keywords Paper

0

0

0

0

14:08

06/12/2020

Listening to Sounds of Silence for Speech Denoising

Henry Xu, Rundi Wu, Yuko Ishiwaka and
Carl Vondrick, Changxi Zheng

Keywords Paper

0

0

0

0

3:22

01/07/2020

End-to-End Simultaneous Translation System for IWSLT2020 Using Modality Agnostic Meta-Learning

Hou Jeung Han, Mohd Abbas Zaidi, Sathish Reddy Indurthi and
Nikhil Kumar Lakumarapu, Beomseok Lee, Sangha Kim

Keywords Paper

0

0

0

0

7:29

26/04/2020

Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech

David Harwath, Wei-Ning Hsu, James Glass

Keywords Paper

visually-grounded speech, self-supervised learning, discrete representation learning, vision and language, vision and speech, hierarchical representation learning

0

0

0

0

13:42

08/12/2020

Multi-task Learning of Spoken Language Understanding by Integrating N-Best Hypotheses with Hierarchical Attention

Mingda Li, Xinyue Liu, Weitong Ruan and
Luca Soldaini, Wael Hamza, Chengwei Su

Keywords Paper

0

0

0

0

14:43

04/07/2020

Self-Attention with Cross-Lingual Position Representation

Liang Ding, Longyue Wang, Dacheng Tao

Keywords Paper

natural tasks, WMT'17 tasks, Cross-Lingual Representation, Position encoding

0

0

0

0

7:46

02/02/2021

Audio-Oriented Multimodal Machine Comprehension via Dynamic Inter- and Intra-modality Attention

Zhiqi Huang, Fenglin Liu, Xian Wu and
Shen Ge, Helin Wang, Wei Fan, Yuexian Zou

Keywords Paper

0

0

0

0

14:47

04/07/2020

Recurrent Neural Network Language Models Always Learn English-Like Relative Clause Attachment

Forrest Davis, Marten van Schijndel

Keywords Paper

production, Recurrent Always, language models, RNN LMs

0

0

0

0

7:48

02/02/2021

On the Importance of Word Order Information in Cross-lingual Sequence Labeling

Zihan Liu, Genta I Winata, Samuel Cahyawijaya and
Andrea Madotto, Zhaojiang Lin, Pascale Fung

Keywords Paper

0

0

0

0

15:22

06/12/2021

Diversity Enhanced Active Learning with Strictly Proper Scoring Rules

Wei Tan, Lan Du, Wray Buntine

Keywords Paper

machine learning, active learning

0

0

0

0

13:21

01/07/2020

Towards Understanding ASR Error Correction for Medical Conversations

Anirudh Mani, Shruti Palaskar, Sandeep Konam

Keywords Paper

0

0

0

0

12:40