A Call for More Rigor in Unsupervised Cross-lingual Learning

Abstract: We review motivations, definition, approaches, and methodology for unsupervised cross-lingual learning and call for a more rigorous position in each of them. An existing rationale for such research is based on the lack of parallel data for many of the world's languages. However, we argue that a scenario without any parallel data and abundant monolingual data is unrealistic in practice. We also discuss different training signals that have been used in previous work, which depart from the pure unsupervised setting. We then describe common methodological issues in tuning and evaluation of unsupervised cross-lingual models and present best practices. Finally, we provide a unified outlook for different types of research in this area (i.e., cross-lingual word embeddings, deep multilingual pretraining, and unsupervised machine translation) and argue for comparable evaluation of these models.

04/07/2020

face presentation attack detection, face anti-spoofing, cross-domain, disentangled representation learning, multi-domain learning.

1:01

23/08/2020

A Call for More Rigor in Unsupervised Cross-lingual Learning

Mikel Artetxe, Sebastian Ruder, Dani Yogatama, Gorka Labaka, Eneko Agirre

Comments

Similar Papers

Unsupervised Word Translation with Adversarial Autoencoder

Tasnim Mohiuddin, Shafiq Joty

Keywords Abstract Paper

Unsupervised Translation, machine translation, transfer learning, word task

Iterative Domain-Repaired Back-Translation

Hao-Ran Wei, Zhirui Zhang, Boxing Chen, Weihua Luo

Keywords Abstract Paper

domain-specific translation, domain adaptation, back-translation method, out-of-domain systems

Cross-Domain Face Presentation Attack Detection via Multi-Domain Disentangled Representation Learning

Guoqing Wang, Hu Han, Shiguang Shan, Xilin Chen

Keywords Abstract Paper

face presentation attack detection, face anti-spoofing, cross-domain, disentangled representation learning, multi-domain learning.

Spectrum-guided adversarial disparity learning

Zhe Liu, Lina Yao, Lei Bai and Xianzhi Wang, Can Wang

Keywords Abstract Paper

adversarial autoencoder, generative models, intraclass variability, activity recognition

Parallel Data Augmentation for Formality Style Transfer

Yi Zhang, Tao Ge, Xu SUN

Keywords Abstract Paper

Parallel Augmentation, Formality Transfer, data methods, parallel data

LAReQA: Language-Agnostic Answer Retrieval from a Multilingual Pool

Uma Roy, Noah Constant, Rami Al-Rfou and Aditya Barua, Aaron Phillips, Yinfei Yang

Keywords Abstract Paper

language-agnostic retrieval, cross-lingual tasks, cross-lingual retrieval, alignment

Domain-adaptive neural automated essay scoring

Yue Cao, Hanqi Jin, Xiaojun Wan, Zhiwei Yu

Keywords Abstract Paper

domain adaptation, natural language processing, automated essay scoring, self-supervised learning

It's Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information

Emanuele Bugliarello, Sabrina J. Mielke, Antonios Anastasopoulos and Ryan Cotterell, Naoaki Okazaki

Keywords Abstract Paper

Measuring Difficulty, generation, asymmetric difficulty, machine difficulty

CrossNER: Evaluating Cross-Domain Named Entity Recognition

Zihan Liu, Yan Xu, Tiezheng Yu and Wenliang Dai, Ziwei Ji, Samuel Cahyawijaya, Andrea Madotto, Pascale Fung

Keywords Abstract Paper

Emerging Cross-lingual Structure in Pretrained Language Models

Alexis Conneau, Shijie Wu, Haoran Li and Luke Zettlemoyer, Veselin Stoyanov

Keywords Abstract Paper

multilingual modeling, cross-lingual transfer, transfer, Cross-lingual Models

Towards Universal Representation Learning for Deep Face Recognition

Yichun Shi, Xiang Yu, Kihyuk Sohn and Manmohan Chandraker, Anil K. Jain

Keywords Abstract Paper

face recognition, universal representation, data augmentation

Learning with Noisy Correspondence for Cross-modal Matching

Zhenyu Huang, Guocheng Niu, Xiao Liu and Wenbiao Ding, Xinyan Xiao, Hua Wu, Xi Peng

Keywords Abstract Paper

deep learning, language

Diversity-Based Generalization for Unsupervised Text Classification under Domain Shift

Jitin Krishnan, Hemant Purohit, Huzefa Rangwala

Keywords Abstract Paper

text classification, unsupervised domain adaptation, natural language processing, neural networks

Revisiting Mahalanobis Distance for Transformer-Based Out-of-Domain Detection

Alexander Podolskiy, Dmitry Lipin, Andrey Bout and Ekaterina Artemova, Irina Piontkovskaya

Keywords Abstract Paper

Targeted data-driven regularization for out-of-distribution generalization

Mohammad Mahdi Kamani, Sadegh Farhang, Mehrdad Mahdavi, James Z. Wang

Keywords Abstract Paper

data-driven regularization, out-of-distribution generalization, bilevel programming

Single-Side Domain Generalization for Face Anti-Spoofing

Yunpei Jia, Jie Zhang, Shiguang Shan, Xilin Chen

Keywords Abstract Paper

face anti-spoofing, face presentation attack detection, domain generalization

Interpretable Multi-dataset Evaluation for Named Entity Recognition

Jinlan Fu, Pengfei Liu, Graham Neubig

Keywords Abstract Paper

natural tasks, interpretable evaluation, named task, analysis tool

A Probabilistic Formulation of Unsupervised Text Style Transfer

Junxian He, Xinyi Wang, Graham Neubig, Taylor Berg-Kirkpatrick

Keywords Abstract Paper

unsupervised text style transfer, deep latent sequence model

Transformer Based Multi-Source Domain Adaptation

Dustin Wright, Isabelle Augenstein

Keywords Abstract Paper

unsupervised adaptation, cnns, rnns, domain classifiers

Translation Artifacts in Cross-lingual Transfer Learning

Mikel Artetxe, Gorka Labaka, Eneko Agirre

Keywords Paper

Keywords Paper

Keywords Paper

Zhe Liu, Lina Yao, Lei Bai and
Xianzhi Wang, Can Wang

Keywords Paper

Keywords Paper

Uma Roy, Noah Constant, Rami Al-Rfou and
Aditya Barua, Aaron Phillips, Yinfei Yang

Keywords Paper

Keywords Paper

Emanuele Bugliarello, Sabrina J. Mielke, Antonios Anastasopoulos and
Ryan Cotterell, Naoaki Okazaki

Keywords Paper

Zihan Liu, Yan Xu, Tiezheng Yu and
Wenliang Dai, Ziwei Ji, Samuel Cahyawijaya, Andrea Madotto, Pascale Fung

Keywords Paper

Alexis Conneau, Shijie Wu, Haoran Li and
Luke Zettlemoyer, Veselin Stoyanov

Keywords Paper

Yichun Shi, Xiang Yu, Kihyuk Sohn and
Manmohan Chandraker, Anil K. Jain

Keywords Paper

Zhenyu Huang, Guocheng Niu, Xiao Liu and
Wenbiao Ding, Xinyan Xiao, Hua Wu, Xi Peng

Keywords Paper

Keywords Paper

Alexander Podolskiy, Dmitry Lipin, Andrey Bout and
Ekaterina Artemova, Irina Piontkovskaya

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Matthias Sperber, Hendra Setiawan, Christian Gollan and
Udhay Nallasamy, Matthias Paulik

Keywords Paper

Keywords Paper

Keywords Paper

Tianhe (Kevin) Yu, Saurabh Kumar, Abhishek Gupta and
Sergey Levine, Karol Hausman, Chelsea Finn

Keywords Paper

Keywords Paper

Keywords Paper

Zibo Lin, Deng Cai, Yan Wang and
Xiaojiang Liu, Haitao Zheng, Shuming Shi

Keywords Paper

Keywords Paper

Keywords Paper

Mengmeng Ma, Jian Ren, Long Zhao and
Sergey Tulyakov, Cathy Wu, Xi Peng

Keywords Paper

Fengxiang Yang, Zhun Zhong, Hong Liu and
Zheng Wang, Zhiming Luo, Shaozi Li, Nicu Sebe, Shin'ichi Satoh

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Yi-Lin Tuan, Ahmed El-Kishky, Adithya Renduchintala and
Vishrav Chaudhary, Francisco Guzmán, Lucia Specia

Keywords Paper