Style-transfer and Paraphrase: Looking for a Sensible Semantic Similarity Metric

Abstract: The rapid development of such natural language processing tasks as style transfer, paraphrase, and machine translation often calls for the use of semantic similarity metrics. In recent years a lot of methods to measure the semantic similarity of two short texts were developed. This paper provides a comprehensive analysis for more than a dozen of such methods. Using a new dataset of fourteen thousand sentence pairs human-labeled according to their semantic similarity, we demonstrate that none of the metrics widely used in the literature is close enough to human judgment in these tasks. A number of recently proposed metrics provide comparable results, yet Word Mover Distance is shown to be the most reasonable solution to measure semantic similarity in reformulated texts at the moment.

Style-transfer and Paraphrase: Looking for a Sensible Semantic Similarity Metric

Ivan P. Yamshchikov, Viacheslav Shibaev, Nikolay Khlebnikov, Alexey Tikhonov

Comments

Similar Papers

Unsupervised extractive summarization using pointwise mutual information

Vishakh Padmakumar, He He

Keywords Abstract Paper

Automatically Paraphrasing via Sentence Reconstruction and Round-trip Translation

Zilu Guo, Zhongqiang Huang, Kenny Q. Zhu and Guandan Chen, Kaibo Zhang, Boxing Chen, Fei Huang

Keywords Abstract Paper

Natural Language Processing, Machine Translation, Natural Language Generation, NLP Applications and Tools

Unsupervised Multilingual Sentence Embeddings for Parallel Corpus Mining

Ivana Kvapilíková, Mikel Artetxe, Gorka Labaka and Eneko Agirre, Ondřej Bojar

Keywords Abstract Paper

Unsupervised Embeddings, Parallel Mining, multilingual embeddings, parallel tasks

Automatic Interlinear Glossing for Under-Resourced Languages Leveraging Translations

Xingyuan Zhao, Satoru Ozaki, Antonios Anastasopoulos and Graham Neubig, Lori Levin

Keywords Abstract Paper

Exemplification Modeling: Can You Give Me an Example, Please?

Edoardo Barba, Luigi Procopio, Caterina Lacerra and Tommaso Pasini, Roberto Navigli

Keywords Abstract Paper

Natural Language Processing, Natural Language Semantics, Resources and Evaluation

A Simple Approach to Learning Unsupervised Multilingual Embeddings

Pratik Jawanpuria, Mayank Meghwanshi, Bamdev Mishra

Keywords Abstract Paper

learning alignment, unsupervised alignment, bilingual induction, cross-lingual similarity

Fake it Till You Make it: Self-Supervised Semantic Shifts for Monolingual Word Embedding Tasks

Maurício Gruppi, Pin-Yu Chen, Sibel Adali

Keywords Abstract Paper

Does she wink or does she nod? A challenging benchmark for evaluating word understanding of language models

Lutfi Kerem Senel, Hinrich Schütze

Keywords Abstract Paper

Reformulating Unsupervised Style Transfer as Paraphrase Generation

Kalpesh Krishna, John Wieting, Mohit Iyyer

Keywords Abstract Paper

style transfer, attribute transfer, unsupervised transfer, paraphrase problem

Fine-grained relevance annotations for multi-task document ranking and question answering

Sebastian Hofstätter, Markus Zlabinger, Mete Sertkan and Michael Schröder, Allan Hanbury

Keywords Abstract Paper

relevance distribution, position bias, word-level relevance, fine-grained annotations

Multilingual AMR-to-Text Generation

Angela Fan, Claire Gardent

Keywords Abstract Paper

multilingual generation, cross-lingual embeddings, pretraining, multilingual models

Have We Solved The Hard Problem? It’s Not Easy! Contextual Lexical Contrast as a Means to Probe Neural Coherence

Wenqiang Lei, Yisong Miao, Runpeng Xie and Bonnie Webber, Meichun Liu, Tat-Seng Chua, Nancy F. Chen

Keywords Abstract Paper

Generating similes effortlessly like a Pro: A Style Transfer Approach for Simile Generation

Tuhin Chakrabarty, Smaranda Muresan, Nanyun Peng

Keywords Abstract Paper

human imagination, simile generation, mapping properties, sequence model

On Learning Language-Invariant Representations for Universal Machine Translation

Han Zhao, Junjie Hu, Andrej Risteski

Keywords Abstract Paper

Learning Theory

On Learning Universal Representations Across Languages

Xiangpeng Wei, Rongxiang Weng, Yue Hu and Luxi Xing, Heng Yu, Weihua Luo

Keywords Abstract Paper

hierarchical contrastive learning, cross-lingual pretraining, universal representation learning

Generating Senses and RoLes: An End-to-End Model for Dependency- and Span-based Semantic Role Labeling

Rexhina Blloshmi, Simone Conia, Rocco Tripodi, Roberto Navigli

Keywords Abstract Paper

Natural Language Processing, Natural Language Semantics, Natural Language Generation, Natural Language Processing

Explicit Semantic Decomposition for Definition Generation

Jiahuan Li, Yu Bao, Shujian Huang and Xinyu Dai, Jiajun Chen

Keywords Abstract Paper

Definition Generation, construction dictionaries, under-specific generation, Explicit Decomposition

Small but Mighty: New Benchmarks for Split and Rephrase

Li Zhang, Huaiyu Zhu, Siddhartha Brahma, Yunyao Li

Keywords Abstract Paper

text task, fine-grained evaluation, automatic process, rule-based model

A Novel Cascade Binary Tagging Framework for Relational Triple Extraction

Zhepei Wei, Jianlin Su, Yue Wang and Yuan Tian, Yi Chang

Keywords Abstract Paper

Relational Extraction, large-scale construction, overlapping problem, relational task

LNMap: Departures from Isomorphic Assumption in Bilingual Lexicon Induction Through Non-Linear Mapping in Latent Space

Tasnim Mohiuddin, M Saiful Bari, Shafiq Joty

Keywords Abstract Paper

bilingual induction, bilingual, bli, semi-supervised method

Are we Estimating or Guesstimating Translation Quality?

Keywords Paper

Zilu Guo, Zhongqiang Huang, Kenny Q. Zhu and
Guandan Chen, Kaibo Zhang, Boxing Chen, Fei Huang

Keywords Paper

Ivana Kvapilíková, Mikel Artetxe, Gorka Labaka and
Eneko Agirre, Ondřej Bojar

Keywords Paper

Xingyuan Zhao, Satoru Ozaki, Antonios Anastasopoulos and
Graham Neubig, Lori Levin

Keywords Paper

Edoardo Barba, Luigi Procopio, Caterina Lacerra and
Tommaso Pasini, Roberto Navigli

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Sebastian Hofstätter, Markus Zlabinger, Mete Sertkan and
Michael Schröder, Allan Hanbury

Keywords Paper

Keywords Paper

Wenqiang Lei, Yisong Miao, Runpeng Xie and
Bonnie Webber, Meichun Liu, Tat-Seng Chua, Nancy F. Chen

Keywords Paper

Keywords Paper

Keywords Paper

Xiangpeng Wei, Rongxiang Weng, Yue Hu and
Luxi Xing, Heng Yu, Weihua Luo

Keywords Paper

Keywords Paper

Jiahuan Li, Yu Bao, Shujian Huang and
Xinyu Dai, Jiajun Chen

Keywords Paper

Keywords Paper

Zhepei Wei, Jianlin Su, Yue Wang and
Yuan Tian, Yi Chang

Keywords Paper

Keywords Paper

Keywords Paper

Zhenyi Wang, Xiaoyang Wang, Bang An and
Dong Yu, Changyou Chen

Keywords Paper

Rong Zhang, Revanth Gangi Reddy, Md Arafat Sultan and
Vittorio Castelli, Anthony Ferritto, Radu Florian, Efsun Sarioglu Kayi, Salim Roukos, Avi Sil, Todd Ward

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Billy Chiu, Sunil Kumar Sahu, Neha Sengupta and
Derek Thomas, Mohammady Mahdy

Keywords Paper

Keywords Paper

Qian Liu, Bei Chen, Jian-Guang Lou and
Bin Zhou, Dongmei Zhang

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Zihao Fu, Bei Shi, Wai Lam and
Lidong Bing, Zhiyuan Liu

Keywords Paper

Dmitry Nikolaev, Ofir Arviv, Taelin Karidi and
Neta Kenneth, Veronika Mitnik, Lilja Maria Saeboe, Omri Abend

Keywords Paper