Extending Multi-Sense Word Embedding to Phrases and Sentences for Unsupervised Semantic Applications

Abstract: Most unsupervised NLP models represent each word with a single point or single region in semantic space, while the existing multi-sense word embeddings cannot represent longer word sequences like phrases or sentences. We propose a novel embedding method for a text sequence (a phrase or a sentence) where each sequence is represented by a distinct set of multi-mode codebook embeddings to capture different semantic facets of its meaning. The codebook embeddings can be viewed as the cluster centers which summarize the distribution of possibly co-occurring words in a pre-trained word embedding space. We introduce an end-to-end trainable neural model that directly predicts the set of cluster centers from the input text sequence during test time. Our experiments show that the per-sentence codebook embeddings significantly improve the performances in unsupervised sentence similarity and extractive summarization benchmarks. In phrase similarity experiments, we discover that the multi-facet embeddings provide an interpretable semantic representation but do not outperform the single-facet baseline.

02/02/2021

Extending Multi-Sense Word Embedding to Phrases and Sentences for Unsupervised Semantic Applications

Haw-Shiuan Chang, Amol Agrawal, Andrew McCallum

Comments

Similar Papers

A Supervised Multi-Head Self-Attention Network for Nested Named Entity Recognition

Yongxiu Xu, Heyan Huang, Chong Feng, Yue Hu

Keywords Abstract Paper

An Unsupervised Method for Learning Representations of Multi-word Expressions for Semantic Classification

Robert Vacareanu, Marco A. Valenzuela-Escárcega, Rebecca Sharp, Mihai Surdeanu

Keywords Abstract Paper

Pre-training Multilingual Neural Machine Translation by Leveraging Alignment Information

Zehui Lin, Xiao Pan, Mingxuan Wang and Xipeng Qiu, Jiangtao Feng, Hao Zhou, Lei Li

Keywords Abstract Paper

machine mt, mt, rich mt, universal model

Cross-Lingual Unsupervised Sentiment Classification with Multi-View Transfer Learning

Hongliang Fei, Ping Li

Keywords Abstract Paper

Cross-Lingual Classification, sentiment classification, unsupervised system, classification

A Novel Graph-based Multi-modal Fusion Encoder for Neural Machine Translation

Yongjing Yin, Fandong Meng, Jinsong Su and Chulun Zhou, Zhengyuan Yang, Jie Zhou, Jiebo Luo

Keywords Abstract Paper

Neural Translation, multi-modal learning, NMT, Graph-based Encoder

Guided Attention Network for Concept Extraction

Songtao Fang, Zhenya Huang, Ming He and Shiwei Tong, Xiaoqing Huang, Ye Liu, Jie Huang, Qi Liu

Keywords Abstract Paper

Data Mining, Information Retrieval, Mining Text, Web, Social Media

KEML: A Knowledge-Enriched Meta-Learning Framework for Lexical Relation Classification

Chengyu Wang, Minghui Qiu, Jun Huang, Xiaofeng He

Keywords Abstract Paper

Sequence-to-Sequence Learning with Latent Neural Grammars

Yoon Kim

Keywords Abstract Paper

deep learning

Neural Extractive Summarization with Hierarchical Attentive Heterogeneous Graph Network

Ruipeng Jia, Yanan Cao, Hengzhu Tang and Fang Fang, Cong Cao, Shi Wang

Keywords Abstract Paper

sentence-level summarization, node task, network mining, text summarization

A Neural Generative Model for Joint Learning Topics and Topic-Specific Word Embeddings

Lixing Zhu, Deyu Zhou, Yulan He

Keywords Abstract Paper

word evaluation, word disambiguation, sentiment classification, generative model

A Sequence-to-Set Network for Nested Named Entity Recognition

Zeqi Tan, Yongliang Shen, Shuai Zhang and Weiming Lu, Yueting Zhuang

Keywords Abstract Paper

Natural Language Processing, Information Extraction, Named Entities

NASE: Learning knowledge graph embedding for link prediction via neural architecture search

Xiaoyu Kou, Bingfeng Luo, Huang Hu, Yan Zhang

Keywords Abstract Paper

kg embedding, neural architecture search, knowledge graph

Named entity recognition in multi-level contexts

Yubo Chen, Chuhan Wu, Tao Qi and Zhigang Yuan, Yongfeng Huang

Keywords Abstract Paper

Cross-Domain Grouping and Alignment for Domain Adaptive Semantic Segmentation

Minsu Kim, Sunghun Joung, Seungryong Kim and JungIn Park, Ig-Jae Kim, Kwanghoon Sohn

Keywords Abstract Paper

Discourse-Aware Neural Extractive Text Summarization

Jiacheng Xu, Zhe Gan, Yu Cheng, Jingjing Liu

Keywords Abstract Paper

Discourse-Aware Summarization, document encoding, extractive selection, text models

Finding Universal Grammatical Relations in Multilingual BERT

Ethan A. Chi, John Hewitt, Christopher D. Manning

Keywords Abstract Paper

zero-shot transfer, Multilingual BERT, Multilingual mBERT, Multilingual

COD3S: Diverse Generation with Discrete Semantic Signatures

Nathaniel Weir, João Sedoc, Benjamin Van Durme

Keywords Abstract Paper

causal generation, cods, neural models, seqseqs

Learning to Tag OOV Tokens by Integrating Contextual Representation and Background Knowledge

Keqing He, Yuanmeng Yan, Weiran XU

Keywords Abstract Paper

slot tagging, Contextual Representation, Neural-based models, knowledge-enhanced model

Exploring Explainable Selection to Control Abstractive Summarization

Haonan Wang, Yang Gao, Yu Bai and Mirella Lapata, Heyan Huang

Keywords Abstract Paper

Discovering New Intents with Deep Aligned Clustering

Hanlei Zhang, Hua Xu, Ting-En Lin, Rui Lyu

Keywords Abstract Paper

Unsupervised Semantic Aggregation and Deformable Template Matching for Semi-Supervised Learning

Tao Han, Junyu Gao, Yuan Yuan, Qi Wang

Keywords Abstract Paper

Keywords Paper

Keywords Paper

Zehui Lin, Xiao Pan, Mingxuan Wang and
Xipeng Qiu, Jiangtao Feng, Hao Zhou, Lei Li

Keywords Paper

Keywords Paper

Yongjing Yin, Fandong Meng, Jinsong Su and
Chulun Zhou, Zhengyuan Yang, Jie Zhou, Jiebo Luo

Keywords Paper

Songtao Fang, Zhenya Huang, Ming He and
Shiwei Tong, Xiaoqing Huang, Ye Liu, Jie Huang, Qi Liu

Keywords Paper

Keywords Paper

Keywords Paper

Ruipeng Jia, Yanan Cao, Hengzhu Tang and
Fang Fang, Cong Cao, Shi Wang

Keywords Paper

Keywords Paper

Zeqi Tan, Yongliang Shen, Shuai Zhang and
Weiming Lu, Yueting Zhuang

Keywords Paper

Keywords Paper

Yubo Chen, Chuhan Wu, Tao Qi and
Zhigang Yuan, Yongfeng Huang

Keywords Paper

Minsu Kim, Sunghun Joung, Seungryong Kim and
JungIn Park, Ig-Jae Kim, Kwanghoon Sohn

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Haonan Wang, Yang Gao, Yu Bai and
Mirella Lapata, Heyan Huang

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Xisen Jin, Zhongyu Wei, Junyi Du and
Xiangyang Xue, Xiang Ren

Keywords Paper

Anne Lauscher, Ivan Vulić, Edoardo Maria Ponti and
Anna Korhonen, Goran Glavaš

Keywords Paper

Keywords Paper

Keywords Paper

Guozhong Li, Byron Choi, Jianliang Xu and
Sourav S Bhowmick, Kwok-Pan Chun, Grace Lai-Hung Wong

Keywords Paper

Keywords Paper

Ari Pakman, Yueqi Wang, Catalin Mitelut and
JinHyung Lee, Department of Statistics Liam Paninski

Keywords Paper

Jake Zhao Zhao, Mingfeng Ou, linji Xue and
Yunkai Cui, Sai Wu, Gang Chen

Keywords Paper

Jiarui Jin, Jiarui Qin, Yuchen Fang and
Kounianhua Du, Weinan Zhang, Yong Yu, Zheng Zhang, Alexander J. Smola

Keywords Paper

Keywords Paper

Xingdi Yuan, Tong Wang, Rui Meng and
Khushboo Thaker, Peter Brusilovsky, Daqing He, Adam Trischler

Keywords Paper

Jinming Zhao, Ming Liu, Longxiang Gao and
Yuan Jin, Lan Du, He Zhao, He Zhang, Gholamreza Haffari

Keywords Paper

Keywords Paper