06/12/2020

Heavy-tailed Representations, Text Polarity Classification & Data Augmentation

Hamid Jalalzai, Pierre Colombo, Chloé Clavel, Eric Gaussier, Giovanna Varni, Emmanuel Vignon, Anne Sabourin

Keywords:

Abstract: The dominant approaches to text representation in natural language rely on learning embeddings on massive corpora, which have convenient properties such as compositionality and distance preservation. In this paper, we develop a novel method to learn a heavy-tailed embedding with desirable regularity properties regarding the distributional tails, which makes it possible to analyze points far away from the distribution bulk using the framework of multivariate extreme value theory. In particular, a classifier dedicated to the tails of the proposed embedding is obtained, which exhibits a scale-invariance property exploited in a novel text generation method for label-preserving dataset augmentation. Experiments on synthetic and real text data show the relevance of the proposed framework and confirm that this method generates meaningful sentences with controllable attributes, e.g. positive or negative sentiment.
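The following is a minimal sketch, not the authors' implementation, of the scale-invariance idea mentioned in the abstract: a classifier that depends only on the angular component x / ||x|| of a heavy-tailed embedding gives the same prediction when a tail point is rescaled, so rescaling in embedding space can serve as a label-preserving augmentation. All names here (`AngularClassifier`, `angular_features`, the Pareto toy data) are hypothetical illustrations, not artifacts of the paper.

```python
# Sketch under stated assumptions: toy heavy-tailed "embeddings" and a
# classifier acting only on the angle, hence invariant to positive rescaling.
import numpy as np

rng = np.random.default_rng(0)

def angular_features(x):
    """Project embeddings onto the unit sphere (keep only the angular part)."""
    norms = np.linalg.norm(x, axis=-1, keepdims=True)
    return x / np.clip(norms, 1e-12, None)

class AngularClassifier:
    """Toy linear classifier on the angular component x / ||x||."""
    def __init__(self, dim):
        self.w = rng.normal(size=dim)  # hypothetical, untrained weights
    def predict(self, x):
        theta = angular_features(x)
        return (theta @ self.w > 0).astype(int)

# Heavy-tailed toy data: Pareto-distributed radius times a random direction.
dim, n = 8, 1000
directions = angular_features(rng.normal(size=(n, dim)))
radii = rng.pareto(a=2.0, size=(n, 1)) + 1.0
X = radii * directions

clf = AngularClassifier(dim)

# Restrict to the tail region (largest norms) and rescale by lambda >= 1:
# predictions are unchanged, which is the invariance exploited for
# label-preserving augmentation in the paper's setting.
norms = np.linalg.norm(X, axis=1)
tail = X[norms > np.quantile(norms, 0.9)]
lam = 3.0
print((clf.predict(tail) == clf.predict(lam * tail)).all())  # True
```

In the paper's pipeline the rescaled tail embedding would then be decoded back into a sentence with the same polarity label; this sketch only illustrates why the label is preserved under rescaling.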

Embedded video: the talk and the corresponding paper were published at the NeurIPS 2020 virtual conference.
