Pretrained Generalized Autoregressive Model with Adaptive Probabilistic Label Cluster for Extreme Multi-label Text Classification

12/07/2020

Pretrained Generalized Autoregressive Model with Adaptive Probabilistic Label Cluster for Extreme Multi-label Text Classification

Hui Ye, Zhiyu Chen, Da-Han Wang, Brian Davison

Keywords: Deep Learning - General

Abstract Paper Similar Papers

Abstract: Extreme multi-label text classification (XMTC) is a task for tagging a given text with the most relevant labels from an extremely large label set. We propose a novel deep learning method called APLC-XLNet. Our approach fine-tunes the recently released generalized autoregressive pretraining model (XLNet) to learn the dense representation for the input text. We propose the Adaptive Probabilistic Label Cluster (APLC) to approximate the cross entropy loss by exploiting the unbalanced label distribution to form clusters that explicitly reduce the computational time. Our experiments, carried out on five benchmark datasets, show that our approach significantly outperforms existing state-of-the-art methods. The code of our method will be released publicly at GitHub.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICML 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2020

Unsupervised Semantic Aggregation and Deformable Template Matching for Semi-Supervised Learning

Tao Han, Junyu Gao, Yuan Yuan, Qi Wang

Keywords Paper

0

0

0

0

3:22

06/12/2020

Unsupervised Text Generation by Learning from Search

Jingjing Li, Zichao Li, Lili Mou and
Xin Jiang, Michael Lyu, Irwin King

Keywords Paper

0

0

0

0

3:24

23/08/2020

Compositional embeddings using complementary partitions for memory-efficient recommendation systems

Hao-Jun Michael Shi, Dheevatsa Mudigere, Maxim Naumov, Jiyan Yang

Keywords Paper

embeddings, model compression, recommendation systems

0

0

0

0

16:14

05/01/2021

Unsupervised Domain Adaptation in Semantic Segmentation via Orthogonal and Clustered Embeddings

Marco Toldo, Umberto Michieli, Pietro Zanuttigh

Keywords Paper

0

0

0

0

4:59

07/09/2020

Learning Effectively from Noisy Supervision for Weakly Supervised Semantic Segmentation

Wenbin Xie, Qiaoqiao Wei, Zheng Li, Hui Zhang

Keywords Paper

Semantic Segmentation, Weakly Supervised Semantic Segmentation, Self Attention

0

0

0

0

3:46

04/07/2020

Cross-Lingual Unsupervised Sentiment Classification with Multi-View Transfer Learning

Hongliang Fei, Ping Li

Keywords Paper

Cross-Lingual Classification, sentiment classification, unsupervised system, classification

0

0

0

0

12:23

14/06/2020

Semantically Multi-Modal Image Synthesis

Zhen Zhu, Zhiliang Xu, Ansheng You, Xiang Bai

Keywords Paper

label-to-image, semantically multi-modal image synthesis, smis, groupdnet, group convolution, cg-norm

0

0

0

0

1:01

02/02/2021

Train a One-Million-Way Instance Classifier for Unsupervised Visual Representation Learning

Yu Liu, Lianghua Huang, Pan Pan and
Bin Wang, Yinghui Xu, Rong Jin

Keywords Paper

0

0

0

0

15:15

06/12/2021

HSVA: Hierarchical Semantic-Visual Adaptation for Zero-Shot Learning

Shiming Chen, Guosen Xie, Yang Liu and
Qinmu Peng, Baigui Sun, Hao Li, Xinge You, Ling Shao

Keywords Paper

generative model, domain adaptation

0

0

0

0

9:19

02/02/2021

Curriculum-Meta Learning for Order-Robust Continual Relation Extraction

Tongtong Wu, Xuekai Li, Yuan-Fang Li and
Gholamreza Haffari, Guilin Qi, Yujin Zhu, Guoqiang Xu

Keywords Paper

0

0

0

0

11:33

02/02/2021

LightXML: Transformer with Dynamic Negative Sampling for High-Performance Extreme Multi-label Text Classification

Ting Jiang, Deqing Wang, Leilei Sun and
Huayi Yang, Zhengyang Zhao, Fuzhen Zhuang

Keywords Paper

0

0

0

0

16:28

06/12/2020

Learning Black-Box Attackers with Transferable Priors and Query Feedback

Jiancheng YANG, Yangzhou Jiang, Xiaoyang Huang and
Bingbing Ni, Chenglong Zhao

Keywords Paper

0

0

0

0

3:22

19/08/2021

Correlation-Guided Representation for Multi-Label Text Classification

Qian-Wen Zhang, Ximing Zhang, Zhao Yan and
Ruifang Liu, Yunbo Cao, Min-Ling Zhang

Keywords Paper

Machine Learning, Multi-instance; Multi-label; Multi-view learning, Classification, Text Classification

0

0

0

0

11:13

18/11/2020

Bidirectional dependency-guided attention for relation extraction

Xingchen Deng, Lei Zhang, Yixing Fan and
Long Bai, Jiafeng Guo, Pengfei Wang

Keywords Paper

0

0

0

0

10:02

18/07/2021

Delving into Deep Imbalanced Regression

Yuzhe Yang, Kaiwen Zha, YINGCONG CHEN and
Hao Wang, Dina Katabi

Keywords Paper

Applications

0

0

0

0

16:37

06/12/2021

SSUL: Semantic Segmentation with Unknown Label for Exemplar-based Class-Incremental Learning

Sungmin Cha, beomyoung kim, YoungJoon Yoo, Taesup Moon

Keywords Paper

machine learning, vision

0

0

0

0

14:05

18/07/2021

Order-Agnostic Cross Entropy for Non-Autoregressive Machine Translation

Cunxiao Du, Zhaopeng Tu, Jing Jiang

Keywords Paper

Applications, Natural Language Processing

0

0

0

0

17:21

30/11/2020

Discrete Spatial Importance-Based Deep Weighted Hashing

Yang Shi, Xiushan Nie, Quan Zhou and
Xiaoming Xi, Yilong Yin

Keywords Paper

0

0

0

0

8:22

04/07/2020

Improving Image Captioning with Better Use of Caption

Zhan Shi, Xu Zhou, Xipeng Qiu, Xiaodan Zhu

Keywords Paper

Image Captioning, multimodal problem, natural processing, computer community

0

0

0

0

11:11

03/05/2021

Auxiliary Task Update Decomposition: The Good, the Bad and the Neutral

Lucio Dery, Yann Dauphin, David Grangier

Keywords Paper

multitask learning, deeplearning, pre-training, gradient decomposition

0

0

0

0

5:22

04/07/2020

Efficient Strategies for Hierarchical Text Classification: External Knowledge and Auxiliary Tasks

Kervy Rivas Rojas, Gina Bustamante, Arturo Oncevay, Marco Antonio Sobrevilla Cabezudo

Keywords Paper

Hierarchical Classification, External Tasks, sequence-to-sequence problem, auxiliary bottom-up-classification

0

0

0

0

5:44

06/12/2020

Learning Diverse and Discriminative Representations via the Principle of Maximal Coding Rate Reduction

Yaodong Yu, Ryan Chan, Chong You and
Chaobing Song, Yi Ma

Keywords Paper

0

0

0

0

3:20

06/12/2021

Align before Fuse: Vision and Language Representation Learning with Momentum Distillation

Junnan Li, Ramprasaath Selvaraju, Akhilesh Gotmare and
Shafiq Joty, Caiming Xiong, Steven Chu Hong Hoi

Keywords Paper

transformers, vision, representation learning

0

0

0

0

9:40

06/12/2021

Improving Contrastive Learning on Imbalanced Data via Open-World Sampling

Ziyu Jiang, Tianlong Chen, Ting Chen, Zhangyang Wang

Keywords Paper

contrastive learning

0

0

0

0

10:12

07/09/2020

Deep Metric Learning Meets Deep Clustering: An Novel Unsupervised Approach for Feature Embedding

Binh Nguyen, Binh Nguyen, Gustavo Carneiro and
Erman Tjiputra, Quang Tran, Thanh-Toan Do

Keywords Paper

unsupervised deep metric learning, unsupervised feature learning, unsupervised metric loss, negative mining, deep clustering, pseudo labels, reconstruction, centroid representations, retrieval, multi-task

0

0

0

0

6:18

12/07/2020

Differentiable Product Quantization for Learning Compact Embedding Layers

Ting Chen, Lala Li, Yizhou Sun

Keywords Paper

Applications - Language, Speech and Dialog

0

0

0

0

10:16

14/06/2020

Deep Semantic Clustering by Partition Confidence Maximisation

Jiabo Huang, Shaogang Gong, Xiatian Zhu

Keywords Paper

deep clustering, cluster separability, separability measurement, semantic plausibility

0

0

0

0

1:00

06/12/2020

Fewer is More: A Deep Graph Metric Learning Perspective Using Fewer Proxies

Yuehua Zhu, Muli Yang, Cheng Deng, Wei Liu

Keywords Paper

0

0

0

0

3:08

14/06/2020

Embedding Expansion: Augmentation in Embedding Space for Deep Metric Learning

Byungsoo Ko, Geonmo Gu

Keywords Paper

metric learning, image retrieval, image clustering, augmentation, sample generation, hard sample mining, pair-based loss, triplet loss, n-pair loss, multi-similarity loss

0

0

0

0

1:00

26/04/2020

Progressive learning and disentanglement of hierarchical representations

Zhiyuan Li, Jaideep Vitthal Murkute, Prashnna Kumar Gyawali, Linwei Wang

Keywords Paper

generative model, disentanglement, progressive learning, VAE

0

0

0

0

5:06

03/05/2021

Prototypical Contrastive Learning of Unsupervised Representations

Junnan Li, Pan Zhou, Caiming Xiong, Steven Hoi

Keywords Paper

self-supervised learning, unsupervised learning, representation learning, contrastive learning

0

0

0

0

4:51

14/06/2020

Auto-Encoding Twin-Bottleneck Hashing

Yuming Shen, Jie Qin, Jiaxin Chen and
Mengyang Yu, Li Liu, Fan Zhu, Fumin Shen, Ling Shao

Keywords Paper

image hashing, data retrieval, unsupervised learning, graph neural networks

0

0

0

0

1:00

02/02/2021

FILTER: An Enhanced Fusion Method for Cross-lingual Language Understanding

Yuwei Fang, Shuohang Wang, Zhe Gan and
Siqi Sun, Jingjing Liu

Keywords Paper

0

0

0

0

17:39

02/02/2021

Few-shot Learning for Multi-label Intent Detection

Yutai Hou, Yongkui Lai, Yushan Wu and
Wanxiang Che, Ting Liu

Keywords Paper

0

0

0

0

15:07

04/08/2021

Bounded Memory Active Learning through Enriched Queries

Max Hopkins, Daniel Kane, Shachar Lovett, Michal Moshkovitz

Keywords Paper

1

1

0

0

18:26

02/02/2021

Extending Multi-Sense Word Embedding to Phrases and Sentences for Unsupervised Semantic Applications

Haw-Shiuan Chang, Amol Agrawal, Andrew McCallum

Keywords Paper

0

0

0

0

19:44

12/07/2020

Variable-Bitrate Neural Compression via Bayesian Arithmetic Coding

Yibo Yang, Robert Bamler, Stephan Mandt

Keywords Paper

Deep Learning - General

0

0

0

0

15:08

03/05/2021

Autoregressive Entity Retrieval

Nicola De Cao, Gautier Izacard, Sebastian Riedel, Fabio Petroni

Keywords Paper

constrained beam search, entity disambiguation, end-to-end entity linking, entity linking, autoregressive language model, document retrieval, entity retrieval

0

0

0

0

10:14

02/02/2021

Adaptive Beam Search Decoding for Discrete Keyphrase Generation

Xiaoli Huang, Tongge Xu, Lvan Jiao and
Yueran Zu, Youmin Zhang

Keywords Paper

0

0

0

0

14:36

12/07/2020

Extreme Multi-label Classification from Aggregated Labels

Yanyao Shen, Hsiang-Fu Yu, Sujay Sanghavi, Inderjit Dhillon

Keywords Paper

Optimization - Large Scale, Parallel and Distributed

0

0

0

0

15:05