02/02/2021

LightXML: Transformer with Dynamic Negative Sampling for High-Performance Extreme Multi-label Text Classification

Ting Jiang, Deqing Wang, Leilei Sun, Huayi Yang, Zhengyang Zhao, Fuzhen Zhuang

Abstract: Extreme multi-label text classification (XMC) is the task of finding the most relevant labels from a very large label set. Deep learning-based methods have recently shown significant success in XMC. However, existing methods (e.g., AttentionXML and X-Transformer) still suffer from 1) combining several models to train and predict on a single dataset, and 2) sampling negative labels statically while training the label ranking model, both of which harm model performance and accuracy. To address these problems, we propose LightXML, which adopts end-to-end training and dynamic negative label sampling. LightXML uses GAN-like networks to recall and rank labels: the label recalling part generates candidate negative and positive labels, and the label ranking part distinguishes the positive labels from them. Because the candidates come from the recalling part, negative labels are sampled dynamically while the label ranking part is trained. By feeding the same text representation to both the label recalling and ranking parts, LightXML reaches high performance. Extensive experiments show that LightXML outperforms state-of-the-art methods on five extreme multi-label datasets with a much smaller model size and lower computational cost. In particular, on the Amazon dataset with 670K labels, LightXML reduces model size by up to 72% compared to AttentionXML. Our code is available at http://github.com/kongds/LightXML.
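To make the dynamic negative sampling idea from the abstract concrete, here is a minimal, self-contained PyTorch sketch. It assumes a toy label-to-group clustering and small dimensions; the names recall_head, label_emb, and candidates_for are illustrative stand-ins, not taken from the authors' implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)

num_labels, num_groups, hidden, topk = 64, 8, 32, 3
# Toy static assignment of each label to one group (in LightXML these come
# from label clustering; here the assignment is arbitrary for illustration).
label_to_group = torch.arange(num_labels) % num_groups

recall_head = nn.Linear(hidden, num_groups)   # label recalling part: scores groups
label_emb = nn.Embedding(num_labels, hidden)  # label ranking part: per-label weights

def candidates_for(text_repr, positives):
    """Return candidate labels: all labels in the currently top-scored groups.

    Non-positive candidates act as hard negatives, and they change every
    training step as recall_head improves, i.e. sampling is dynamic.
    """
    group_scores = recall_head(text_repr)
    top_groups = group_scores.topk(topk).indices
    mask = (label_to_group.unsqueeze(0) == top_groups.unsqueeze(1)).any(0)
    cand = torch.nonzero(mask).squeeze(1)
    # Always include the positives so the ranking loss can see them.
    return torch.unique(torch.cat([cand, positives]))

text_repr = torch.randn(hidden)               # stands in for a shared [CLS] embedding
positives = torch.tensor([3, 17])
cand = candidates_for(text_repr, positives)
rank_scores = label_emb(cand) @ text_repr     # rank positives vs. dynamic negatives
targets = torch.isin(cand, positives).float()
loss = F.binary_cross_entropy_with_logits(rank_scores, targets)
print(cand.shape[0], loss.item())
```

The key contrast with static sampling is that the negatives in cand are recomputed from the recall head's current top-k groups at every step, rather than being fixed once before training; both heads also read the same text representation, matching the shared-encoder design described in the abstract.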

The video of this talk cannot be embedded. You can watch it here:
https://slideslive.com/38948430
The talk and the respective paper are published at the AAAI 2021 virtual conference.

