Adaptive mixed component LDA for low resource topic modeling

19/04/2021

Suzanna Sia, Kevin Duh

Abstract: Probabilistic topic models in low data resource scenarios are faced with less reliable estimates due to sparsity of discrete word co-occurrence counts, and do not have the luxury of retraining word or topic embeddings using neural methods. In this challenging resource constrained setting, we explore mixture models which interpolate between the discrete and continuous topic-word distributions that utilise pre-trained embeddings to improve topic coherence. We introduce an automatic trade-off between the discrete and continuous representations via an adaptive mixture coefficient, which places greater weight on the discrete representation when the corpus statistics are more reliable. The adaptive mixture coefficient takes into account global corpus statistics, and the uncertainty in each topic’s continuous distributions. Our approach outperforms the fully discrete, fully continuous, and static mixture model on topic coherence in low resource settings. We additionally demonstrate the generalisability of our method by extending it to handle multilingual document collections.
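The interpolation described in the abstract can be illustrated with a minimal sketch. The abstract does not give the form of the continuous distribution or of the adaptive coefficient, so everything below is an assumption made for illustration: the softmax over embedding dot products, the `adaptive_lambda` formula, the `scale` parameter, and all function names are hypothetical and are not the authors' method.

```python
import math


def normalize(weights):
    """Scale non-negative weights so they sum to 1."""
    total = sum(weights)
    return [w / total for w in weights]


def discrete_topic_word(counts, beta=0.01):
    """Count-based topic-word distribution with additive smoothing."""
    return normalize([c + beta for c in counts])


def continuous_topic_word(word_vectors, topic_vector):
    """Embedding-based topic-word distribution.

    Assumption: a softmax over dot products stands in for whatever
    continuous density the paper actually uses.
    """
    scores = [sum(a * b for a, b in zip(vec, topic_vector)) for vec in word_vectors]
    top = max(scores)  # subtract the max for numerical stability
    return normalize([math.exp(s - top) for s in scores])


def adaptive_lambda(n_tokens, topic_variance, scale=1000.0):
    """Hypothetical coefficient in (0, 1): weight on the discrete part.

    It grows with corpus size and with the uncertainty (variance) of the
    topic's continuous distribution. This formula is illustrative only.
    """
    evidence = n_tokens * (1.0 + topic_variance)
    return evidence / (evidence + scale)


def mix(p_discrete, p_continuous, lam):
    """Interpolate: lam * discrete + (1 - lam) * continuous."""
    return [lam * d + (1.0 - lam) * c for d, c in zip(p_discrete, p_continuous)]


# Toy vocabulary of three words with two-dimensional "pre-trained" vectors.
counts = [5, 0, 1]
word_vectors = [[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]]
p_disc = discrete_topic_word(counts)
p_cont = continuous_topic_word(word_vectors, topic_vector=[1.0, 0.2])
lam = adaptive_lambda(n_tokens=sum(counts), topic_variance=0.5)
p_mixed = mix(p_disc, p_cont, lam)
```

With only six observed tokens the coefficient stays close to 0, so the mixed distribution leans almost entirely on the embedding-based component. As the token count grows, the weight shifts toward the count-based component, which is the qualitative behaviour the abstract describes.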

The talk and the corresponding paper were published at the EACL 2021 virtual conference.

Similar Papers

25/07/2020

Copula guided neural topic modelling for short texts

Lihui Lin, Hongyu Jiang, Yanghui Rao

Keywords: short text modelling, Archimedean copulas, neural topic modelling, auto-encoding variational Bayes

8:46

19/04/2021

On the calibration and uncertainty of neural learning to rank models for conversational search

Gustavo Penha, Claudia Hauff

11:24

04/07/2020

Neural Mixed Counting Models for Dispersed Topic Discovery

Jiemin Wu, Yanghui Rao, Zusheng Zhang, Haoran Xie, Qing Li, Fu Lee Wang, Ziye Chen

Keywords: Dispersed Discovery, mining topics, Neural Models, Mixed models

10:29

03/05/2021

Heteroskedastic and Imbalanced Deep Learning with Adaptive Regularization

Kaidi Cao, Yining Chen, Junwei Lu, Nikos Arechiga, Adrien Gaidon, Tengyu Ma

Keywords: imbalanced learning, noise robust learning, deep learning

5:14

18/07/2021

Unbalanced minibatch Optimal Transport; applications to Domain Adaptation

Kilian Fatras, Thibault Séjourné, Rémi Flamary, Nicolas Courty

Keywords: Algorithms, Optimal Transport

4:57

12/07/2020

Self-Modulating Nonparametric Event-Tensor Factorization

Zheng Wang, Xinqi Chu, Shandian Zhe

Keywords: General Machine Learning Techniques

15:40

03/05/2021

Multi-Class Uncertainty Calibration via Mutual Information Maximization-based Binning

Kanil Patel, William H Beluch, Bin Yang, Michael Pfeiffer, Dan Zhang

Keywords: deep neural networks, histogram binning, post-hoc calibration, uncertainty calibration, mutual information

5:13

26/04/2020

Building Deep Equivariant Capsule Networks

Sai Raam Venkataraman, S. Balasubramanian, R. Raghunatha Sarma

Keywords: Capsule networks, equivariance

18:55

06/12/2021

Locally Valid and Discriminative Prediction Intervals for Deep Learning Models

Zhen Lin, Shubhendu Trivedi, Jimeng Sun

Keywords: deep learning

12:05

13/04/2021

Federated learning with compression: Unified analysis and sharp guarantees

Farzin Haddadpour, Mohammad Mahdi Kamani, Aryan Mokhtari, Mehrdad Mahdavi

3:03

13/04/2021

Latent derivative bayesian last layer networks

Joe Watson, Jihao Andreas Lin, Pascal Klink, Joni Pajarinen, Jan Peters

3:05

02/02/2021

DeepPseudo: Pseudo Value Based Deep Learning Models for Competing Risk Analysis

Md Mahmudur Rahman, Koji Matsuo, Shinya Matsuzaki, Sanjay Purushotham

18:31

02/02/2021

Longitudinal Deep Kernel Gaussian Process Regression

Junjie Liang, Yanting Wu, Dongkuan Xu, Vasant G Honavar

16:27

06/12/2021

Rectangular Flows for Manifold Learning

Anthony Caterini, Gabriel Loaiza-Ganem, Geoff Pleiss, John Cunningham

Keywords: deep learning, optimization, generative model

12:26

13/04/2021

Self-concordant analysis of generalized linear bandits with forgetting

Yoan Russac, Louis Faury, Olivier Cappé, Aurélien Garivier

3:06

12/07/2020

Efficient Policy Learning from Surrogate-Loss Classification Reductions

Andrew Bennett, Nathan Kallus

Keywords: Causality

14:19

06/12/2021

Bayesian decision-making under misspecified priors with applications to meta-learning

Max Simchowitz, Christopher Tosh, Akshay Krishnamurthy, Daniel Hsu, Thodoris Lykouris, Miro Dudik, Robert E Schapire

Keywords: meta learning, bandits

14:58

03/05/2021

Property Controllable Variational Autoencoder via Invertible Mutual Dependence

Xiaojie Guo, Yuanqi Du, Liang Zhao

Keywords: deep generative models, disentangled representation learning, interpretable latent representation

4:45

02/02/2021

How Does the Combined Risk Affect the Performance of Unsupervised Domain Adaptation Approaches?

Li Zhong, Zhen Fang, Feng Liu, Jie Lu, Bo Yuan, Guangquan Zhang

13:36

12/08/2020

PCKV: Locally Differentially Private Correlated Key-Value Data Collection with Optimized Utility

Xiaolan Gu, Ming Li, Yueqiang Cheng, Li Xiong, Yang Cao

12:32

18/07/2021

Learning Self-Modulating Attention in Continuous Time Space with Applications to Sequential Recommendation

Chao Chen, Haoyu Geng, Nianzu Yang, Junchi Yan, Daiyue Xue, Jianping Yu, Xiaokang Yang

Keywords: Algorithms, Ranking and Preference Learning

5:11

03/05/2021

Neural Topic Model via Optimal Transport

He Zhao, Dinh Phung, Viet Huynh, Trung Le, Wray Buntine

Keywords: optimal transport, document analysis, topic modelling

9:29

02/02/2021

Learning the Parameters of Bayesian Networks from Uncertain Data

Segev Wasserkrug, Radu Marinescu, Sergey Zeltyn, Evgeny Shindin, Yishai A Feldman

19:29

03/05/2021

Exploring the Uncertainty Properties of Neural Networks’ Implicit Priors in the Infinite-Width Limit

Ben Adlam, Jaehoon Lee, Lechao Xiao, Jeffrey Pennington, Jasper Snoek

Keywords: Deep Learning, Bayesian Neural Networks, Neural Network Gaussian Process, Infinite-Width Limit, Uncertainty, Gaussian Process

4:34

02/02/2021

Uncertainty-Aware Policy Optimization: A Robust, Adaptive Trust Region Approach

James Queeney, Ioannis Ch. Paschalidis, Christos G. Cassandras

16:52

16/11/2020

Multi-document Summarization with Maximal Marginal Relevance-guided Reinforcement Learning

Yuning Mao, Yanru Qu, Yiqing Xie, Xiang Ren, Jiawei Han

Keywords: single-document summarization, single-document sds, multi-document summarization, multi-document mds

10:58

06/12/2021

Derivative-Free Policy Optimization for Linear Risk-Sensitive and Robust Control Design: Implicit Regularization and Sample Complexity

Kaiqing Zhang, Xiangyuan Zhang, Bin Hu, Tamer Basar

Keywords: theory, optimization, reinforcement learning and planning

15:57

03/05/2021

Activation-level uncertainty in deep neural networks

Pablo Morales-Alvarez, Daniel Hernández-Lobato, Rafael Molina, José Miguel Hernández Lobato

Keywords: Gaussian Processes, Bayesian Neural Networks, Deep Gaussian Processes, Uncertainty estimation

6:53

26/04/2020

Identifying through Flows for Recovering Latent Representations

Shen Li, Bryan Hooi, Gim Hee Lee

Keywords: Representation learning, identifiable generative models, nonlinear-ICA

5:11

18/07/2021

Is Pessimism Provably Efficient for Offline RL?

Ying Jin, Zhuoran Yang, Zhaoran Wang

Keywords: Reinforcement Learning and Planning, Others

5:17

13/04/2021

Predictive complexity priors

Eric Nalisnick, Jonathan Gordon, Jose Miguel Hernandez-Lobato

3:05

19/10/2020

Distant supervision in BERT-based adhoc document retrieval

Koustav Rudra, Avishek Anand

Keywords: distant supervision, adhoc retrieval, document ranking

6:49

26/08/2020

Variational Autoencoders for Sparse and Overdispersed Discrete Data

He Zhao, Piyush Rai, Lan Du, Wray Buntine, Dinh Phung, Mingyuan Zhou

14:28

06/12/2021

Towards Sample-Optimal Compressive Phase Retrieval with Sparse and Generative Priors

Zhaoqiang Liu, Subhroshekhar Ghosh, Jonathan Scarlett

Keywords: theory, optimization, generative model

10:41

26/04/2020

Understanding the Limitations of Conditional Generative Models

Ethan Fetaya, Joern-Henrik Jacobsen, Will Grathwohl, Richard Zemel

Keywords: Conditional Generative Models, Generative Classifiers, Robustness, Adversarial Examples

4:46

06/12/2021

Meta-Learning Reliable Priors in the Function Space

Jonas Rothfuss, Dominique Heyn, Jinfan Chen, Andreas Krause

Keywords: optimization, meta learning, continual learning

15:13

26/04/2020

Adversarially Robust Representations with Smooth Encoders

Taylan Cemgil, Sumedh Ghaisas, Krishnamurthy (Dj) Dvijotham, Pushmeet Kohli

Keywords: Adversarial Learning, Robust Representations, Variational AutoEncoder, Wasserstein Distance, Variational Inference

5:16

06/12/2020

Depth Uncertainty in Neural Networks

Javier Antoran, James Allingham, Jose Miguel Hernández-Lobato

3:10

12/07/2020

Understanding the Curse of Horizon in Off-Policy Evaluation via Conditional Importance Sampling

Yao Liu, Pierre-Luc Bacon, Emma Brunskill

Keywords: Reinforcement Learning - Theory

14:45

06/12/2020

Bayesian Pseudocoresets

Dionysis Manousakas, Zuheng Xu, Cecilia Mascolo, Trevor Campbell

3:19