Graph regularization for multi-lingual topic models

25/07/2020

Graph regularization for multi-lingual topic models

Arnav Kumar Jain, Gundeep Arora, Rahul Agrawal

Keywords: cross-lingual information retrieval, graph regularization, topic models

Abstract Paper Similar Papers

Abstract: Unsupervised multi-lingual language modeling has gained attraction in the last few years and poly-lingual topic models provide a mechanism to learn aligned document representations. However, training such models require translation-aligned data across languages, which is not always available. Also, in case of short texts like tweets, search queries, etc, the training of topic models continues to be a challenge. In this work, we present a novel strategy of creating a pseudo-parallel dataset followed by training topic models for sponsored search retrieval, that also mitigates the short text challenge. Our data augmentation strategy leverages easily available bipartite click-though graph that allows us to draw similar documents in different languages. The proposed methodology is evaluated on sponsored search system whose performance is measured on correctly matching the user intent, presented via the query, with ads provided by the advertiser. Our experiments substantiate the goodness of the method on EuroParl dataset and live search-engine traffic.

The video of this talk cannot be embedded. You can watch it here:

https://dl.acm.org/doi/10.1145/3397271.3401231#sec-supp

(Link will open in new window)

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at SIGIR 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

16/11/2020

An Imitation Game for Learning Semantic Parsers from User Interaction

Ziyu Yao, Yiqi Tang, Wen-tau Yih and
Huan Sun, Yu Su

Keywords Paper

bootstrapping, fine-tuning parsers, theoretical analysis, text-to-sql problem

0

0

0

0

11:49

16/11/2020

APE: Argument Pair Extraction from Peer Review and Rebuttal via Multi-task Learning

Liying Cheng, Lidong Bing, Qian Yu and
Wei Lu, Luo Si

Keywords Paper

peer review, argument task, sequence task, text task

0

0

0

0

11:05

04/07/2020

From Zero to Hero: Human-In-The-Loop Entity Linking in Low Resource Domains

Jan-Christoph Klie, Richard Eckart de Castilho, Iryna Gurevych

Keywords Paper

Human-In-The-Loop Linking, Entity linking, disambiguating mentions, annotation process

0

0

0

0

12:26

02/02/2021

EQG-RACE: Examination-Type Question Generation

Xin Jia, Wenjie Zhou, Xu Sun, Yunfang Wu

Keywords Paper

0

0

0

0

14:41

08/12/2020

A Graph Representation of Semi-structured Data for Web Question Answering

Xingyao Zhang, Linjun Shou, Jian Pei and
Ming Gong, Lijie Wen, Daxin Jiang

Keywords Paper

0

0

0

0

13:03

05/01/2021

TranstextNet: Transducing Text for Recognizing Unseen Visual Relationships

Gal S. Kenigsfield, Ran El-Yaniv

Keywords Paper

0

0

0

0

5:00

02/02/2021

An Unsupervised Sampling Approach for Image-Sentence Matching Using Document-level Structural Information

Zejun Li, Zhongyu Wei, Zhihao Fan and
Haijun Shan, Xuanjing Huang

Keywords Paper

0

0

0

0

18:39

04/07/2020

Document Modeling with Graph Attention Networks for Multi-grained Machine Reading Comprehension

Bo Zheng, Haoyang Wen, Yaobo Liang and
Nan Duan, Wanxiang Che, Daxin Jiang, Ming Zhou, Ting Liu

Keywords Paper

Document Modeling, Multi-grained Comprehension, machine comprehension, Graph Networks

0

0

0

0

10:51

19/10/2020

Enhance prototypical network with text descriptions for few-shot relation classification

Kaijia Yang, Nantao Zheng, Xinyu Dai and
Liang He, Shujian Huang, Jiajun Chen

Keywords Paper

text description, relation extraction, few shot

0

0

0

0

6:55

19/04/2021

Cross-lingual contextualized topic models with zero-shot learning

Federico Bianchi, Silvia Terragni, Dirk Hovy and
Debora Nozza, Elisabetta Fersini

Keywords Paper

0

0

0

0

6:36

05/01/2021

ChartOCR: Data Extraction From Charts Images via a Deep Hybrid Framework

Junyu Luo, Zekun Li, Jinpeng Wang, Chin-Yew Lin

Keywords Paper

0

0

0

0

4:58

04/07/2020

Enhancing Cross-target Stance Detection with Transferable Semantic-Emotion Knowledge

Bowen Zhang, Min Yang, Xutao Li and
Yunming Ye, Xiaofei Xu, Kuai Dai

Keywords Paper

Cross-target Detection, Stance detection, knowledge transfer, stance classifier

0

0

0

0

11:57

14/09/2020

Inductive Unsupervised Domain Adaptation for Few-Shot Classification via Clustering

Xin Cong, Bowen Yu, Tingwen Liu and
Shiyao Cui, Hengzhu Tang, Bin Wang

Keywords Paper

few-shot classification, domain adaptation, clustering

0

0

0

0

13:29

14/06/2020

Weakly Supervised Visual Semantic Parsing

Alireza Zareian, Svebor Karaman, Shih-Fu Chang

Keywords Paper

scene understanding, scene graph generation, weakly supervised learning, semantic parsing, graph neural networks, visual reasoning

0

0

0

0

5:00

14/06/2020

Webly Supervised Knowledge Embedding Model for Visual Reasoning

Wenbo Zheng, Lan Yan, Chao Gou, Fei-Yue Wang

Keywords Paper

visual reasoning, webly supervised learning

0

0

0

0

1:01

16/11/2020

Q-learning with Language Model for Edit-based Unsupervised Summarization

Ryosuke Kohita, Akifumi Wachi, Yang Zhao, Ryuki Tachibana

Keywords Paper

abstractive textsummarization, unsupervised summarization, unsupervised summarizers, unsupervised methods

0

0

0

0

12:32

02/02/2021

Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps

Qi Zhu, Chenyu Gao, Peng Wang, Qi Wu

Keywords Paper

0

0

0

0

15:58

04/07/2020

TransS-Driven Joint Learning Architecture for Implicit Discourse Relation Recognition

Ruifang He, Jian Wang, Fengyu Guo, Yugui Han

Keywords Paper

Implicit Recognition, discourse understanding, TransS-Driven Architecture, multi-level encoder

0

0

0

0

11:42

02/02/2021

Dual-level Collaborative Transformer for Image Captioning

Yunpeng Luo, Jiayi Ji, Xiaoshuai Sun and
Liujuan Cao, Yongjian Wu, Feiyue Huang, Chia-Wen Lin, Rongrong Ji

Keywords Paper

0

0

0

0

14:58

04/07/2020

Cross-modal Language Generation using Pivot Stabilization for Web-scale Language Coverage

Ashish V. Thapliyal, Radu Soricut

Keywords Paper

Cross-modal Generation, Web-scale Coverage, Cross-modal tasks, Pivot Stabilization

0

0

0

0

11:43

14/06/2020

Learning Meta Face Recognition in Unseen Domains

Jianzhu Guo, Xiangyu Zhu, Chenxu Zhao and
Dong Cao, Zhen Lei, Stan Z. Li

Keywords Paper

face recognition, meta learning, domain generalization, metric learning

0

0

0

0

5:01

07/09/2020

BriNet: Towards Bridging the Intra-class and Inter-class Gaps in One-Shot Segmentation

Xianghui Yang, Bairun Wang, Xinchi Zhou and
Kaige Chen, Shuai Yi, Wanli Ouyang, Luping Zhou

Keywords Paper

Few-shot Semantic Segmentation, Few-shot learning, Semantic Segmentation

0

0

0

0

8:26

06/12/2021

Weak-shot Fine-grained Classification via Similarity Transfer

Junjie Chen, Li Niu, Liu Liu, Liqing Zhang

Keywords Paper

machine learning, transfer learning

0

0

0

0

7:01

05/01/2021

ClassMix: Segmentation-Based Data Augmentation for Semi-Supervised Learning

Viktor Olsson, Wilhelm Tranheden, Juliano Pinto, Lennart Svensson

Keywords Paper

0

0

0

0

4:58

16/11/2020

KGPT: Knowledge-Grounded Pre-Training for Data-to-Text Generation

Wenhu Chen, Yu Su, Xifeng Yan, William Yang Wang

Keywords Paper

data-to-text generation, data-to-text tasks, fully-supervised setting, pre-training learning

0

0

0

0

11:10

02/02/2021

GraphMSE: Efficient Meta-path Selection in Semantically Aligned Feature Space for Graph Neural Networks

Yi Li, Yilun Jin, Guojie Song and
Zihao Zhu, Chuan Shi, Yiming Wang

Keywords Paper

0

0

0

0

15:25

19/10/2020

Zero-shot heterogeneous transfer learning from recommender systems to cold-start search retrieval

Tao Wu, Ellie Ka-In Chio, Heng-Tze Cheng and
Yu Du, Steffen Rendle, Dima Kuzmin, Ritesh Agarwal, Li Zhang, John Anderson, Sarvjeet Singh, Tushar Chandra, Ed H. Chi, Wen Li, Ankit Kumar, Xiang Ma, Alex Soares, Nitin Jindal, Pei Cao

Keywords Paper

search, recommender systems, zero-shot learning, transfer learning

0

0

0

0

9:53

05/01/2021

RNNP: A Robust Few-Shot Learning Approach

Pratik Mazumder, Pravendra Singh, Vinay P. Namboodiri

Keywords Paper

0

0

0

0

4:50

02/02/2021

A Graph-based Relevance Matching Model for Ad-hoc Retrieval

Yufeng Zhang, Jinghao Zhang, Zeyu Cui and
Shu Wu, Liang Wang

Keywords Paper

0

0

0

0

13:11

06/12/2021

Environment Generation for Zero-Shot Compositional Reinforcement Learning

Izzeddin Gur, Natasha Jaques, Yingjie Miao and
Jongwook Choi, Manoj Tiwari, Honglak Lee, Aleksandra Faust

Keywords Paper

reinforcement learning and planning, robustness, graph learning

0

0

0

0

8:40

19/04/2021

TrNews: Heterogeneous user-interest transfer learning for news recommendation

Guangneng Hu, Qiang Yang

Keywords Paper

0

0

0

0

11:51

14/09/2020

GRAM-SMOT: Top-N Personalized Bundle Recommendation via Graph Attention Mechanism and Sub-Modular Optimization

Vijaikumar M, Shirish Shevade, M Narasimha Murty

Keywords Paper

personalized bundle recommendation, graph attention mechanism, submodular optimization

0

0

0

0

14:50

16/11/2020

Learn to Cross-lingual Transfer with Meta Graph Learning Across Heterogeneous Languages

Zheng Li, Mukul Kumar, William Headden and
Bing Yin, Ying Wei, Yu Zhang, Qiang Yang

Keywords Paper

cross-lingual transfer, clt task, multilingual, mplm

0

0

0

0

11:49

02/02/2021

Train a One-Million-Way Instance Classifier for Unsupervised Visual Representation Learning

Yu Liu, Lianghua Huang, Pan Pan and
Bin Wang, Yinghui Xu, Rong Jin

Keywords Paper

0

0

0

0

15:15

04/07/2020

Automatic Generation of Citation Texts in Scholarly Papers: A Pilot Study

Xinyu Xing, Xiaosheng Fan, Xiaojun Wan

Keywords Paper

Automatic Texts, citation task, citation generation, automatically texts

0

0

0

0

10:01

19/08/2021

TextGTL: Graph-based Transductive Learning for Semi-supervised Text Classification via Structure-Sensitive Interpolation

Chen Li, Xutan Peng, Hao Peng and
Jianxin Li, Lihong Wang

Keywords Paper

Machine Learning, Semi-Supervised Learning, Mining Graphs, Semi Structured Data, Complex Data

0

0

0

0

13:14

19/10/2020

AutoADR: Automatic model design for ad relevance

Yiren Chen, Yaming Yang, Hong Sun and
Yujing Wang, Yu Xu, Wei Shen, Rong Zhou, Yunhai Tong, Jing Bai, Ruofei Zhang

Keywords Paper

neural architecture search, knowledge distillation, ad relevance

0

0

0

0

9:24

19/10/2020

Intent-driven similarity in e-commerce listings

Gilad Fuchs, Yoni Acriche, Idan Hasson, Pavel Petrov

Keywords Paper

machine learning, e-commerce, sentence similarity

0

0

0

0

9:57

25/07/2020

Learning to transfer graph embeddings for inductive graph based recommendation

Le Wu, Yonghui Yang, Lei Chen and
Defu Lian, Richang Hong, Meng Wang

Keywords Paper

graph neural network, content based recommendation, inductive graph learning

0

0

0

0

15:15

04/07/2020

Jointly Learning to Align and Summarize for Neural Cross-Lingual Summarization

Yue Cao, Hui Liu, Xiaojun Wan

Keywords Paper

Neural Summarization, Cross-lingual summarization, cross-lingual training, pipeline methods

0

0

0

0

9:30