Cross-Media Keyphrase Prediction: A Unified Framework with Multi-Modality Multi-Head Attention and Image Wordings

16/11/2020

Cross-Media Keyphrase Prediction: A Unified Framework with Multi-Modality Multi-Head Attention and Image Wordings

Yue Wang, Jing Li, Michael Lyu, Irwin King

Keywords: keyphrase prediction, text modeling, classification, generation

Abstract Paper Similar Papers

Abstract: Social media produces large amounts of contents every day. To help users quickly capture what they need, keyphrase prediction is receiving a growing attention. Nevertheless, most prior efforts focus on text modeling, largely ignoring the rich features embedded in the matching images. In this work, we explore the joint effects of texts and images in predicting the keyphrases for a multimedia post. To better align social media style texts and images, we propose: (1) a novel Multi-Modality MultiHead Attention (M3H-Att) to capture the intricate cross-media interactions; (2) image wordings, in forms of optical characters and image attributes, to bridge the two modalities. Moreover, we design a unified framework to leverage the outputs of keyphrase classification and generation and couple their advantages. Extensive experiments on a large-scale dataset newly collected from Twitter show that our model significantly outperforms the previous state of the art based on traditional attention mechanisms. Further analyses show that our multi-head attention is able to attend information from various aspects and boost classification or generation in diverse scenarios.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at EMNLP 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

14/06/2020

Multimodal Categorization of Crisis Events in Social Media

Mahdi Abavisani, Liwei Wu, Shengli Hu and
Joel Tetreault, Alejandro Jaimes

Keywords Paper

multimodal learning, multimodal categorization, cross-attention, stochastic shared embedding, event detection, social media, image-text fusion, ai for social goods, language and vision, emergency response

0

0

0

0

1:01

04/07/2020

Improving Multimodal Named Entity Recognition via Entity Span Detection with Unified Multimodal Transformer

Jianfei Yu, Jing Jiang, Li Yang, Rui Xia

Keywords Paper

Multimodal Recognition, Multimodal MNER, Multimodal, MNER

1

0

0

0

12:52

23/08/2020

SimClusters: Community-based representations for heterogeneous recommendations at twitter

Venu Satuluri, Yao Wu, Xun Zheng and
Yilei Qian, Brian Wichers, Qieyun Dai, Gui Ming Tang, Jerry Jiang, Jimmy Lin

Keywords Paper

community detection, personalization, recommender systems

0

0

0

0

4:55

04/07/2020

Relational Graph Attention Network for Aspect-based Sentiment Analysis

Kai Wang, Weizhou Shen, Yunyi Yang and
Xiaojun Quan, Rui Wang

Keywords Paper

Aspect-based Analysis, encoding information, sentiment prediction, Relational Network

0

0

0

0

6:56

19/10/2020

Event-driven network for cross-modal retrieval

Zhixiong Zeng, Nan Xu, Wenji Mao

Keywords Paper

cross-modal retrieval, event embedding, text representation

0

0

0

0

5:59

25/07/2020

Social media user geolocation via hybrid attention

Cheng Zheng, Jyun-Yu Jiang, Yichao Zhou and
Sean D. Young, Wei Wang

Keywords Paper

attention mechanism, interpretability, graph attention, social media user geolocation, hierarchical structure

0

0

0

0

7:20

16/11/2020

Online Conversation Disentanglement with Pointer Networks

Tao Yu, Shafiq Joty

Keywords Paper

conversation disentanglement, generalization, time-consuming engineering, disentanglement

0

0

0

0

11:26

14/06/2020

Fine-Grained Video-Text Retrieval With Hierarchical Graph Reasoning

Shizhe Chen, Yida Zhao, Qin Jin, Qi Wu

Keywords Paper

video-text retrieval, cross-modal matching, graph neural network

0

0

0

0

1:01

02/02/2021

Twitter Event Summarization by Exploiting Semantic Terms and Graph Network

Quanzhi Li, Qiong Zhang

Keywords Paper

0

0

0

0

15:58

25/07/2020

Transfer learning via contextual invariants for one-to-many cross-domain recommendation

Adit Krishnan, Mahashweta Das, Mangesh Bendre and
Hao Yang, Hari Sundaram

Keywords Paper

data sparsity, transfer learning, cross-domain recommendation, contextual invariants, neural layer adaptation

0

0

0

0

19:24

16/11/2020

HENIN: Learning Heterogeneous Neural Interaction Networks for Explainable Cyberbullying Detection on Social Media

Hsin-Yu Chen, Cheng-Te Li

Keywords Paper

computational cyberbullying, text sessions, model explainability, explainable detection

0

0

0

0

10:23

19/08/2021

Layer-Assisted Neural Topic Modeling over Document Networks

Yiming Wang, Ximing Li, Jihong Ouyang

Keywords Paper

Machine Learning, Learning Graphical Models, Bayesian Networks, Graphical Models

0

0

0

0

12:18

06/12/2021

Distilling Meta Knowledge on Heterogeneous Graph for Illicit Drug Trafficker Detection on Social Media

Yiyue Qian, Yiming Zhang, Yanfang (Fa Ye, Chuxu Zhang

Keywords Paper

deep learning, optimization, graph learning, meta learning, representation learning, few shot learning

0

0

0

0

14:10

07/06/2021

Coordinated Behavior on Social Media in 2019 UK General Election

Leonardo Nizzoli, Serena Tardelli, Marco Avvenuti and
Stefano Cresci, Maurizio Tesconi

Keywords Paper

Qualitative and quantitative studies of social media, Social network analysis, communities identification, expertise and authority discovery, Organizational and group behavior mediated by social media, interpersonal communication mediated by social media

0

0

0

0

7:46

19/08/2021

Exploring Periodicity and Interactivity in Multi-Interest Framework for Sequential Recommendation

Gaode Chen, Xinghua Zhang, Yanyan Zhao and
Cong Xue, Ji Xiang

Keywords Paper

Data Mining, Recommender Systems, Big Data, Large-Scale Systems, Recommender Systems

0

0

0

0

15:20

07/06/2020

Variation across Scales: Measurement Fidelity under Twitter Data Sampling

Siqi Wu, Marian-Andrei Rizoiu, Lexing Xie

Keywords Paper

attention, bias, cascades, changes, collection, graphs, influences, measures, networks, rates, structure, terms, tweets, twitter

0

0

0

0

9:59

06/12/2021

Align before Fuse: Vision and Language Representation Learning with Momentum Distillation

Junnan Li, Ramprasaath Selvaraju, Akhilesh Gotmare and
Shafiq Joty, Caiming Xiong, Steven Chu Hong Hoi

Keywords Paper

transformers, vision, representation learning

0

0

0

0

9:40

19/10/2020

Relevance ranking for real-time tweet search

Yan Xia, Yu Sun, Tian Wang and
Juan Caicedo Carvajal, Jinliang Fan, Bhargav Mangipudi, Lisa Huang, Yatharth Saraf

Keywords Paper

tweet search, social network, large-scale ml system

0

0

0

0

9:18

04/11/2020

FlightTracker: Consistency across Read-Optimized Online Stores at Facebook

Xiao Shi, Scott Pruett, Kevin Doherty and
Jinyu Han, Dmitri Petrov, Jim Carrig, John Hugg, Nathan Bronson

Keywords Paper

0

0

0

0

17:57

16/11/2020

KGPT: Knowledge-Grounded Pre-Training for Data-to-Text Generation

Wenhu Chen, Yu Su, Xifeng Yan, William Yang Wang

Keywords Paper

data-to-text generation, data-to-text tasks, fully-supervised setting, pre-training learning

0

0

0

0

11:10

02/02/2021

Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps

Qi Zhu, Chenyu Gao, Peng Wang, Qi Wu

Keywords Paper

0

0

0

0

15:58

07/06/2020

Pie Chart or Pizza: Identifying Chart Types and their Virality on Twitter

Pavlos Vougiouklis, Leslie Carr, Elena Simperl

Keywords Paper

classification, graphs, images, images shared, networks, predictions, shared, twitter, types

0

0

0

0

12:38

07/06/2020

Top Comment or Flop Comment? Predicting and Explaining User Engagement in Online News Discussions

Julian Risch, Ralf Krestel

Keywords Paper

articles, classifiers, discussions, engagement, influences, networks, news, news articles, predictions, texts, traditional, words

0

0

0

0

10:25

07/06/2021

CrisisBench: Benchmarking Crisis-related Social Media Datasets for Humanitarian Information Processing

Firoj Alam, Hassan Sajjad, Muhammad Imran, Ferda Ofli

Keywords Paper

Qualitative and quantitative studies of social media, Text categorization, topic recognition, demographic/gender/age identification, Measuring predictability of real world phenomena based on social media, e.g., spanning politics, finance, and health

0

0

0

0

3:06

07/06/2020

An Experimental Study of Structural Diversity in Social Networks

Jessica Su, Krishna Kamath, Aneesh Sharma and
Johan Ugander, Sharad Goel

Keywords Paper

cases, causal, changes, common, engagement, groups, large_scale, networks, rates, relationships, retention rates, twitter

0

0

0

0

8:44

07/06/2021

HumAID: Human-Annotated Disaster Incidents Data from Twitter with Deep Learning Benchmarks

Firoj Alam, Umair Qazi, Muhammad Imran, Ferda Ofli

Keywords Paper

Qualitative and quantitative studies of social media, Text categorization, topic recognition, demographic/gender/age identification, Measuring predictability of real world phenomena based on social media, e.g., spanning politics, finance, and health

0

0

0

0

3:11

14/06/2020

Webly Supervised Knowledge Embedding Model for Visual Reasoning

Wenbo Zheng, Lan Yan, Chao Gou, Fei-Yue Wang

Keywords Paper

visual reasoning, webly supervised learning

0

0

0

0

1:01

04/07/2020

Graph Neural News Recommendation with Unsupervised Preference Disentanglement

Linmei Hu, Siyong Xu, Chen Li and
Cheng Yang, Chuan Shi, Nan Duan, Xing Xie, Ming Zhou

Keywords Paper

Graph Recommendation, Unsupervised Disentanglement, personalized recommendation, recommendation

0

0

0

0

10:55

04/07/2020

MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning

Jie Lei, Liwei Wang, Yelong Shen and
Dong Yu, Tamara Berg, Mohit Bansal

Keywords Paper

Coherent Captioning, Generating descriptions, captioning tasks, coherent generation

0

0

0

0

10:51

23/08/2020

FreeDOM: A transferable neural architecture for structured information extraction on web documents

Bill Yuchen Lin, Ying Sheng, Nguyen Vo, Sandeep Tata

Keywords Paper

structured data extraction, web information extraction

0

0

0

0

17:39

04/07/2020

Document-Level Event Role Filler Extraction using Multi-Granularity Contextualized Encoding

Xinya Du, Claire Cardie

Keywords Paper

Document-Level Extraction, event extraction, extraction decisions, Multi-Granularity Encoding

0

0

0

0

10:27

14/06/2020

Cross-Modal Cross-Domain Moment Alignment Network for Person Search

Ya Jing, Wei Wang, Liang Wang, Tieniu Tan

Keywords Paper

cross-domain adaptation, text-based person search, moment alignment network, cross-modal retrieval, unsupervised learning

0

0

0

0

1:01

19/08/2021

Dig into Multi-modal Cues for Video Retrieval with Hierarchical Alignment

Wenzhe Wang, Mengdan Zhang, Runnan Chen and
Guanyu Cai, Penghao Zhou, Pai Peng, Xiaowei Guo, Jian Wu, Xing Sun

Keywords Paper

Computer Vision, Language and Vision, Deep Learning

0

0

0

0

9:07

14/06/2020

Iterative Answer Prediction With Pointer-Augmented Multimodal Transformers for TextVQA

Ronghang Hu, Amanpreet Singh, Trevor Darrell, Marcus Rohrbach

Keywords Paper

textvqa, visual question answering, vqa, vision and language, st-vqa, ocr-vqa, transformer, pointer network, ocr

0

0

0

0

4:56

16/11/2020

Named Entity Recognition for Social Media Texts with Semantic Augmentation

Yuyang Nie, Yuanhe Tian, Xiang Wan and
Yan Song, Bo Dai

Keywords Paper

named recognition, data problems, semantic augmentation, pre-trained embeddings

0

0

0

0

6:20

16/11/2020

Supertagging Combinatory Categorial Grammar with Attentive Graph Convolutional Networks

Yuanhe Tian, Yan Song, Fei Xia

Keywords Paper

supertagging, combinatory parsing, neural supertagging, parsing

0

0

0

0

6:53

07/06/2020

MimicProp: Learning to Incorporate Lexicon Knowledge into Distributed Word Representation for Social Media Analysis

Muheng Yan, Yu-Ru Lin, Rebecca Hwa and
Ali Mert Ertugrul, Meiqi Guo, Wen-Ting Chung

Keywords Paper

classification, embeddings, impact, learning, performance, representations, terms, texts, word embeddings, words

0

0

0

0

10:25

23/08/2020

TIMME: Twitter ideology-detection via multi-task multi-relational embedding

Zhiping Xiao, Weiping Song, Haoyan Xu and
Zhicheng Ren, Yizhou Sun

Keywords Paper

graph convolutional networks, social network analysis, ideology detection, heterogeneous information network, multi-task learning

0

0

0

0

17:22

07/06/2021

How-to Present News on Social Media: A Causal Analysis of Editing News Headlines for Boosting User Engagement

Kunwoo Park, Haewoon Kwak, Jisun An, Sanjay Chawla

Keywords Paper

Analysis of the relationship between social media and mainstream media, Credibility of online content, Text categorization, topic recognition, demographic/gender/age identification, Engagement, motivations, incentives, and gamification.

0

0

0

0

7:06

02/02/2021

Unsupervised Summarization for Chat Logs with Topic-Oriented Ranking and Context-Aware Auto-Encoders

Yicheng Zou, Jun Lin, Lujun Zhao and
Yangyang Kang, Zhuoren Jiang, Changlong Sun, Qi Zhang, Xuanjing Huang, Xiaozhong Liu

Keywords Paper

0

0

0

0

14:21