Adaptive fusion techniques for multimodal data

Abstract: Effective fusion of data from multiple modalities, such as video, speech, and text, is challenging due to the heterogeneous nature of multimodal data. In this paper, we propose adaptive fusion techniques that aim to model context from different modalities effectively. Instead of defining a deterministic fusion operation, such as concatenation, for the network, we let the network decide “how” to combine a given set of multimodal features more effectively. We propose two networks: 1) Auto-Fusion, which learns to compress information from different modalities while preserving the context, and 2) GAN-Fusion, which regularizes the learned latent space given context from complementing modalities. A quantitative evaluation on the tasks of multimodal machine translation and emotion recognition suggests that our lightweight, adaptive networks can better model context from other modalities than existing methods, many of which employ massive transformer-based networks.

05/01/2021

Gaurav Sahu, Olga Vechtomova

Similar Papers

Attentional Feature Fusion

Yimian Dai, Fabian Gieseke, Stefan Oehmcke, Yiquan Wu, Kobus Barnard

Attention Bottlenecks for Multimodal Fusion

Arsha Nagrani, Shan Yang, Anurag Arnab, Aren Jansen, Cordelia Schmid, Chen Sun

Keywords: machine learning, transformers

BasisVAE: Translation-invariant feature-level clustering with Variational Autoencoders

Kaspar Märtens, Christopher Yau

Regional Attention Networks With Context-Aware Fusion for Group Emotion Recognition

Ahmed Shehab Khan, Zhiyuan Li, Jie Cai, Yan Tong

Deep Mutual Information Maximin for Cross-Modal Clustering

Yiqiao Mao, Xiaoqiang Yan, Qiang Guo, Yangdong Ye

Correlative Channel-Aware Fusion for Multi-View Time Series Classification

Yue Bai, Lichen Wang, Zhiqiang Tao, Sheng Li, Yun Fu

What to Select: Pursuing Consistent Motion Segmentation from Multiple Geometric Models

Yangbangyan Jiang, Qianqian Xu, Ke Ma, Zhiyong Yang, Xiaochun Cao, Qingming Huang

Deep Multimodal Fusion by Channel Exchanging

Yikai Wang, Wenbing Huang, Fuchun Sun, Tingyang Xu, Yu Rong, Junzhou Huang

DNNFusion: Accelerating Deep Neural Networks Execution with Advanced Operator Fusion

Wei Niu, Jiexiong Guan, Yanzhi Wang, Gagan Agrawal, Bin Ren

Keywords: Compiler Optimization, Operator Fusion, Deep Neural Network, Mobile Devices

Multistage Fusion with Forget Gate for Multimodal Summarization in Open-Domain Videos

Nayu Liu, Xian Sun, Hongfeng Yu, Wenkai Zhang, Guangluan Xu

Keywords: multimodal summarization, multimodal tasks, multiencoder-decoder frameworks, multistage network

Conjugate Energy-Based Models

Hao Wu, Babak Esmaeili, Michael Wick, Jean-Baptiste Tristan, Jan-Willem van de Meent

Keywords: Deep Learning, Generative Models

Multi-Scale Fusion Subspace Clustering Using Similarity Constraint

Zhiyuan Dang, Cheng Deng, Xu Yang, Heng Huang

Keywords: subspace clustering, deep neural network, multi-scale fusion, spectral clustering

CO-Optimal Transport

Vayer Titouan, Ievgen Redko, Rémi Flamary, Nicolas Courty

Time Series Analysis using a Kernel based Multi-Modal Uncertainty Decomposition Framework

Rishabh Singh, Jose Principe

Cross-Domain Grouping and Alignment for Domain Adaptive Semantic Segmentation

Minsu Kim, Sunghun Joung, Seungryong Kim, JungIn Park, Ig-Jae Kim, Kwanghoon Sohn

Multiplicative Interactions and Where to Find Them

Siddhant M. Jayakumar, Wojciech M. Czarnecki, Jacob Menick, Jonathan Schwarz, Jack Rae, Simon Osindero, Yee Whye Teh, Tim Harley, Razvan Pascanu

Keywords: multiplicative interactions, hypernetworks, attention

Multi-facet universal schema

Rohan Paul, Haw-Shiuan Chang, Andrew McCallum

Topic Modeling via Full Dependence Mixtures

Dan Fisher, Mark Kozdoba, Shie Mannor

Keywords: Unsupervised and Semi-Supervised Learning

GOCor: Bringing Globally Optimized Correspondence Volumes into Your Neural Network

Prune Truong, Martin Danelljan, Luc V Gool, Radu Timofte

Minimax Dynamics of Optimally Balanced Spiking Networks of Excitatory and Inhibitory Neurons

Qianyi Li, Cengiz Pehlevan

A Bilingual Generative Transformer for Semantic Sentence Embedding

John Wieting, Graham Neubig, Taylor Berg-Kirkpatrick

Keywords: source separation, semantic encoding, data distributions, unsupervised evaluations

Can You Put it All Together: Evaluating Conversational Agents' Ability to Blend Skills

Eric Michael Smith, Mary Williamson, Kurt Shuster, Jason Weston, Y-Lan Boureau

Keywords: conversational agent, open-domain agent, model schemes, multi-task training

The Return of Lexical Dependencies: Neural Lexicalized PCFGs
