Conflict-Averse Gradient Descent for Multi-task learning

Abstract: The goal of multi-task learning is to enable more efficient learning than single task learning by sharing model structures for a diverse set of tasks. A standard multi-task learning objective is to minimize the average loss across all tasks. While straightforward, using this objective often results in much worse final performance for each task than learning them independently. A major challenge in optimizing a multi-task model is the conflicting gradients, where gradients of different task objectives are not well aligned so that following the average gradient direction can be detrimental to specific tasks' performance. Previous work has proposed several heuristics to manipulate the task gradients for mitigating this problem. But most of them lack convergence guarantee and/or could converge to any Pareto-stationary point.In this paper, we introduce Conflict-Averse Gradient descent (CAGrad) which minimizes the average loss function, while leveraging the worst local improvement of individual tasks to regularize the algorithm trajectory. CAGrad balances the objectives automatically and still provably converges to a minimum over the average loss. It includes the regular gradient descent (GD) and the multiple gradient descent algorithm (MGDA) in the multi-objective optimization (MOO) literature as special cases. On a series of challenging multi-task supervised learning and reinforcement learning tasks, CAGrad achieves improved performance over prior state-of-the-art multi-objective gradient manipulation methods.

06/12/2021

deep learning, machine learning, self-supervised learning, generative model, contrastive learning, representation learning

10:52

13/04/2021

Conflict-Averse Gradient Descent for Multi-task learning

Bo Liu, Xingchao Liu, Xiaojie Jin, Peter Stone, Qiang Liu

Comments

Similar Papers

Consistency Regularization for Variational Auto-Encoders

Samarth Sinha, Adji Bousso Dieng

Keywords Abstract Paper

deep learning, machine learning, self-supervised learning, generative model, contrastive learning, representation learning

Online model selection for reinforcement learning with function approximation

Jonathan Lee, Aldo Pacchiano, Vidya Muthukumar and Weihao Kong, Emma Brunskill

Keywords Abstract Paper

Linear Mode Connectivity in Multitask and Continual Learning

Seyed Iman Mirzadeh, Mehrdad Farajtabar, Dilan Gorur and Razvan Pascanu, Hassan Ghasemzadeh

Keywords Abstract Paper

multitask learning, mode connectivity, continual learning, catastrophic forgetting

Efficient Bandit Convex Optimization: Beyond Linear Losses

Arun Sai Suggala, Pradeep Ravikumar, Praneeth Netrapalli

Keywords Abstract Paper

Learning to Forget for Meta-Learning

Sungyong Baik, Seokil Hong, Kyoung Mu Lee

Keywords Abstract Paper

meta learning, few-shot learning, reinforcement learning

Cogradient Descent for Bilinear Optimization

Li'an Zhuo, Baochang Zhang, Linlin Yang and Hanlin Chen, Qixiang Ye, David Doermann, Rongrong Ji, Guodong Guo

Keywords Abstract Paper

bilinear optimization, gradient descent algorithm, convolutional sparse coding, network pruning

Agnostic Learning with Multiple Objectives

Corinna Cortes, Mehryar Mohri, Javier Gonzalvo, Dmitry Storcheus

Keywords Abstract Paper

Hybrid Regret Bounds for Combinatorial Semi-Bandits and Adversarial Linear Bandits

Shinji Ito

Keywords Abstract Paper

bandits

Least Squares Regression with Markovian Data: Fundamental Limits and Algorithms

Dheeraj Nagaraj, Xian Wu, Guy Bresler and Prateek Jain, Praneeth Netrapalli

Keywords Abstract Paper

Improved Penalty Method via Doubly Stochastic Gradients for Bilevel Hyperparameter Optimization

Wanli Shi, Bin Gu

Keywords Abstract Paper

ErrorCompensatedX: error compensation for variance reduced algorithms

Hanlin Tang, Yao Li, Ji Liu, Ming Yan

Keywords Abstract Paper

optimization

CAQL: Continuous Action Q-Learning

Moonkyung Ryu, Yinlam Chow, Ross Anderson and Christian Tjandraatmadja, Craig Boutilier

Keywords Abstract Paper

Reinforcement learning (RL), DQN, Continuous control, Mixed-Integer Programming (MIP)

Distributionally Robust Neural Networks

Shiori Sagawa*, Pang Wei Koh*, Tatsunori B. Hashimoto, Percy Liang

Keywords Abstract Paper

distributionally robust optimization, deep learning, robustness, generalization, regularization

A Distribution-dependent Analysis of Meta Learning

Mikhail Konobeev, Ilja Kuzborskij, Csaba Szepesvari

Keywords Abstract Paper

Theory, Statistical Learning Theory

Extrapolation for Large-batch Training in Deep Learning

Tao LIN, Lingjing Kong, Sebastian Stich, Martin Jaggi

Keywords Abstract Paper

Deep Learning - Algorithms

Beyond Variance Reduction: Understanding the True Impact of Baselines on Policy Optimization

Wes Chung, Valentin Thomas, Marlos C. Machado, Nicolas Le Roux

Keywords Abstract Paper

Reinforcement Learning and Planning

Examining and Combating Spurious Features under Distribution Shift

Chunting Zhou, Xuezhe Ma, Paul Michel, Graham Neubig

Keywords Abstract Paper

Deep Learning, Embedding and Representation learning

Bayesian decision-making under misspecified priors with applications to meta-learning

Max Simchowitz, Christopher Tosh, Akshay Krishnamurthy and Daniel Hsu, Thodoris Lykouris, Miro Dudik, Robert E Schapire

Keywords Abstract Paper

meta learning, bandits

Dissecting Supervised Constrastive Learning

Florian Graf, Christoph Hofer, Marc Niethammer, Roland Kwitt

Keywords Abstract Paper

Theory, Deep learning Theory

Large-Margin Contrastive Learning with Distance Polarization Regularizer

Shuo Chen, Gang Niu, Chen Gong and Jun Li, Jian Yang, Masashi Sugiyama

Keywords Abstract Paper

Algorithms, Unsupervised Learning

Statistically and Computationally Efficient Linear Meta-representation Learning

Keywords Paper

Jonathan Lee, Aldo Pacchiano, Vidya Muthukumar and
Weihao Kong, Emma Brunskill

Keywords Paper

Seyed Iman Mirzadeh, Mehrdad Farajtabar, Dilan Gorur and
Razvan Pascanu, Hassan Ghasemzadeh

Keywords Paper

Keywords Paper

Keywords Paper

Li'an Zhuo, Baochang Zhang, Linlin Yang and
Hanlin Chen, Qixiang Ye, David Doermann, Rongrong Ji, Guodong Guo

Keywords Paper

Keywords Paper

Keywords Paper

Dheeraj Nagaraj, Xian Wu, Guy Bresler and
Prateek Jain, Praneeth Netrapalli

Keywords Paper

Keywords Paper

Keywords Paper

Moonkyung Ryu, Yinlam Chow, Ross Anderson and
Christian Tjandraatmadja, Craig Boutilier

Keywords Paper

Shiori Sagawa, Pang Wei Koh, Tatsunori B. Hashimoto, Percy Liang

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Max Simchowitz, Christopher Tosh, Akshay Krishnamurthy and
Daniel Hsu, Thodoris Lykouris, Miro Dudik, Robert E Schapire

Keywords Paper

Keywords Paper

Shuo Chen, Gang Niu, Chen Gong and
Jun Li, Jian Yang, Masashi Sugiyama

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Tianhe (Kevin) Yu, Saurabh Kumar, Abhishek Gupta and
Sergey Levine, Karol Hausman, Chelsea Finn

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Haoyue Bai, Rui Sun, Lanqing Hong and
Fengwei Zhou, Nanyang Ye, Han-Jia Ye, S.-H. Gary Chan, Zhenguo Li

Keywords Paper

Keywords Paper

Alexander Robey, Luiz Chamon, George J. Pappas and
Hamed Hassani, Alejandro Ribeiro

Keywords Paper

Zifeng Wang, Xi Chen, Rui Wen and
Shao-Lun Huang, Ercan E Kuruoglu, Yefeng Zheng

Keywords Paper

Keywords Paper