Understanding the role of importance weighting for deep learning

Abstract: The recent paper by Byrd & Lipton (2019), based on empirical observations, raises a major concern on the impact of importance weighting for the over-parameterized deep learning models. They observe that as long as the model can separate the training data, the impact of importance weighting diminishes as the training proceeds. Nevertheless, there lacks a rigorous characterization of this phenomenon. In this paper, we provide formal characterizations and theoretical justifications on the role of importance weighting with respect to the implicit bias of gradient descent and margin-based learning theory. We reveal both the optimization dynamics and generalization performance under deep learning models. Our work not only explains the various novel phenomenons observed for importance weighting in deep learning, but also extends to the studies where the weights are being optimized as part of the model, which applies to a number of topics under active research.

13/04/2021

Understanding the role of importance weighting for deep learning

Da Xu, Yuting Ye, Chuanwei Ruan

Comments

Similar Papers

Bayesian active learning by soft mean objective cost of uncertainty

Guang Zhao, Edward Dougherty, Byung-Jun Yoon and Francis J. Alexander, Xiaoning Qian

Keywords Abstract Paper

Leveraging Recursive Gumbel-Max Trick for Approximate Inference in Combinatorial Spaces

Kirill Struminsky, Artyom Gadetsky, Denis Rakitin and Danil Karpushkin, Dmitry Vetrov

Keywords Abstract Paper

deep learning, optimization

What Neural Networks Memorize and Why: Discovering the Long Tail via Influence Estimation

Vitaly Feldman, Chiyuan Zhang

Keywords Abstract Paper

Imitating Deep Learning Dynamics via Locally Elastic Stochastic Differential Equations

Jiayao Zhang, Hua Wang, Weijie Su

Keywords Abstract Paper

deep learning, optimization

Understanding Instance-based Interpretability of Variational Auto-Encoders

Zhifeng Kong, Kamalika Chaudhuri

Keywords Abstract Paper

deep learning, self-supervised learning, generative model, interpretability

The Difficulty of Passive Learning in Deep Reinforcement Learning

Georg Ostrovski, Pablo Samuel Castro, Will Dabney

Keywords Abstract Paper

optimization, reinforcement learning and planning

Reconciling Modern Deep Learning with Traditional Optimization Analyses: The Intrinsic Learning Rate

Zhiyuan Li, Kaifeng Lyu, Sanjeev Arora

Keywords Abstract Paper

Representer Point Selection via Local Jacobian Expansion for Post-hoc Classifier Explanation of Deep Neural Networks and Ensemble Models

Yi Sui, Ga Wu, Scott Sanner

Keywords Abstract Paper

deep learning, optimization, machine learning, vision

Improving Generalization in Meta-learning via Task Augmentation

Huaxiu Yao, Long-Kai Huang, Linjun Zhang and Ying WEI, Li Tian, James Zou, Junzhou Huang, Zhenhui (Jessie) Li

Keywords Abstract Paper

Algorithms, Multitask, Transfer, and Meta Learning

Large-Scale Cross-Domain Few-Shot Learning

Jiechao Guan, Manli Zhang, Zhiwu Lu

Keywords Abstract Paper

Measuring Generalization with Optimal Transport

Ching-Yao Chuang, Youssef Mroueh, Kristjan Greenewald and Antonio Torralba, Stefanie Jegelka

Keywords Abstract Paper

deep learning, optimal transport

Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Making by Reinforcement Learning

Kai Wang, Sanket Shah, Haipeng Chen and Andrew Perrault, Finale Doshi-Velez, Milind Tambe

Keywords Abstract Paper

deep learning, optimization, reinforcement learning and planning

Lower Bounds on Cross-Entropy Loss in the Presence of Test-time Adversaries

Arjun Nitin Bhagoji, Daniel Cullina, Vikash Sehwag, Prateek Mittal

Keywords Abstract Paper

Algorithms, Adversarial Examples

Extrapolation for Large-batch Training in Deep Learning

Tao LIN, Lingjing Kong, Sebastian Stich, Martin Jaggi

Keywords Abstract Paper

Deep Learning - Algorithms

Deep Frequency Principle Towards Understanding Why Deeper Learning Is Faster

Zhiqin John Xu, Hanxu Zhou

Keywords Abstract Paper

Learning A Minimax Optimizer: A Pilot Study

Jiayi Shen, Xiaohan Chen, Howard Heaton and Tianlong Chen, Jialin Liu, Wotao Yin, Zhangyang Wang

Keywords Abstract Paper

Minimax Optimization, Learning to Optimize

Bayesian Optimization for Iterative Learning

Vu Nguyen, Sebastian Schulze, Michael A Osborne

Keywords Abstract Paper

Model-based Adversarial Meta-Reinforcement Learning

Zichuan Lin, Garrett Thomas, Guangwen Yang, Tengyu Ma

Keywords Abstract Paper

Deep Extended Hazard Models for Survival Analysis

Qixian Zhong, Jonas Mueller, Jane-Ling Wang

Keywords Abstract Paper

deep learning

Imbalance Robust Softmax for Deep Embedding Learning

Hao Zhu, Yang Yuan, Guosheng Hu and Xiang Wu, Neil Robertson

Keywords Abstract Paper

Two sides of the same coin: White-box and black-box attacks for transfer learning

Yinghua Zhang, Yangqiu Song, Jian Liang and Kun Bai, Qiang Yang

Keywords Abstract Paper

adversarial attacks, neural networks, transfer learning

Guang Zhao, Edward Dougherty, Byung-Jun Yoon and
Francis J. Alexander, Xiaoning Qian

Keywords Paper

Kirill Struminsky, Artyom Gadetsky, Denis Rakitin and
Danil Karpushkin, Dmitry Vetrov

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Huaxiu Yao, Long-Kai Huang, Linjun Zhang and
Ying WEI, Li Tian, James Zou, Junzhou Huang, Zhenhui (Jessie) Li

Keywords Paper

Keywords Paper

Ching-Yao Chuang, Youssef Mroueh, Kristjan Greenewald and
Antonio Torralba, Stefanie Jegelka

Keywords Paper

Kai Wang, Sanket Shah, Haipeng Chen and
Andrew Perrault, Finale Doshi-Velez, Milind Tambe

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Jiayi Shen, Xiaohan Chen, Howard Heaton and
Tianlong Chen, Jialin Liu, Wotao Yin, Zhangyang Wang

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Hao Zhu, Yang Yuan, Guosheng Hu and
Xiang Wu, Neil Robertson

Keywords Paper

Yinghua Zhang, Yangqiu Song, Jian Liang and
Kun Bai, Qiang Yang

Keywords Paper

Mi Luo, Fei Chen, Dapeng Hu and
Yifan Zhang, Jian Liang, Jiashi Feng

Keywords Paper

Keywords Paper

Baifeng Shi, Judy Hoffman, Kate Saenko and
Trevor Darrell, Huijuan Xu

Keywords Paper

Keywords Paper

Alexander Camuto, George Deligiannidis, Murat Erdogdu and
Mert Gurbuzbalaban, Umut Simsekli, Lingjiong Zhu

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Yue Sun, Adhyyan Narang, Ibrahim Gulluk and
Samet Oymak, Maryam Fazel

Keywords Paper

Keywords Paper

Qianli Shen, Yan Li, Haoming Jiang and
Zhaoran Wang, Tuo Zhao

Keywords Paper