Understanding Deflation Process in Over-parametrized Tensor Decomposition

06/12/2021

Rong Ge, Yunwei Ren, Xiang Wang, Mo Zhou

Abstract: In this paper we study the training dynamics of gradient flow on over-parametrized tensor decomposition problems. Empirically, such a training process often first fits larger components and then discovers smaller components, which is similar to the tensor deflation process commonly used in tensor decomposition algorithms. We prove that for orthogonally decomposable tensors, a slightly modified version of gradient flow would follow a tensor deflation process and recover all the tensor components. Our proof suggests that for orthogonal tensors, gradient flow dynamics works similarly to greedy low-rank learning in the matrix setting, which is a first step towards understanding the implicit regularization effect of over-parametrized models for low-rank tensors.
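To make the "tensor deflation process" mentioned in the abstract concrete, here is a minimal sketch of classical greedy deflation on a symmetric third-order orthogonally decomposable tensor. It uses tensor power iteration with random restarts, which is a standard textbook procedure and not the modified gradient flow analyzed in the paper. The function names (make_odeco_tensor, top_component, deflate), the weights, and the dimension are all hypothetical choices made for illustration.

```python
import numpy as np


def make_odeco_tensor(weights, dim, seed=0):
    """Build T = sum_i w_i * a_i (x) a_i (x) a_i with orthonormal a_i (illustrative helper)."""
    rng = np.random.default_rng(seed)
    # QR of a random Gaussian matrix yields an orthonormal basis.
    q, _ = np.linalg.qr(rng.standard_normal((dim, dim)))
    comps = q[:, : len(weights)].T  # each row is one component a_i
    T = np.zeros((dim, dim, dim))
    for w, a in zip(weights, comps):
        T += w * np.einsum("i,j,k->ijk", a, a, a)
    return T, comps


def top_component(T, restarts=20, iters=100, seed=0):
    """Tensor power iteration with random restarts; keeps the unit vector with the largest T(u, u, u)."""
    rng = np.random.default_rng(seed)
    best_val, best_u = -np.inf, None
    for _ in range(restarts):
        u = rng.standard_normal(T.shape[0])
        u /= np.linalg.norm(u)
        for _ in range(iters):
            v = np.einsum("ijk,j,k->i", T, u, u)  # power update: u <- T(I, u, u)
            norm = np.linalg.norm(v)
            if norm < 1e-12:  # start is (numerically) orthogonal to all remaining components
                break
            u = v / norm
        val = float(np.einsum("ijk,i,j,k->", T, u, u, u))
        if val > best_val:
            best_val, best_u = val, u
    return best_val, best_u


def deflate(T, rank):
    """Greedy deflation: fit the largest remaining component, subtract it, repeat."""
    residual = T.copy()
    found = []
    for r in range(rank):
        lam, u = top_component(residual, seed=r)
        found.append((lam, u))
        residual = residual - lam * np.einsum("i,j,k->ijk", u, u, u)
    return found, residual


if __name__ == "__main__":
    # Assumed toy instance: three components with well-separated weights in dimension 5.
    T, comps = make_odeco_tensor([3.0, 2.0, 1.0], dim=5)
    found, residual = deflate(T, rank=3)
    for lam, u in found:
        print("weight:", round(lam, 4), "overlap with true components:", np.round(np.abs(comps @ u), 3))
    print("residual norm:", np.linalg.norm(residual))
```

In this sketch the components are extracted one at a time, largest first, which mirrors the "larger components first, smaller components later" behavior the abstract describes for the training dynamics. The random restarts are a design choice: a single power-iteration run converges to whichever component has the largest weighted overlap with its starting point, so several starts are used to pick out the largest remaining weight.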

The talk and the corresponding paper were published at the NeurIPS 2021 virtual conference.

Similar Papers

06/12/2020

Beyond Lazy Training for Over-parameterized Tensor Decomposition

Xiang Wang, Chenwei Wu, Jason Lee, Tengyu Ma, Rong Ge

3:16

18/07/2021

Implicit Regularization in Tensor Factorization

Noam Razin, Asaf Maman, Nadav Cohen

Keywords: Theory, Deep learning Theory

5:11

02/02/2021

Fine-grained Generalization Analysis of Vector-Valued Learning

Liang Wu, Antoine Ledent, Yunwen Lei, Marius Kloft

13:54

03/05/2021

Initialization and Regularization of Factorized Neural Layers

Misha Khodak, Neil Tenenholtz, Lester Mackey, Nicolo Fusi

Keywords: matrix factorization, knowledge distillation, multi-head attention, model compression

4:25

03/05/2021

A unifying view on implicit bias in training linear neural networks

Chulhee (Charlie) Yun, Shankar Krishnan, Hossein Mobahi

Keywords: convergence, implicit bias, gradient flow, implicit regularization, gradient descent

5:24

06/12/2020

Dual-Free Stochastic Decentralized Optimization with Variance Reduction

Hadrien Hendrikx, Francis Bach, Laurent Massoulié

3:28

13/04/2021

Spectral tensor train parameterization of deep learning layers

Anton Obukhov, Maxim Rakhuba, Alexander Liniger, Zhiwu Huang, Stamatios Georgoulis, Dengxin Dai, Luc Van Gool

3:09

03/05/2021

Differentiable Segmentation of Sequences

Erik Scharwächter, Jonathan Lennartz, Emmanuel Müller

Keywords: warping functions, concept drift, change point detection, segmented models, segmentation, gradient descent

5:10

18/07/2021

Self Normalizing Flows

T. Anderson Keller, Jorn Peters, Priyank Jaini, Emiel Hoogeboom, Patrick Forré, Max Welling

Keywords: Deep Learning, Generative Models

4:24

06/12/2021

Lower and Upper Bounds on the Pseudo-Dimension of Tensor Network Models

Behnoush Khavari, Guillaume Rabusseau

Keywords: theory, machine learning

14:16

14/06/2020

TESA: Tensor Element Self-Attention via Matricization

Francesca Babiloni, Ioannis Marras, Gregory Slabaugh, Stefanos Zafeiriou

Keywords: nonlocal neural network, self-attention, matricization, computational photography, representation learning, inpainting, image classification, instance segmentation, short-exposure-raw to long-exposure-rgb

1:00

06/12/2020

Relative gradient optimization of the Jacobian term in unsupervised deep learning

Luigi Gresele, Giancarlo Fissore, Adrián Javaloy, Bernhard Schölkopf, Aapo Hyvarinen

3:15

06/12/2020

Lipschitz Bounds and Provably Robust Training by Laplacian Smoothing

Vishaal Krishnan, Abed AlRahman Al Makdah, Fabio Pasqualetti

3:48

18/07/2021

A Wasserstein Minimax Framework for Mixed Linear Regression

Theo Diamandis, Yonina Eldar, Alireza Fallah, Farzan Farnia, Asuman Ozdaglar

Keywords: Algorithms, Multimodal Learning

25:41

12/07/2020

Obtaining Adjustable Regularization for Free via Iterate Averaging

Jingfeng Wu, Vladimir Braverman, Lin Yang

Keywords: Optimization - General

12:07

06/12/2021

DeepReduce: A Sparse-tensor Communication Framework for Federated Deep Learning

Hang Xu, Kelly Kostopoulou, Aritra Dutta, Xin Li, Alexandros Ntoulas, Panos Kalnis

Keywords: deep learning, federated learning

12:15

06/12/2021

An online passive-aggressive algorithm for difference-of-squares classification

Lawrence Saul

Keywords: machine learning, online learning

14:00

06/12/2021

Even your Teacher Needs Guidance: Ground-Truth Targets Dampen Regularization Imposed by Self-Distillation

Kenneth Borup, Lars N Andersen

Keywords: theory, deep learning, optimization

6:00

26/04/2020

Gradient Descent Maximizes the Margin of Homogeneous Neural Networks

Kaifeng Lyu, Jian Li

Keywords: margin, homogeneous, gradient descent

15:02

12/07/2020

Optimistic bounds for multi-output learning

Henry Reeve, Ata Kaban

Keywords: Supervised Learning

14:41

12/07/2020

Optimal transport mapping via input convex neural networks

Ashok Vardhan Makkuva, Amirhossein Taghvaei, Sewoong Oh, Jason Lee

Keywords: Probabilistic Inference - Models and Probabilistic Programming

15:09

18/07/2021

Low-Rank Sinkhorn Factorization

Meyer Scetbon, Marco Cuturi, Gabriel Peyré

Keywords: Algorithms, Optimal Transport

5:22

06/12/2020

Learning Linear Programs from Optimal Decisions

Yingcong Tan, Daria Terekhov, Andrew Delong

Keywords: Applications -> Privacy, Anonymity, and Security

3:21

06/12/2021

Simple Stochastic and Online Gradient Descent Algorithms for Pairwise Learning

ZHENHUAN YANG, Yunwen Lei, Puyu Wang, Tianbao Yang, Yiming Ying

Keywords: optimization, machine learning, privacy

14:40

03/05/2021

Vector-output ReLU Neural Network Problems are Copositive Programs: Convex Analysis of Two Layer Networks and Polynomial-time Algorithms

Arda Sahiner, Tolga Ergen, John M Pauly, Mert Pilanci

Keywords: convolutional neural networks, convex duality, copositive programming, nonnegative PCA, semi-nonnegative matrix factorization, computational complexity, global optima, semi-infinite duality, theory, convex optimization, neural networks

6:08

06/12/2021

Faster Neural Network Training with Approximate Tensor Operations

Menachem Adelman, Kfir Levy, Ido Hakimi, Mark Silberstein

Keywords: deep learning, optimization

7:48

13/04/2021

Convergence properties of stochastic hypergradients

Riccardo Grazzi, Massimiliano Pontil, Saverio Salzo

3:11

26/08/2020

Sketching Transformed Matrices with Applications to Natural Language Processing

Yingyu Liang, Zhao Song, Mengdi Wang, Lin Yang, Xin Yang

11:17

06/12/2021

Beyond the Signs: Nonparametric Tensor Completion via Sign Series

Chanwoo Lee, Miaoyan Wang

Keywords: theory, machine learning

12:05

12/07/2020

dS^2LBI: Exploring Structural Sparsity on Deep Network via Differential Inclusion Paths

Yanwei Fu, Chen Liu, Donghao Li, Xinwei Sun, Jinshan ZENG, Yuan Yao

Keywords: Deep Learning - Algorithms

12:45

13/04/2021

Understanding gradient clipping in incremental gradient methods

Jiang Qian, Yuren Wu, Bojin Zhuang, Shaojun Wang, Jing Xiao

3:17

12/07/2020

Extrapolation for Large-batch Training in Deep Learning

Tao LIN, Lingjing Kong, Sebastian Stich, Martin Jaggi

Keywords: Deep Learning - Algorithms

13:21

18/07/2021

Streaming Bayesian Deep Tensor Factorization

Shikai Fang, Zheng Wang, Zhimeng Pan, Ji Liu, Shandian Zhe

Keywords: Probabilistic Methods, Bayesian Methods

5:03

02/02/2021

GLISTER: Generalization based Data Subset Selection for Efficient and Robust Learning

Krishnateja Killamsetty, Durga Sivasubramanian, Ganesh Ramakrishnan, Rishabh Iyer

19:14

06/12/2021

Sampling with Trustworthy Constraints: A Variational Gradient Framework

Xingchao Liu, Xin Tong, Qiang Liu

Keywords: optimization, machine learning, fairness, interpretability

11:21

06/12/2021

Meta-Learning for Relative Density-Ratio Estimation

Atsutoshi Kumagai, Tomoharu Iwata, Yasuhiro Fujiwara

Keywords: deep learning, machine learning, meta learning

8:56

26/04/2020

The Implicit Bias of Depth: How Incremental Learning Drives Generalization

Daniel Gissin, Shai Shalev-Shwartz, Amit Daniely

Keywords: gradient flow, gradient descent, implicit regularization, implicit bias, generalization, optimization, quadratic network, matrix sensing

4:40

18/07/2021

A Sampling-Based Method for Tensor Ring Decomposition

Osman Asif Malik, Stephen Becker

Keywords: Algorithms, Dimensionality Reduction

6:18

12/07/2020

Operation-Aware Soft Channel Pruning using Differentiable Masks

Minsoo Kang, Bohyung Han

Keywords: Applications - Computer Vision

14:56

18/07/2021

Leveraging Non-uniformity in First-order Non-convex Optimization

Jincheng Mei, Yue Gao, Bo Dai, Csaba Szepesvari, Dale Schuurmans

Keywords: Theory, RL, Decisions and Control Theory

4:49