Dynamic Tensor Rematerialization

03/05/2021

Dynamic Tensor Rematerialization

Marisa Kirisame, Steven S. Lyubomirsky, Altan Haan, Jennifer Brennan, Mike He, Jared G Roesch, Tianqi Chen, Zachary Tatlock

Keywords: Runtime Systems, Memory-saving, Rematerialization, Checkpointing

Abstract Paper Similar Papers

Abstract: Checkpointing enables the training of deep learning models under restricted memory budgets by freeing intermediate activations from memory and recomputing them on demand. Current checkpointing techniques statically plan these recomputations offline and assume static computation graphs. We demonstrate that a simple online algorithm can achieve comparable performance by introducing Dynamic Tensor Rematerialization (DTR), a greedy online algorithm for checkpointing that is extensible and general, is parameterized by eviction policy, and supports dynamic models. We prove that DTR can train an $N$-layer linear feedforward network on an $\Omega(\sqrt{N})$ memory budget with only $\mathcal{O}(N)$ tensor operations. DTR closely matches the performance of optimal static checkpointing in simulated experiments. We incorporate a DTR prototype into PyTorch merely by interposing on tensor allocations and operator calls and collecting lightweight metadata on tensors.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICLR 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2020

Dual-Free Stochastic Decentralized Optimization with Variance Reduction

Hadrien Hendrikx, Francis Bach, Laurent Massoulié

Keywords Paper

0

0

0

0

3:28

02/02/2021

Proxy Synthesis: Learning with Synthetic Classes for Deep Metric Learning

Geonmo Gu, Byungsoo Ko, Han-Gyu Kim

Keywords Paper

0

0

0

0

16:33

06/12/2021

Differentiable Optimization of Generalized Nondecomposable Functions using Linear Programs

Zihang Meng, Lopamudra Mukherjee, Yichao Wu and
Vikas Singh, Sathya Narayanan Ravi

Keywords Paper

deep learning, optimization

0

0

0

0

13:21

23/08/2020

AutoShuffleNet: Learning permutation matrices via an exact lipschitz continuous penalty in deep convolutional neural networks

Jiancheng Lyu, Shuai Zhang, Yingyong Qi, Jack Xin

Keywords Paper

shufflenet, permutation, lipschitz continuous penalty, convolutional neural network

0

0

0

0

13:06

12/07/2020

Operation-Aware Soft Channel Pruning using Differentiable Masks

Minsoo Kang, Bohyung Han

Keywords Paper

Applications - Computer Vision

0

0

0

0

14:56

06/12/2021

Meta Learning Backpropagation And Improving It

Louis Kirsch, Jürgen Schmidhuber

Keywords Paper

deep learning, optimization, generative model, meta learning

0

0

0

0

12:39

13/04/2021

A theoretical characterization of semi-supervised learning with self-training for gaussian mixture models

Samet Oymak, Talha Cihad Gulcu

Keywords Paper

1

1

0

0

2:59

22/11/2021

Adaptive End-to-End Budgeted Network Learning via Inverse Scale Space

Zuyuan Zhong, Chen Liu, Yanwei Fu

Keywords Paper

deep learning, network architecture, growing network, budgeted network learning, pruning

0

0

0

0

2:58

06/12/2021

Simple Stochastic and Online Gradient Descent Algorithms for Pairwise Learning

ZHENHUAN YANG, Yunwen Lei, Puyu Wang and
Tianbao Yang, Yiming Ying

Keywords Paper

optimization, machine learning, privacy

0

0

0

0

14:40

06/12/2020

Learning Differentiable Programs with Admissible Neural Heuristics

Ameesh Shah, Eric Zhan, Jennifer Sun and
Abhinav Verma, Yisong Yue, Swarat Chaudhuri

Keywords Paper

Algorithms -> Missing Data; Algorithms -> Uncertainty Estimation; Probabilistic Methods -> Causal Inference; Probabilistic Meth, Probabilistic Methods -> Bayesian Nonparametrics

0

0

0

0

3:28

14/06/2020

MTL-NAS: Task-Agnostic Neural Architecture Search Towards General-Purpose Multi-Task Learning

Yuan Gao, Haoping Bai, Zequn Jie and
Jiayi Ma, Kui Jia, Wei Liu

Keywords Paper

neural architecture search, general-purpose multi-task learning, task-agnostic search space, single-shot gradient-based search algorithm, minimal entropy regularization

0

0

1

0

1:00

06/12/2021

Nonsmooth Implicit Differentiation for Machine-Learning and Optimization

Jérôme Bolte, Tam Le, Edouard Pauwels, Tony Silveti-Falls

Keywords Paper

deep learning, optimization, machine learning

0

0

0

0

12:32

12/07/2020

Multigrid Neural Memory

Tri Huynh, Michael Maire, Matthew Walter

Keywords Paper

Deep Learning - General

0

0

0

0

13:47

06/12/2021

Learned Robust PCA: A Scalable Deep Unfolding Approach for High-Dimensional Outlier Detection

HanQin Cai, Jialin Liu, Wotao Yin

Keywords Paper

deep learning, machine learning

0

0

0

0

8:07

26/04/2020

Gradients as Features for Deep Representation Learning

Fangzhou Mu, Yingyu Liang, Yin Li

Keywords Paper

representation learning, gradient features, deep learning

0

0

0

0

5:07

13/04/2021

ATOL: Measure vectorization for automatic topologically-oriented learning

Martin Royer, Frederic Chazal, Clément Levrard and
Yuhei Umeda, Yuichi Ike

Keywords Paper

0

0

0

0

3:05

06/12/2020

Supermasks in Superposition

Mitchell Wortsman, Vivek Ramanujan, Rosanne Liu and
Ani Kembhavi, Mohammad Rastegari, Jason Yosinski, Ali Farhadi

Keywords Paper

0

0

0

0

3:03

05/01/2021

Weakly Supervised Instance Segmentation by Deep Community Learning

Jaedong Hwang, Seohyun Kim, Jeany Son, Bohyung Han

Keywords Paper

0

0

0

0

4:14

03/05/2021

Contextual Transformation Networks for Online Continual Learning

Quang Pham, Chenghao Liu, Doyen Sahoo, Steven HOI

Keywords Paper

Continual Learning

0

0

0

0

4:48

15/06/2020

BatchCrypt: Efficient Homomorphic Encryption for Cross-Silo Federated Learning

Chengliang Zhang, Suyi Li, Junzhe Xia and
Wei Wang, Feng Yan, Yang Liu

Keywords Paper

0

0

0

0

22:38

13/04/2021

LassoNet: Neural networks with feature sparsity

Ismael Lemhadri, Feng Ruan, Rob Tibshirani

Keywords Paper

0

0

0

0

3:13

06/12/2021

Efficient Training of Retrieval Models using Negative Cache

Erik Lindgren, Sashank Reddi, Ruiqi Guo, Sanjiv Kumar

Keywords Paper

deep learning, machine learning

0

0

0

0

10:41

26/08/2020

Structured Conditional Continuous Normalizing Flows for Efficient Amortized Inference in Graphical Models

Christian Weilbach, Boyan Beronov, Frank Wood, William Harvey

Keywords Paper

0

0

0

0

14:27

08/07/2020

The Complexity of Bounded Context Switching with Dynamic Thread Creation

Pascal Baumann, Rupak Majumdar, Ramanathan Thinniyam Srinivasan, Georg Zetzsche

Keywords Paper

Dynamic thread creation, Bounded context switching, Asynchronous Programs, Safety verification, State reachability, Petri nets, Complexity, Succinctness, Counter Programs

0

0

0

0

24:26

09/07/2020

Implicit Bias of Gradient Descent for Wide Two-layer Neural Networks Trained with the Logistic Loss

Lénaïc Chizat, Francis Bach

Keywords Paper

Neural networks/deep learning, Non-convex optimization

0

0

0

0

14:41

02/02/2021

OpEvo: An Evolutionary Method for Tensor Operator Optimization

Xiaotian Gao, Wei Cui, Lintao Zhang, Mao Yang

Keywords Paper

0

0

0

0

16:02

18/07/2021

Message Passing Adaptive Resonance Theory for Online Active Semi-supervised Learning

Taehyeong Kim, Injune Hwang, Hyundo Lee and
Hyunseo Kim, Won-Seok Choi, Joseph Lim, Byoung-Tak Zhang

Keywords Paper

Algorithms, Active Learning

0

0

0

0

4:53

26/08/2020

Optimizing Millions of Hyperparameters by Implicit Differentiation

Jonathan Lorraine, Paul Vicol, David Duvenaud

Keywords Paper

0

0

0

0

14:04

23/06/2021

Perceus: Garbage Free Reference Counting with Reuse

Alex Reinking, Ningning Xie, Leonardo de Moura, Daan Leijen

Keywords Paper

Reference Counting, Algebraic Effects, Handlers

0

0

0

0

24:39

14/06/2020

LSM: Learning Subspace Minimization for Low-Level Vision

Chengzhou Tang, Lu Yuan, Ping Tan

Keywords Paper

low-level vision, subspace minimization, stereo matching, optical flow, interactive segmentation, video object segmentation, muli-task learning, zero-shot task transfer

0

0

0

0

5:00

18/07/2021

Grey-box Extraction of Natural Language Models

Santiago Zanella-Beguelin, Shruti Tople, Andrew Paverd, Boris Köpf

Keywords Paper

Algorithms, Unsupervised Learning, Probabilistic Methods; Probabilistic Methods, Graphical Models, Social Aspects of Machine Learning, Privacy, Anonymity, and Security

0

0

0

0

5:22

03/05/2021

MetaNorm: Learning to Normalize Few-Shot Batches Across Domains

Yingjun Du, Xiantong Zhen, Ling Shao, Cees G Snoek

Keywords Paper

batch normalization, Meta-learning, few-shot domain generalization

0

0

0

0

5:48

12/07/2020

Training Neural Networks for and by Interpolation

Leonard Berrada, M. Pawan Kumar, Andrew Zisserman

Keywords Paper

Deep Learning - General

0

0

0

0

16:12

06/12/2020

Personalized Federated Learning with Moreau Envelopes

Canh T. Dinh, Nguyen H. Tran, Josh Nguyen

Keywords Paper

0

0

0

0

3:17

06/12/2020

Dynamical mean-field theory for stochastic gradient descent in Gaussian mixture classification

Francesca Mignacco, Florent Krzakala, Pierfrancesco Urbani, Lenka Zdeborová

Keywords Paper

0

0

0

0

3:21

14/09/2020

An algorithmic framework for decentralised matrix factorisation

Erika Duriakova, Weipeng Huang, Elias Tragos and
Aonghus Lawlor, Barry Smyth, James Geraci, Neil Hurley

Keywords Paper

recommender systems, distributed learning, decentralised matrix factorisation, latent factor models, matrix factorisation, communication efficiency, convergence proof

0

0

0

1

13:30

06/12/2021

Remember What You Want to Forget: Algorithms for Machine Unlearning

Ayush Sekhari, Jayadev Acharya, Gautam Kamath, Ananda Theertha Suresh

Keywords Paper

theory, privacy

0

0

0

0

10:50

15/11/2020

DiffStream: Differential Output Testing for Stream Processing Programs

Konstantinos Kallas, Filip Niksic, Caleb Stanford, Rajeev Alur

Keywords Paper

runtime verification, differential testing, stream processing

0

0

0

0

15:50

03/05/2021

Evolving Reinforcement Learning Algorithms

John Co-Reyes, Yingjie Miao, Daiyi Peng and
Esteban Real, Quoc V Le, Sergey Levine, Honglak Lee, Aleksandra Faust

Keywords Paper

reinforcement learning, genetic programming, meta-learning, evolutionary algorithms

0

0

0

0

13:59

06/12/2020

Untangling tradeoffs between recurrence and self-attention in artificial neural networks

Giancarlo Kerg, bhargav104 Kanuparthi, Anirudh Goyal ALIAS PARTH GOYAL and
Kyle Goyette, Yoshua Bengio, Guillaume Lajoie

Keywords Paper

0

0

0

0

3:20