Cockpit: A Practical Debugging Tool for the Training of Deep Neural Networks

06/12/2021

Cockpit: A Practical Debugging Tool for the Training of Deep Neural Networks

Frank Schneider, Felix Dangel, Philipp Hennig

Keywords: deep learning, optimization, interpretability

Abstract Paper Similar Papers

Abstract: When engineers train deep learning models, they are very much "flying blind". Commonly used methods for real-time training diagnostics, such as monitoring the train/test loss, are limited. Assessing a network's training process solely through these performance indicators is akin to debugging software without access to internal states through a debugger. To address this, we present Cockpit, a collection of instruments that enable a closer look into the inner workings of a learning machine, and a more informative and meaningful status report for practitioners. It facilitates the identification of learning phases and failure modes, like ill-chosen hyperparameters. These instruments leverage novel higher-order information about the gradient distribution and curvature, which has only recently become efficiently accessible. We believe that such a debugging tool, which we open-source for PyTorch, is a valuable help in troubleshooting the training process. By revealing new insights, it also more generally contributes to explainability and interpretability of deep nets.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

03/05/2021

Interactive Weak Supervision: Learning Useful Heuristics for Data Labeling

Benedikt Boecking, Willie Neiswanger, Eric P Xing, Artur Dubrawski

Keywords Paper

active learning, data programming, data labeling, weak supervision

0

0

0

0

5:10

04/11/2020

Retiarii: A Deep Learning Exploratory-Training Framework

Quanlu Zhang, Zhenhua Han, Fan Yang and
Yuge Zhang, Zhe Liu, Mao Yang, Lidong Zhou

Keywords Paper

0

0

0

0

20:05

14/06/2020

DaST: Data-Free Substitute Training for Adversarial Attacks

Mingyi Zhou, Jing Wu, Yipeng Liu and
Shuaicheng Liu, Ce Zhu

Keywords Paper

adversarial attacks, machine learning, generative adversarial networks, computer vision

0

0

0

0

4:59

02/02/2021

Single View Point Cloud Generation via Unified 3D Prototype

Yu Lin, Yigong Wang, Yi-Fan Li and
Zhuoyi Wang, Yang Gao, Latifur Khan

Keywords Paper

0

0

0

0

15:35

22/09/2020

Keeping dataset biases out of the simulation: A debiased simulator for reinforcement learning based recommender systems

Jin Huang, Harrie Oosterhuis, Maarten Rijke, Herke Hoof

Keywords Paper

Recommender systems, Simulation, Interaction bias, Reinforcement learning

0

0

0

0

2:45

26/04/2020

Meta-Dataset: A Dataset of Datasets for Learning to Learn from Few Examples

Eleni Triantafillou, Tyler Zhu, Vincent Dumoulin and
Pascal Lamblin, Utku Evci, Kelvin Xu, Ross Goroshin, Carles Gelada, Kevin Swersky, Pierre-Antoine Manzagol, Hugo Larochelle

Keywords Paper

few-shot learning, meta-learning, few-shot classification

0

0

0

0

5:05

06/12/2020

Training Stronger Baselines for Learning to Optimize

Tianlong Chen, Weiyi Zhang, Zhou Jingyang and
Shiyu Chang, Sijia Liu, Lisa Amini, Zhangyang Wang

Keywords Paper

0

0

0

0

3:18

02/02/2021

Deeplite NeutrinoTM: A BlackBox Framework for Constrained Deep Learning Model Optimization

Anush Sankaran, Olivier Mastropietro, Ehsan Saboori and
Yasser Idris, Davis Sawyer, MohammadHossein AskariHemmat, Ghouthi Boukli Hacene

Keywords Paper

0

0

0

0

18:29

25/04/2020

"Why is 'Chicago' deceptive?" Towards Building Model-Driven Tutorials for Humans

Vivian Lai, Han Liu, Chenhao Tan

Keywords Paper

explanations, interpretable machine learning, tutorials, deception detection

0

0

0

0

15:14

18/07/2021

Self-Damaging Contrastive Learning

Ziyu Jiang, Tianlong Chen, Bobak Mortazavi, Zhangyang Wang

Keywords Paper

Algorithms, Unsupervised Learning

0

0

0

1

5:10

06/12/2020

Self-Supervised Few-Shot Learning on Point Clouds

Charu Sharma, Manohar Kaul

Keywords Paper

Algorithms -> Missing Data; Applications -> Sustainability; Deep Learning -> Adversarial Networks; Deep Learning -> Efficient T, Applications -> Time Series Analysis

0

0

0

0

3:38

02/02/2021

Exploratory Machine Learning with Unknown Unknowns

Peng Zhao, Yu-Jie Zhang, Zhi-Hua Zhou

Keywords Paper

0

0

0

0

21:39

02/02/2021

Self-Domain Adaptation for Face Anti-Spoofing

Jingjing Wang, Jingyi Zhang, Ying Bian and
Youyi Cai, Chunmao Wang, Shiliang Pu

Keywords Paper

0

0

0

0

14:02

19/10/2020

Feature extraction for large-scale text collections

Luke Gallagher, Antonio Mallia, J. Shane Culpepper and
Torsten Suel, B. Barla Cambazoglu

Keywords Paper

clueweb, feature index, feature extraction, feature repository, lambdamart, ltr, learning to rank, feature importance

0

0

0

0

9:41

14/06/2020

SAL: Sign Agnostic Learning of Shapes From Raw Data

Matan Atzmon, Yaron Lipman

Keywords Paper

deep learning 3d shapes, implicit neural representations, 3d point clouds, 3d reconstruction, shape spaces, raw 3d data

0

0

0

0

5:01

26/04/2020

Strategies for Pre-training Graph Neural Networks

Weihua Hu, Bowen Liu, Joseph Gomes and
Marinka Zitnik, Percy Liang, Vijay Pande, Jure Leskovec

Keywords Paper

Pre-training, Transfer learning, Graph Neural Networks

0

0

0

0

4:56

16/11/2020

Never Stop Learning: The Effectiveness of Fine-Tuning in Robotic Reinforcement Learning

Ryan Julian, Benjamin Swanson, Gaurav Sukhatme and
Sergey Levine, Chelsea Finn, Karol Hausman

Keywords Paper

0

0

0

0

5:47

22/11/2021

On Automatic Data Augmentation for 3D Point Cloud Classification

Wanyue Zhang, Xun Xu, Fayao Liu and
Le Zhang, Chuan Sheng Foo

Keywords Paper

point cloud, automatic data augmentation

0

0

0

0

2:59

05/04/2021

Amazon SageMaker Debugger: A System for Real-Time Insights into Machine Learning Model Training

Nathalie Rauschmayr, Vikas Kumar, Rahul Huilgol and
Andrea Olgiati, Satadal Bhattacharjee, Nihal Harish, Vandana Kannan, Amol Lele, Anirudh Acharya, Jared Nielsen, Lakshmi Ramakrishnan, Ishan Bhatt, Kohen Chia, Neelesh Dodda, Zhihan Li, Jiacheng Gu, Miyoung Choi, Balajee Nagarajan Nagarajan, Jeffrey Geevarghese, Denis Davydenko, Sifei Li, Lu Huang, Edward Kim, Tyler Hill, Krishnaram Kenthapadi

Keywords Paper

0

0

0

0

19:27

05/04/2021

Amazon SageMaker Debugger: A System for Real-Time Insights into Machine Learning Model Training

Nathalie Rauschmayr, Vikas Kumar, Rahul Huilgol and
Andrea Olgiati, Satadal Bhattacharjee, Nihal Harish, Vandana Kannan, Amol Lele, Anirudh Acharya, Jared Nielsen, Lakshmi Ramakrishnan, Ishan Bhatt, Kohen Chia, Neelesh Dodda, Zhihan Li, Jiacheng Gu, Miyoung Choi, Balajee Nagarajan Nagarajan, Jeffrey Geevarghese, Denis Davydenko, Sifei Li, Lu Huang, Edward Kim, Tyler Hill, Krishnaram Kenthapadi

Keywords Paper

0

0

0

0

3:47

06/12/2020

Self-Distillation Amplifies Regularization in Hilbert Space

Hossein Mobahi, Mehrdad Farajtabar, Peter Bartlett

Keywords Paper

0

0

0

0

3:18

06/12/2021

No Fear of Heterogeneity: Classifier Calibration for Federated Learning with Non-IID Data

Mi Luo, Fei Chen, Dapeng Hu and
Yifan Zhang, Jian Liang, Jiashi Feng

Keywords Paper

optimization, machine learning, federated learning

0

0

0

0

3:27

18/07/2021

Training Data Subset Selection for Regression with Controlled Generalization Error

Durga S, Rishabh Iyer, Ganesh Ramakrishnan, Abir De

Keywords Paper

, Algorithms, Online Learning, Algorithms, Supervised Learning

0

0

0

0

4:15

06/12/2021

Intrinsic Dimension, Persistent Homology and Generalization in Neural Networks

Tolga Birdal, Aaron Lou, Leonidas Guibas, Umut Simsekli

Keywords Paper

theory, deep learning, optimization

0

0

0

0

14:38

06/12/2020

Probabilistic Active Meta-Learning

Jean Kaddour, Steindor Saemundsson, Marc Deisenroth

Keywords Paper

0

0

0

0

3:17

16/11/2020

An Imitation Game for Learning Semantic Parsers from User Interaction

Ziyu Yao, Yiqi Tang, Wen-tau Yih and
Huan Sun, Yu Su

Keywords Paper

bootstrapping, fine-tuning parsers, theoretical analysis, text-to-sql problem

0

0

0

0

11:49

14/06/2020

From Image Collections to Point Clouds With Self-Supervised Shape and Pose Networks

K L Navaneet, Ansu Mathew, Shashank Kashyap and
Wei-Chih Hung, Varun Jampani, R. Venkatesh Babu

Keywords Paper

3d reconstruction, single image reconstruction, self supervised, point clouds, unsupervised, 2d to 3d, image collections

0

0

0

0

1:01

19/08/2021

Cross-Domain Few-Shot Classification via Adversarial Task Augmentation

Haoqing Wang, Zhi-Hong Deng

Keywords Paper

Computer Vision, Recognition, Adversarial Machine Learning, Deep Learning

0

0

0

0

10:39

12/07/2020

More Data Can Expand The Generalization Gap Between Adversarially Robust and Standard Models

Lin Chen, Yifei Min, Mingrui Zhang, Amin Karbasi

Keywords Paper

Adversarial Examples

0

0

0

0

12:01

15/06/2020

Daydream: Accurately Estimating the Efficacy of Optimizations for DNN Training

Hongyu Zhu, Amar Phanishayee, Gennady Pekhimenko

Keywords Paper

0

0

0

0

25:03

18/07/2021

Selecting Data Augmentation for Simulating Interventions

Max Ilse, Jakub Tomczak, Patrick Forré

Keywords Paper

Algorithms, Supervised Learning

0

0

0

0

4:14

06/12/2020

What Neural Networks Memorize and Why: Discovering the Long Tail via Influence Estimation

Vitaly Feldman, Chiyuan Zhang

Keywords Paper

0

0

0

0

3:22

14/09/2020

Model Bridging: Connection between Simulation Model and Neural Network

Keiichi Kisamori, Keisuke Yamazaki, Yuto Komori, Hiroshi Tokieda

Keywords Paper

interpretability, simulation model, kernel mean embedding, data assimilation

0

0

0

0

14:20

06/12/2020

VIME: Extending the Success of Self- and Semi-supervised Learning to Tabular Domain

Jinsung Yoon, Yao Zhang, James Jordon, Mihaela van der Schaar

Keywords Paper

0

0

0

0

3:25

02/02/2021

Self-Supervised Self-Supervision by Combining Deep Learning and Probabilistic Logic

Hunter Lang, Hoifung Poon

Keywords Paper

0

0

0

0

18:09

02/02/2021

I3DOL: Incremental 3D Object Learning without Catastrophic Forgetting

Jiahua Dong, Yang Cong, Gan Sun and
Bingtao Ma, Lichen Wang

Keywords Paper

0

0

0

0

16:45

16/11/2020

Learning Dexterous Manipulation from Suboptimal Experts

Rae Jeong, Jost Tobias Springenberg, Jackie Kay and
Dan Zheng, Alexandre Galashov, Nicolas Heess, Francesco Nori

Keywords Paper

0

0

0

0

5:03

12/07/2020

Graph-based, Self-Supervised Program Repair from Diagnostic Feedback

Michihiro Yasunaga, Percy Liang

Keywords Paper

Applications - Language, Speech and Dialog

0

0

0

1

14:39

26/04/2020

The Ingredients of Real World Robotic Reinforcement Learning

Henry Zhu, Justin Yu, Abhishek Gupta and
Dhruv Shah, Kristian Hartikainen, Avi Singh, Vikash Kumar, Sergey Levine

Keywords Paper

Reinforcement Learning, Robotics

0

0

0

0

4:32

03/05/2021

Model-Based Visual Planning with Self-Supervised Functional Distances

Stephen Tian, Suraj Nair, Frederik Ebert and
Sudeep Dasari, Ben Eysenbach, Chelsea Finn, Sergey Levine

Keywords Paper

reinforcement learning, distance learning, model learning, robotics, planning

0

0

0

0

9:11