Generalized Bayesian Posterior Expectation Distillation for Deep Neural Networks

Abstract: In this paper, we present a general framework for distilling expectations with respect to the Bayesian posterior distribution of a deep neural network classifier, extending prior work on the Bayesian Dark Knowledge framework. The proposed framework takes as input "teacher" and "student" model architectures and a general posterior expectation of interest. The distillation method performs an online compression of the selected posterior expectation using iteratively generated Monte Carlo samples. We focus on the posterior predictive distribution and expected entropy as distillation targets. We investigate several aspects of this framework including the impact of uncertainty and the choice of student model architecture. We study methods for student model architecture search from a speed-storage-accuracy perspective and evaluate down-stream tasks leveraging entropy distillation including uncertainty ranking and out-of-distribution detection.

02/02/2021

Generalized Bayesian Posterior Expectation Distillation for Deep Neural Networks

Meet Vadera, Brian Jalaian, Benjamin Marlin

Comments

Similar Papers

Teacher Guided Neural Architecture Search for Face Recognition

Xiaobo Wang

Keywords Abstract Paper

On Learnability via Gradient Method for Two-Layer ReLU Neural Networks in Teacher-Student Setting

Shunta Akiyama, Taiji Suzuki

Keywords Abstract Paper

Theory, Deep learning Theory

Online Learning Of Neural Computations From Sparse Temporal Feedback

Lukas Braun, Tim Vogels

Keywords Abstract Paper

deep learning, online learning

Stochastic Precision Ensemble: Self-Knowledge Distillation for Quantized Deep Neural Networks

Yoonho Boo, Sungho Shin, Jungwook Choi, Wonyong Sung

Keywords Abstract Paper

Learning Distilled Collaboration Graph for Multi-Agent Perception

Yiming Li, Shunli Ren, Pengxiang Wu and Siheng Chen, Chen Feng, Wenjun Zhang

Keywords Abstract Paper

vision, graph learning

Adaptive Distillation: Aggregating Knowledge from Multiple Paths for Efficient Distillation

Sumanth Chennupati, Mohammad Mahdi Kamani, Zhongwei Cheng, Lin Chen

Keywords Abstract Paper

Knowledge Distillation, Multitask Learning, Model Compression, Adaptive Distillation, Efficient Training

Learning Energy-Based Model with Variational Auto-Encoder as Amortized Sampler

Jianwen Xie, Zilong Zheng, Ping Li

Keywords Abstract Paper

Learning curves of generic features maps for realistic datasets with a teacher-student model

Bruno Loureiro, Cedric Gerbelot, Hugo Cui and Sebastian Goldt, Florent Krzakala, Marc Mezard, Lenka Zdeborová

Keywords Abstract Paper

deep learning, machine learning, kernel methods

Attentional graph convolutional networks for knowledge concept recommendation in MOOCs in a heterogeneous view

Jibing Gong, Shen Wang, Jinlong Wang and Wenzheng Feng, Hao Peng, Jie Tang, Philip S. Yu

Keywords Abstract Paper

graph neural networks, heterogeneous information network, recommender system

Data-Free Knowledge Distillation with Soft Targeted Transfer Set Synthesis

Zi Wang

Keywords Abstract Paper

Introspective Learning by Distilling Knowledge from Online Self-explanation

Jindong Gu, Zhiliang Wu, Volker Tresp

Keywords Abstract Paper

Variational Autoencoder with Embedded Student-t Mixture Model for Authorship Attribution

Benedikt Boenninghoff, Steffen Zeiler, Robert Nickel, Dorothea Kolossa

Keywords Abstract Paper

Zero-Shot Knowledge Distillation from a Decision-Based Black-Box Model

Zi Wang

Keywords Abstract Paper

Deep Learning

Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning

Younggyo Seo, Kimin Lee, Ignasi Clavera Gilaberte and Thanard Kurutach, Jinwoo Shin, Pieter Abbeel

Keywords Abstract Paper

Explore Visual Concept Formation for Image Classification

Shengzhou Xiong, Yihua Tan, Guoyou Wang

Keywords Abstract Paper

Deep Learning

Meta-Learning for Relative Density-Ratio Estimation

Atsutoshi Kumagai, Tomoharu Iwata, Yasuhiro Fujiwara

Keywords Abstract Paper

deep learning, machine learning, meta learning

PDF-Distil: including Prediction Disagreements in Feature-based Distillation for object detection

Heng ZHANG, Elisa Fromont, Sébastien Lefèvre, Bruno AVIGNON

Keywords Abstract Paper

knowledge distillation: object detection

Few Sample Knowledge Distillation for Efficient Network Compression

Tianhong Li, Jianguo Li, Zhuang Liu, Changshui Zhang

Keywords Abstract Paper

efficient network compression, few samples, knowledge distillation

A Differentiable Point Process with Its Application to Spiking Neural Networks

Hiroshi Kajino

Keywords Abstract Paper

, Reinforcement Learning and Planning, Applications, Neuroscience and Cognitive Science

Learning Cycle-Consistent Cooperative Networks via Alternating MCMC Teaching for Unsupervised Cross-Domain Translation

Jianwen Xie, Zilong Zheng, Xiaolin Fang and Song-Chun Zhu, Ying Nian Wu

Keywords Abstract Paper

Black-Box Ripper: Copying black-box models using generative evolutionary algorithms

Antonio Barbalau, Adrian Cosma, Radu Tudor Ionescu, Marius Popescu

Keywords Abstract Paper

A Statistical Mechanics Framework for Task-Agnostic Sample Design in Machine Learning

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Yiming Li, Shunli Ren, Pengxiang Wu and
Siheng Chen, Chen Feng, Wenjun Zhang

Keywords Paper

Keywords Paper

Keywords Paper

Bruno Loureiro, Cedric Gerbelot, Hugo Cui and
Sebastian Goldt, Florent Krzakala, Marc Mezard, Lenka Zdeborová

Keywords Paper

Jibing Gong, Shen Wang, Jinlong Wang and
Wenzheng Feng, Hao Peng, Jie Tang, Philip S. Yu

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Younggyo Seo, Kimin Lee, Ignasi Clavera Gilaberte and
Thanard Kurutach, Jinwoo Shin, Pieter Abbeel

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Jianwen Xie, Zilong Zheng, Xiaolin Fang and
Song-Chun Zhu, Ying Nian Wu

Keywords Paper

Keywords Paper

Bhavya Kailkhura, Jayaraman Thiagarajan, Qunwei Li and
Jize Zhang, Yi Zhou, Timo Bremer

Keywords Paper

Zhongkai Hao, Chengqiang Lu, Zhenya Huang and
Hao Wang, Zheyuan Hu, Qi Liu, Enhong Chen, Cheekong Lee

Keywords Paper

Keywords Paper

Wangshu Zhang, Junhong Liu, Zujie Wen and
Yafang Wang, Gerard de Melo

Keywords Paper

Shell Xu Hu, Pablo Moreno, Yang Xiao and
Xi Shen, Guillaume Obozinski, Neil Lawrence, Andreas Damianou

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Zheng Li, Ying Huang, Defang Chen and
Tianren Luo, Ning Cai, Zhigeng Pan

Keywords Paper

Taehyeon Kim, Jaehoon Oh, Nak Yil Kim and
Sangwook Cho, Se-Young Yun

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Mengxue Li, Yi-Ming Zhai, You-Wei Luo and
Peng-Fei Ge, Chuan-Xian Ren

Keywords Paper