Deep kernel processes

18/07/2021

Laurence Aitchison, Adam Yang, Sebastian Ober

Keywords: Deep Learning, Bayesian Deep Learning

Abstract: We define deep kernel processes, in which positive definite Gram matrices are progressively transformed by nonlinear kernel functions and by sampling from (inverse) Wishart distributions. Remarkably, we find that deep Gaussian processes (DGPs), Bayesian neural networks (BNNs), infinite BNNs, and infinite BNNs with bottlenecks can all be written as deep kernel processes. For DGPs, the equivalence arises because the Gram matrix formed by the inner product of features is Wishart distributed and, as we show, standard isotropic kernels can be written entirely in terms of this Gram matrix; we do not need knowledge of the underlying features. We define a tractable deep kernel process, the deep inverse Wishart process, and give a doubly-stochastic inducing-point variational inference scheme that operates on the Gram matrices, rather than on the features as in DGPs. We show that the deep inverse Wishart process gives superior performance to DGPs and infinite BNNs on fully-connected baselines.
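
The claim that an isotropic kernel needs only the Gram matrix can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the 1/N normalisation of the Gram matrix, the squared-exponential kernel, and the toy covariance `K_in` are all assumptions made for illustration. The sketch relies on the identity that the squared distance between two points equals G_ii - 2 G_ij + G_jj.

```python
import numpy as np

def sample_features(K, n_features, rng):
    """Draw a (P, N) feature matrix whose N columns are i.i.d. N(0, K)."""
    L = np.linalg.cholesky(K)
    Z = rng.standard_normal((K.shape[0], n_features))
    return L @ Z

def gram(F):
    """Gram matrix G = F F^T / N; F F^T is Wishart(K, N) for the features above.
    The 1/N scaling is a convention assumed here."""
    return F @ F.T / F.shape[1]

def rbf_from_gram(G, lengthscale=1.0):
    """Squared-exponential kernel computed from the Gram matrix alone."""
    d = np.diag(G)
    sq_dist = d[:, None] - 2.0 * G + d[None, :]  # G_ii - 2 G_ij + G_jj
    return np.exp(-0.5 * sq_dist / lengthscale**2)

def rbf_from_features(F, lengthscale=1.0):
    """Reference computation that uses the features directly."""
    X = F / np.sqrt(F.shape[1])  # same 1/N scaling as in gram()
    diff = X[:, None, :] - X[None, :, :]
    return np.exp(-0.5 * np.sum(diff**2, axis=-1) / lengthscale**2)

rng = np.random.default_rng(0)
# Hypothetical input covariance over P = 3 data points.
K_in = np.array([[1.0, 0.5, 0.2],
                 [0.5, 1.0, 0.3],
                 [0.2, 0.3, 1.0]])
F = sample_features(K_in, n_features=4, rng=rng)
G = gram(F)
K_gram = rbf_from_gram(G)      # uses only G
K_feat = rbf_from_features(F)  # uses the features
print(np.allclose(K_gram, K_feat))
```

Because `rbf_from_gram` never touches `F`, a layer of this kind can be stacked by sampling a new Gram matrix and applying the kernel again, which is the sense in which the features can be dropped.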

The talk and the corresponding paper were published at the ICML 2021 virtual conference.

Similar Papers

06/12/2021

A variational approximate posterior for the deep Wishart process

Sebastian Ober, Laurence Aitchison

Keywords: generative model, kernel methods

9:26

06/12/2021

Implicit Regularization in Matrix Sensing via Mirror Descent

Fan Wu, Patrick Rebeschini

Keywords: optimization

9:35

26/08/2020

Interpretable Deep Gaussian Processes with Moments

Chi-Ken Lu, Scott Cheng-Hsin Yang, Xiaoran Hao, Patrick Shafto

12:51

13/04/2021

On the convergence of gradient descent in GANs: MMD GAN as a gradient flow

Youssef Mroueh, Truyen Nguyen

2:52

06/12/2021

De-randomizing MCMC dynamics with the diffusion Stein operator

Zheyang Shen, Markus Heinonen, Samuel Kaski

Keywords: optimization, generative model

13:42

06/12/2021

Kernel Functional Optimisation

Arun Kumar Anjanapura Venkatesh, Alistair Shilton, Santu Rana, Sunil Gupta, Svetha Venkatesh

Keywords: machine learning, kernel methods

12:48

03/05/2021

Implicit Normalizing Flows

Cheng Lu, Jianfei Chen, Chongxuan Li, Qiuhao Wang, Jun Zhu

Keywords: probabilistic inference, deep generative models, Normalizing flows, implicit functions

8:03

06/12/2021

A Non-commutative Extension of Lee-Seung's Algorithm for Positive Semidefinite Factorizations

Yong Sheng Soh, Antonios Varvitsiotis

Keywords: theory, optimization

13:34

06/12/2021

Double Machine Learning Density Estimation for Local Treatment Effects with Instruments

Yonghan Jung, Jin Tian, Elias Bareinboim

Keywords: machine learning, causality

14:24

06/12/2020

Tight Nonparametric Convergence Rates for Stochastic Gradient Descent under the Noiseless Linear Model

Raphaël Berthier, Francis Bach, Pierre Gaillard

Keywords: Optimization -> Non-Convex Optimization, Deep Learning -> Optimization for Deep Networks

3:05

13/04/2021

Convergence of Gaussian-smoothed optimal transport distance with sub-gamma distributions and dependent samples

Yixing Zhang, Xiuyuan Cheng, Galen Reeves

3:06

18/07/2021

Understanding the Dynamics of Gradient Flow in Overparameterized Linear models

Salma Tarmoun, Guilherme Franca, Benjamin Haeffele, Rene Vidal

Keywords: Deep Learning, Optimization for Deep Networks

4:50

06/12/2021

Spatio-Temporal Variational Gaussian Processes

Oliver Hamelijnck, William Wilkinson, Niki Loppi, Arno Solin, Theodoros Damoulas

Keywords: generative model, kernel methods

6:04

26/08/2020

Learning spectrograms with convolutional spectral kernels

Zheyang Shen, Markus Heinonen, Samuel Kaski

10:47

12/07/2020

Eliminating the Invariance on the Loss Landscape of Linear Autoencoders

Reza Oftadeh, Jiayi Shen, Zhangyang Wang, Dylan Shell

Keywords: Deep Learning - Theory

15:10

06/12/2020

Online Robust Regression via SGD on the l1 loss

Scott Pesme, Nicolas Flammarion

3:17

06/12/2020

Provable Online CP/PARAFAC Decomposition of a Structured Tensor via Dictionary Learning

Sirisha Rambhatla, Xingguo Li, Jarvis Haupt

3:22

08/07/2020

The Benefit of Being Non-Lazy in Probabilistic λ-calculus: Applicative Bisimulation is Fully Abstract for Non-Lazy Probabilistic Call-by-Name

Gianluca Curzi, Michele Pagani

Keywords: Full abstraction, Observational equivalence, Bisimilarity, Probabilistic lambda calculus, Separation

23:57

09/07/2020

Logsmooth Gradient Concentration and Tighter Runtimes for Metropolized Hamiltonian Monte Carlo

Yin Tat Lee, Ruoqi Shen, Kevin Tian

Keywords: Sampling algorithms, Bayesian methods

14:57

18/07/2021

Scalable Variational Gaussian Processes via Harmonic Kernel Decomposition

Shengyang Sun, Jiaxin Shi, Andrew Wilson, Roger Grosse

Keywords: Algorithms, Uncertainty Estimation, Algorithms, Classification; Deep Learning; Deep Learning, Predictive Models; Deep Learning, Supervised Deep Networks, Probabilistic Methods, Gaussian Processes and Bayesian non-parametrics

5:02

03/05/2021

Improving Relational Regularized Autoencoders with Spherical Sliced Fused Gromov Wasserstein

Khai Nguyen, Son Nguyen, Nhat Ho, Tung Pham, Hung Bui

Keywords: sliced fused Gromov Wasserstein, Relational regularized autoencoder, deep generative model, spherical distributions

4:40

06/12/2021

Landscape analysis of an improved power method for tensor decomposition

Joe Kileel, Timo Klock, João M Pereira

Keywords: optimization, robustness

12:05

26/08/2020

Langevin Monte Carlo without smoothness

Niladri Chatterji, Jelena Diakonikolas, Michael Jordan, Peter Bartlett

15:02

26/08/2020

Kernels over Sets of Finite Sets using RKHS Embeddings, with Application to Bayesian (Combinatorial) Optimization

Poompol Buathong, David Ginsbourger, Tipaluck Krityakierne

14:00

06/12/2020

Sinkhorn Barycenter via Functional Gradient Descent

Zebang Shen, Zhenfu Wang, Alejandro Ribeiro, Hamed Hassani

3:14

04/08/2021

Kernel Thinning

Raaz Dwivedi, Lester Mackey

16:25

04/08/2021

Nonparametric Regression with Shallow Overparametrized Neural Networks Trained by GD with Early Stopping

Ilja Kuzborskij, Csaba Szepesvari

15:14

18/07/2021

SGLB: Stochastic Gradient Langevin Boosting

Aleksei Ustimenko, Liudmila Prokhorenkova

Keywords: Algorithms, Boosting and Ensemble Methods

4:44

12/07/2020

State Space Expectation Propagation: Efficient Inference Schemes for Temporal Gaussian Processes

William Wilkinson, Paul Chang, Michael Andersen, Arno Solin

Keywords: Gaussian Processes

13:31

14/09/2020

Skew Gaussian Processes for Classification

Alessio Benavoli, Dario Azzimonti, Dario Pig

14:28

06/12/2021

DeepGEM: Generalized Expectation-Maximization for Blind Inversion

Angela Gao, Jorge Castellanos, Yisong Yue, Zachary Ross, Katherine Bouman

Keywords: generative model, graph learning

3:46

06/12/2021

DeepReduce: A Sparse-tensor Communication Framework for Federated Deep Learning

Hang Xu, Kelly Kostopoulou, Aritra Dutta, Xin Li, Alexandros Ntoulas, Panos Kalnis

Keywords: deep learning, federated learning

12:15

06/12/2021

Smooth Bilevel Programming for Sparse Regularization

Clarice Poon, Gabriel Peyré

Keywords: machine learning

13:06

26/04/2020

Neural tangent kernels, transportation mappings, and universal approximation

Ziwei Ji, Matus Telgarsky, Ruicheng Xian

Keywords: Neural Tangent Kernel, universal approximation, Barron, transport mapping

4:48

06/12/2020

Reciprocal Adversarial Learning via Characteristic Functions

Shengxi Li, Zeyang Yu, Min Xiang, Danilo Mandic

3:21

18/07/2021

Barlow Twins: Self-Supervised Learning via Redundancy Reduction

Jure Zbontar, Li Jing, Ishan Misra, Yann LeCun, Stephane Deny

Keywords: Algorithms, Unsupervised Learning

4:21

07/09/2020

Lifted Regression/Reconstruction Networks

Rasmus Høier, Christopher Zach

Keywords: Lifted neural networks, Lipschitz continuity, adversarial robustness, energy-based models

8:23

26/04/2020

Functional vs. parametric equivalence of ReLU networks

Mary Phuong, Christoph H. Lampert

Keywords: ReLU networks, symmetry, functional equivalence, over-parameterization

5:15

02/02/2021

FIMAP: Feature Importance by Minimal Adversarial Perturbation

Matt Chapman-Rounds, Umang Bhatt, Erik Pazos, Marc-Andre Schulz, Konstantinos Georgatzis

20:10

06/12/2021

Multiwavelet-based Operator Learning for Differential Equations

Gaurav Gupta, Xiongye Xiao, Paul Bogdan

12:15