Bayesian inference with certifiable adversarial robustness

Abstract: We consider adversarial training of deep neural networks through the lens of Bayesian learning and present a principled framework for adversarial training of Bayesian Neural Networks (BNNs) with certifiable guarantees. We rely on techniques from constraint relaxation of non-convex optimisation problems and modify the standard cross-entropy error model to enforce posterior robustness to worst-case perturbations in $\epsilon$-balls around input points. We illustrate how the resulting framework can be combined with methods commonly employed for approximate inference of BNNs. In an empirical investigation, we demonstrate that the presented approach enables training of certifiably robust models on MNIST, FashionMNIST, and CIFAR-10 and can also be beneficial for uncertainty calibration. Our method is the first to directly train certifiable BNNs, thus facilitating their deployment in safety-critical applications.
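
The idea of a cross-entropy that accounts for worst-case perturbations can be illustrated with a minimal sketch. It assumes interval bound propagation as the constraint relaxation, a two-layer ReLU network, and weight samples drawn from a Gaussian as a stand-in for an approximate posterior. All function names and shapes here are hypothetical and chosen for illustration; this is not the authors' implementation.

```python
import numpy as np

def interval_affine(lo, hi, W, b):
    """Propagate the box [lo, hi] through the affine map x -> W @ x + b."""
    center = (lo + hi) / 2.0
    radius = (hi - lo) / 2.0
    out_center = W @ center + b
    out_radius = np.abs(W) @ radius  # a box stays a box under an affine map
    return out_center - out_radius, out_center + out_radius

def worst_case_logits(x, eps, params, label):
    """Bound the logits over the l_inf ball of radius eps around x.

    Returns the lower bound for the true class and the upper bound for
    every other class (assumed two-layer ReLU network).
    """
    (W1, b1), (W2, b2) = params
    lo, hi = interval_affine(x - eps, x + eps, W1, b1)
    lo, hi = np.maximum(lo, 0.0), np.maximum(hi, 0.0)  # ReLU is monotone
    lo, hi = interval_affine(lo, hi, W2, b2)
    worst = hi.copy()
    worst[label] = lo[label]
    return worst

def softmax(z):
    e = np.exp(z - z.max())  # shift for numerical stability
    return e / e.sum()

def robust_cross_entropy(x, label, eps, weight_samples):
    """Negative log of the worst-case true-class probability,
    averaged over weight samples (a Monte Carlo estimate)."""
    probs = [softmax(worst_case_logits(x, eps, p, label))[label]
             for p in weight_samples]
    return -np.log(np.mean(probs))

# Hypothetical usage: 4 inputs, 5 hidden units, 3 classes, 3 weight samples.
rng = np.random.default_rng(0)
x = rng.normal(size=4)
samples = [((rng.normal(size=(5, 4)), rng.normal(size=5)),
            (rng.normal(size=(3, 5)), rng.normal(size=3)))
           for _ in range(3)]
print(robust_cross_entropy(x, label=1, eps=0.0, weight_samples=samples))
print(robust_cross_entropy(x, label=1, eps=0.1, weight_samples=samples))
```

With `eps=0.0` the loss reduces to the ordinary cross-entropy of the sample-averaged prediction. For `eps > 0` it can only grow, because the softmax probability of the true class is increasing in its own logit and decreasing in all others.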

06/12/2020

Matthew Wicker, Luca Laurenti, Andrea Patane, Zhuotong Chen, Zheng Zhang, Marta Kwiatkowska

Similar Papers

Adversarial Training is a Form of Data-dependent Operator Norm Regularization

Kevin Roth, Yannic Kilcher, Thomas Hofmann

GAT: Generative Adversarial Training for Adversarial Example Detection and Classification

Xuwang Yin, Soheil Kolouri, Gustavo K Rohde

Keywords: adversarial example detection, adversarial examples classification, robust optimization, ML security, generative modeling, generative classification

Neurosymbolic Reinforcement Learning with Formally Verified Exploration

Greg Anderson, Abhinav Verma, Isil Dillig, Swarat Chaudhuri

Gradients as Features for Deep Representation Learning

Fangzhou Mu, Yingyu Liang, Yin Li

Keywords: representation learning, gradient features, deep learning

Improving Adversarial Robustness via Channel-wise Activation Suppressing

Yang Bai, Yuyuan Zeng, Yong Jiang, Shu-Tao Xia, Daniel Ma, Yisen Wang

Keywords: channel suppressing, adversarial robustness, activation strategy

Security Analysis of Safe and Seldonian Reinforcement Learning Algorithms

Pinar Ozisik, Philip Thomas

Deep Evidential Regression

Alexander Amini, Wilko Schwarting, Ava P Soleimany, Daniela Rus

Theory and Evaluation Metrics for Learning Disentangled Representations

Kien Do, Truyen Tran

Keywords: disentanglement, metrics

Infinite Time Horizon Safety of Bayesian Neural Networks

Mathias Lechner, Đorđe Žikelić, Krishnendu Chatterjee, Thomas Henzinger

Keywords: deep learning, reinforcement learning and planning

Adversarial Robustness via Runtime Masking and Cleansing

Yi-Hsuan Wu, Chia-Hung Yuan, Shan-Hung (Brandon) Wu

Keywords: Adversarial Examples

Learning Optimal Representations with the Decodable Information Bottleneck

Yann Dubois, Douwe Kiela, David Schwab, Ramakrishna Vedantam

A Framework to Learn with Interpretation

Jayneel Parekh, Pavlo Mozharovskyi, Florence d'Alché-Buc

Keywords: deep learning, interpretability

Bridging Adversarial and Statistical Domain Transfer via Spectral Adaptation Networks

Christoph Raab, Philipp Väth, Peter Meier, Frank-Michael Schleif

PAC Confidence Predictions for Deep Neural Network Classifiers

Sangdon Park, Shuo Li, Insup Lee, Osbert Bastani

Keywords: classification, fast DNN inference, probably approximately correct guarantee, calibration, safe planning

Adversarial Training and Provable Robustness: A Tale of Two Objectives

Jiameng Fan, Wenchao Li

Learning a Single Neuron with Gradient Methods

Gilad Yehudai, Ohad Shamir

Keywords: Neural networks/deep learning, Non-convex optimization

Verifying Reinforcement Learning up to Infinity

Edoardo Bacci, Mirco Giacobbe, David Parker

Keywords: Machine Learning, Deep Reinforcement Learning, Validation and Verification, Learning in Robotics

Robust Quantization: One Model to Rule Them All

Moran Shkolnik, Brian Chmiel, Ron Banner, Gil Shomron, Yury Nahshan, Alex Bronstein, Uri Weiser

Probabilistic Safety for Bayesian Neural Networks

Matthew Wicker, Luca Laurenti, Andrea Patane, Marta Kwiatkowska

On Recovering from Modeling Errors Using Testing Bayesian Networks

Haiying Huang, Adnan Darwiche

Keywords: Probabilistic Methods, Graphical Models

Smoothness and Stability in GANs

Casey Chu, Kentaro Minami, Kenji Fukumizu

Keywords: generative adversarial networks, stability, smoothness, convex conjugate

Feature Binding with Category-Dependant MixUp for Semantic Segmentation and Adversarial Robustness
