06/12/2021

On the Expected Complexity of Maxout Networks

Hanna Tseran, Guido Montúfar

Keywords: deep learning, machine learning

Abstract: Learning with neural networks relies on the complexity of their representable functions, but more importantly, their particular assignment of typical parameters to functions of different complexity. Taking the number of activation regions as a complexity measure, recent works have shown that the practical complexity of deep ReLU networks is often far from the theoretical maximum. In this work, we show that this phenomenon also occurs in networks with maxout (multi-argument) activation functions and when considering the decision boundaries in classification tasks. We also show that the parameter space has a multitude of full-dimensional regions with widely different complexity, and obtain nontrivial lower bounds on the expected complexity. Finally, we investigate different parameter initialization procedures and show that they can increase the speed of convergence in training.
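To make the objects in the abstract concrete, below is a minimal sketch of a rank-K maxout unit, which computes the maximum of K affine functions of its input, together with a crude empirical proxy for the complexity measure: sampling inputs and counting distinct activation patterns (the per-unit argmax indices that identify a linear region). This is an illustrative sketch in plain NumPy under assumed shapes and random weights, not the authors' code or the counting method used in the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

def maxout_layer(x, W, b):
    """Forward pass of one maxout layer.

    x : (d,) input
    W : (units, K, d) weights, K pre-activations per unit
    b : (units, K) biases
    Returns the (units,) outputs max_k(w_k . x + b_k) and the
    (units,) argmax indices, i.e. the activation pattern that
    identifies which linear region the input falls into.
    """
    pre = np.einsum("ukd,d->uk", W, x) + b  # (units, K) pre-activations
    return pre.max(axis=1), pre.argmax(axis=1)

# Crude proxy for the number of activation regions: sample inputs
# and count distinct activation patterns. Shapes and sample count
# are arbitrary choices for illustration.
d, units, K, n_samples = 2, 5, 3, 10_000
W = rng.standard_normal((units, K, d))
b = rng.standard_normal((units, K))
patterns = {tuple(maxout_layer(rng.standard_normal(d), W, b)[1])
            for _ in range(n_samples)}
print(f"distinct activation patterns found: {len(patterns)}")
```

Random sampling only lower-bounds the true region count (regions with small volume are easily missed), which is consistent with the paper's theme that typical parameters realize far fewer regions than the theoretical maximum.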

Talk and paper published at the NeurIPS 2021 virtual conference.

