Neural Arithmetic Units

Abstract: Neural networks can approximate complex functions, but they struggle to perform exact arithmetic operations over real numbers. The lack of inductive bias for arithmetic operations leaves neural networks without the underlying logic necessary to extrapolate on tasks such as addition, subtraction, and multiplication. We present two new neural network components: the Neural Addition Unit (NAU), which can learn exact addition and subtraction; and the Neural Multiplication Unit (NMU) that can multiply subsets of a vector. The NMU is, to our knowledge, the first arithmetic neural network component that can learn to multiply elements from a vector, when the hidden size is large. The two new components draw inspiration from a theoretical analysis of recently proposed arithmetic components. We find that careful initialization, restricting parameter space, and regularizing for sparsity is important when optimizing the NAU and NMU. Our proposed units NAU and NMU, compared with previous neural units, converge more consistently, have fewer parameters, learn faster, can converge for larger hidden sizes, obtain sparse and meaningful weights, and can extrapolate to negative and small values.

06/12/2020

Neural Arithmetic Units

Andreas Madsen, Alexander Rosenberg Johansen

Comments

Similar Papers

Neural Power Units

Niklas Heim, Tomas Pevny, Vasek Smidl

Keywords Abstract Paper

, Reinforcement Learning and Planning -> Reinforcement Learning

GradInit: Learning to Initialize Neural Networks for Stable and Efficient Training

Chen Zhu, Renkun Ni, Zheng Xu and Kezhi Kong, W. Ronny Huang, Tom Goldstein

Keywords Abstract Paper

deep learning, transformers, vision

Robust Pruning at Initialization

Soufiane Hayou, Jean-Francois Ton, Arnaud Doucet, Yee Whye Teh

Keywords Abstract Paper

Pruning, Compression, Initialization

Provable neuromorphic advantages for computing shortest paths

James B. Aimone, Yang Ho, Ojas Parekh and Cynthia A. Phillips, Ali Pinar, William Severa, Yipu Wang

Keywords Abstract Paper

graph algorithms, neuromorphic computing, neuromorphic complexity, shortest paths, spiking neural networks

Complex Query Answering with Neural Link Predictors

Erik Arakelyan, Daniel Daza, Pasquale Minervini, Michael Cochez

Keywords Abstract Paper

neural link prediction, complex query answering

Scalable Verification of Quantized Neural Networks

Thomas A. Henzinger, Mathias Lechner, Đorđe Žikelić

Keywords Abstract Paper

FleXOR: Trainable Fractional Quantization

Dongsoo Lee, Se Jung Kwon, Byeongwook Kim and Yongkweon Jeon, Baeseong Park, Jeongin Yun

Keywords Abstract Paper

The Surprising Simplicity of the Early-Time Learning Dynamics of Neural Networks

Wei Hu, Lechao Xiao, Ben Adlam, Jeffrey Pennington

Keywords Abstract Paper

Go with the flow: Adaptive control for Neural ODEs

Mathieu Chalvidal, Matthew Ricci, Rufin VanRullen, Thomas Serre

Keywords Abstract Paper

Neural ODEs, Normalizing flows, Hypernetworks, Optimal Control Theory

Towards Lower Bounds on the Depth of ReLU Neural Networks

Christoph Hertrich, Amitabh Basu, Marco Di Summa, Martin Skutella

Keywords Abstract Paper

theory, deep learning, optimization

Opening the Blackbox: Accelerating Neural Differential Equations by Regularizing Internal Solver Heuristics

Avik Pal, Yingbo Ma, Viral Shah, Christopher Rackauckas

Keywords Abstract Paper

Deep Learning

Post-training Quantization with Multiple Points: Mixed Precision without Mixed Precision

Xingchao Liu, Mao Ye, Dengyong Zhou, Qiang Liu

Keywords Abstract Paper

Towards Fast Adaptation of Neural Architectures with Meta Learning

Dongze Lian, Yin Zheng, Yintao Xu and Yanxiong Lu, Leyu Lin, Peilin Zhao, Junzhou Huang, Shenghua Gao

Keywords Abstract Paper

Fast adaptation, Meta learning, NAS

Scalable Learning and MAP Inference for Nonsymmetric Determinantal Point Processes

Mike Gartrell, Insu Han, Elvis Dohmatob and Jennifer Gillenwater, Victor-Emmanuel Brunel

Keywords Abstract Paper

submodular optimization, determinantal point processes, unsupervised learning, representation learning

Learn-to-Share: A Hardware-friendly Transfer Learning Framework Exploiting Computation and Parameter Sharing

Cheng Fu, Hanxian Huang, Xinyun Chen and Yuandong Tian, Jishen Zhao

Keywords Abstract Paper

Applications, Natural Language Processing

Neural Architecture Search on ImageNet in Four GPU Hours: A Theoretically Inspired Perspective

Wuyang Chen, Xinyu Gong, Zhangyang Wang

Keywords Abstract Paper

number of linear regions, neural tangent kernel, Neural Architecture Search

Any-Precision Deep Neural Networks

Haichao Yu, Haoxiang Li, Humphrey Shi and Thomas S. Huang, Gang Hua

Keywords Abstract Paper

A computational approach to packet classification

Alon Rashelbach, Ori Rottenstreich, Mark Silberstein

Keywords Abstract Paper

Neural Networks, Virtual Switches, Packet Classification

Learning N:M Fine-grained Structured Sparse Neural Networks From Scratch

Aojun Zhou, Yukun Ma, Junnan Zhu and Jianbo Liu, Zhijie Zhang, Kun Yuan, Wenxiu Sun, Hongsheng Li

Keywords Abstract Paper

sparsity, efficient training and inference.

Provable Benefits of Overparameterization in Model Compression: From Double Descent to Pruning Neural Networks

Xiangyu Chang, Yingcong Li, Samet Oymak, Christos Thrampoulidis

Keywords Abstract Paper

Deep Learning for Functional Data Analysis with Adaptive Basis Layers

Junwen Yao, Jonas Mueller, Jane-Ling Wang

Keywords Paper

Chen Zhu, Renkun Ni, Zheng Xu and
Kezhi Kong, W. Ronny Huang, Tom Goldstein

Keywords Paper

Keywords Paper

James B. Aimone, Yang Ho, Ojas Parekh and
Cynthia A. Phillips, Ali Pinar, William Severa, Yipu Wang

Keywords Paper

Keywords Paper

Keywords Paper

Dongsoo Lee, Se Jung Kwon, Byeongwook Kim and
Yongkweon Jeon, Baeseong Park, Jeongin Yun

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Dongze Lian, Yin Zheng, Yintao Xu and
Yanxiong Lu, Leyu Lin, Peilin Zhao, Junzhou Huang, Shenghua Gao

Keywords Paper

Mike Gartrell, Insu Han, Elvis Dohmatob and
Jennifer Gillenwater, Victor-Emmanuel Brunel

Keywords Paper

Cheng Fu, Hanxian Huang, Xinyun Chen and
Yuandong Tian, Jishen Zhao

Keywords Paper

Keywords Paper

Haichao Yu, Haoxiang Li, Humphrey Shi and
Thomas S. Huang, Gang Hua

Keywords Paper

Keywords Paper

Aojun Zhou, Yukun Ma, Junnan Zhu and
Jianbo Liu, Zhijie Zhang, Kun Yuan, Wenxiu Sun, Hongsheng Li

Keywords Paper

Keywords Paper

Keywords Paper

Jianfei Chen, Lianmin Zheng, Zhewei Yao and
Dequan Wang, Ion Stoica, Michael Mahoney, Joseph E Gonzalez

Keywords Paper

Keywords Paper

Wenshuo Li, Hanting Chen, Mingqiang Huang and
Xinghao Chen, Chunjing Xu, Yunhe Wang

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Haoran You, Xiaohan Chen, Yongan Zhang and
Chaojian Li, Sicheng Li, Zihao Liu, Zhangyang Wang, Yingyan Lin

Keywords Paper

Keywords Paper

Keywords Paper

Xiandong Zhao, Ying Wang, Xuyi Cai and
Cheng Liu, Lei Zhang

Keywords Paper

Alex Norcliffe, Cristian Bodnar, Ben Day and
Nikola Simidjievski, Pietro Lió

Keywords Paper

Keywords Paper

Keywords Paper

Jiaqi Gu, Chenghao Feng, Zheng Zhao and
Zhoufeng Ying, Ray T. Chen, David Z. Pan

Keywords Paper

Keywords Paper