SoftSort: A Differentiable Continuous Relaxation of the argsort Operator

12/07/2020

Sebastian Prillo, Julian Eisenschlos

Keywords: Deep Learning - Algorithms

Abstract: Sorting is an important procedure in computer science. However, the argsort operator, which takes a vector as input and returns its sorting permutation, has a discrete image and thus zero gradients almost everywhere. This prohibits end-to-end, gradient-based learning of models that rely on the argsort operator. A natural way to overcome this problem is to replace the argsort operator with a continuous relaxation. Recent work has shown a number of ways to do this. However, the relaxations proposed so far are computationally complex. In this work we propose a simple continuous relaxation for the argsort operator. Unlike previous works, our relaxation is straightforward: it can be implemented in three lines of code, achieves state-of-the-art performance, is easy to reason about mathematically (substantially simplifying proofs), and is up to six times faster than competing approaches. We open-source the code to reproduce all of the experiments.
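To make the idea of a continuous relaxation concrete, here is a minimal, framework-free sketch. It is not the authors' released code, and the abstract itself does not state the formula. The sketch assumes the relaxation takes the form of a row-wise softmax over the negative absolute differences between the sorted values and the original values, scaled by a temperature. The function name `softsort`, the parameter `tau`, and the descending sort order are illustrative assumptions. With a tensor library the same computation fits in a few lines; plain Python is used here only so the sketch has no dependencies.

```python
import math


def softsort(s, tau=1.0):
    """Return a row-stochastic matrix that relaxes the argsort permutation matrix.

    Assumed form (not confirmed by this page): row i is
    softmax(-|sorted(s)[i] - s[j]| / tau) over j, with s sorted in descending order.
    """
    s_sorted = sorted(s, reverse=True)
    rows = []
    for target in s_sorted:
        # Logits are highest for the input closest to the i-th largest value.
        logits = [-abs(target - x) / tau for x in s]
        m = max(logits)  # subtract the max for numerical stability
        exps = [math.exp(l - m) for l in logits]
        z = sum(exps)
        rows.append([e / z for e in exps])
    return rows


scores = [2.0, 5.0, 1.0]
P = softsort(scores, tau=0.1)

# Taking the argmax of each row recovers the hard (descending) argsort.
hard = [max(range(len(row)), key=row.__getitem__) for row in P]
print(hard)
```

Each row of `P` sums to one and depends smoothly on `scores`, so gradients can flow through it. As `tau` shrinks, the rows approach one-hot vectors and `P` approaches the hard permutation matrix.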

The talk and the corresponding paper were published at the ICML 2020 virtual conference.

Similar Papers

14/09/2020

Squeezing Correlated Neurons for Resource-Efficient Deep Neural Networks

Elbruz Ozen, Alex Orailoglu

Keywords: deep learning, information redundancy, pruning

14:48

06/12/2021

Smooth Bilevel Programming for Sparse Regularization

Clarice Poon, Gabriel Peyré

Keywords: machine learning

13:06

12/07/2020

How to Solve Fair k-Center in Massive Data Models

Ashish Chiplunkar, Sagar Kale, Sivaramakrishnan Natarajan Ramamoorthy

Keywords: Fairness, Equity, Justice, and Safety

13:45

15/06/2020

NVTraverse: In NVRAM data structures, the destination is more important than the journey

Michal Friedman, Naama Ben-David, Yuanhao Wei, Guy E. Blelloch, Erez Petrank

Keywords: Non-blocking, Lock-free, Concurrent Data Structures, Non-volatile Memory

16:56

06/12/2021

Closing the Gap: Tighter Analysis of Alternating Stochastic Gradient Methods for Bilevel Problems

Tianyi Chen, Yuejiao Sun, Wotao Yin

Keywords: theory, optimization, reinforcement learning and planning, machine learning

7:27

03/08/2020

Lagrangian Decomposition for Neural Network Verification

Rudy Bunel, Alessandro De Palma, Alban Desmaison, Krishnamurthy Dvijotham, Pushmeet Kohli, Philip Torr, M. Pawan Kumar

8:05

26/08/2020

Risk Bounds for Learning Multiple Components with Permutation-Invariant Losses

Fabien Lauer

14:16

26/04/2020

CLN2INV: Learning Loop Invariants with Continuous Logic Networks

Gabriel Ryan, Justin Wong, Jianan Yao, Ronghui Gu, Suman Jana

Keywords: loop invariants, deep learning, logic learning

5:12

06/12/2020

Tensor Completion Made Practical

Allen Liu, Ankur Moitra

Keywords: Neuroscience and Cognitive Science -> Neuroscience; Neuroscience and Cognitive Science -> Plasticity and Adaptation; Neuroscien, Neuroscience and Cognitive Science

3:05

02/02/2021

A Bottom-Up DAG Structure Extraction Model for Math Word Problems

Yixuan Cao, Feng Hong, Hongwei Li, Ping Luo

14:01

15/06/2020

Learning fast and precise numerical analysis

Jingxuan He, Gagandeep Singh, Markus Püschel, Martin Vechev

Keywords: Abstract interpretation, Performance optimization, Machine learning, Numerical domains

14:20

14/06/2020

Quasi-Newton Solver for Robust Non-Rigid Registration

Yuxin Yao, Bailin Deng, Weiwei Xu, Juyong Zhang

Keywords: non-rigid registration, robust estimator, quasi-Newton, Welsch's function, MM algorithm, L-BFGS, deformation graph

4:56

15/11/2020

Proving Highly-Concurrent Traversals Correct

Yotam M. Y. Feldman, Artem Khyzha, Constantin Enea, Adam Morrison, Aleksandar Nanevski, Noam Rinetzky, Sharon Shoham

Keywords: traversal correctness, proof framework, concurrent data structures, linearizability, traversal

12:08

06/12/2021

Learned Robust PCA: A Scalable Deep Unfolding Approach for High-Dimensional Outlier Detection

HanQin Cai, Jialin Liu, Wotao Yin

Keywords: deep learning, machine learning

8:07

15/06/2020

Inductive sequentialization of asynchronous programs

Bernhard Kragl, Constantin Enea, Thomas A. Henzinger, Suha Orhun Mutluergil, Shaz Qadeer

Keywords: movers, layers, verification, abstraction, invariants, induction, concurrency, refinement, asynchrony, reduction

14:40

02/02/2021

Automated Clustering of High-dimensional Data with a Feature Weighted Mean Shift Algorithm

Saptarshi Chakraborty, Debolina Paul, Swagatam Das

20:09

06/12/2020

Kernel Methods Through the Roof: Handling Billions of Points Efficiently

Giacomo Meanti, Luigi Carratino, Lorenzo Rosasco, Alessandro Rudi

3:28

12/07/2020

The continuous categorical: a novel simplex-valued exponential family

Elliott Gordon-Rodriguez, Gabriel Loaiza-Ganem, John Cunningham

Keywords: Probabilistic Inference - Models and Probabilistic Programming

14:59

04/08/2021

Group testing and local search: is there a computational-statistical gap?

Fotis Iliopoulos, Ilias Zadik

17:50

15/06/2020

Learning nonlinear loop invariants with gated continuous logic networks

Jianan Yao, Gabriel Ryan, Justin Wong, Suman Jana, Ronghui Gu

Keywords: Loop Invariant Inference, Continuous Logic Networks, Program Verification

14:18

06/12/2021

Hyperparameter Tuning is All You Need for LISTA

Xiaohan Chen, Jialin Liu, Zhangyang Wang, Wotao Yin

Keywords: deep learning

15:05

12/07/2020

Fast Deterministic CUR Matrix Decomposition with Accuracy Assurance

Yasutoshi Ida, Sekitoshi Kanai, Yasuhiro Fujiwara, Tomoharu Iwata, Koh Takeuchi, Hisashi Kashima

Keywords: Optimization - General

12:24

13/04/2021

Tensor networks for probabilistic sequence modeling

Jacob Miller, Guillaume Rabusseau, John Terilla

3:01

12/07/2020

Inducing and Exploiting Activation Sparsity for Fast Inference on Deep Neural Networks

Mark Kurtz, Justin Kopinsky, Rati Gelashvili, Alexander Matveev, John Carr, Michael Goin, William Leiserson, Sage Moore, Nir Shavit, Dan Alistarh

Keywords: Deep Learning - Algorithms

14:41

26/04/2020

Minimizing FLOPs to Learn Efficient Sparse Representations

Biswajit Paria, Chih-Kuan Yeh, Ian E.H. Yen, Ning Xu, Pradeep Ravikumar, Barnabás Póczos

Keywords: sparse embeddings, deep representations, metric learning, regularization

4:41

06/12/2021

Differentiable Synthesis of Program Architectures

Guofeng Cui, He Zhu

Keywords: optimization, machine learning, interpretability

13:31

20/08/2020

The Simple Essence of Algebraic Subtyping: Principal Type Inference with Subtyping Made Easy (Functional Pearl)

Lionel Parreaux

Keywords: subtyping, principal types, type inference

14:39

15/11/2020

Fast Linear Programming through Transprecision Computing on Small and Sparse Data

Tobias Grosser, Theodoros Theodoridis, Maximilian Falkenstein, Arjun Pitchanathan, Michael Kruse, Manuel Rigger, Zhendong Su, Torsten Hoefler

Keywords: Presburger Arithmetic, Transprecision, Linear Programming, Simplex

13:35

13/04/2021

PClean: Bayesian data cleaning at scale with domain-specific probabilistic programming

Alexander Lew, Monica Agrawal, David Sontag, Vikash Mansinghka

3:08

06/12/2021

Benign Overfitting in Multiclass Classification: All Roads Lead to Interpolation

Ke Wang, Vidya Muthukumar, Christos Thrampoulidis

Keywords: machine learning

12:38

18/07/2021

A Novel Sequential Coreset Method for Gradient Descent Algorithms

Jiawei Huang, Ruomin Huang, Wenjie Liu, Nikolaos Freris, Hu Ding

Keywords: Optimization

5:15

17/08/2020

Massively parallel rendering of complex closed-form implicit surfaces

Matthew J. Keeter

Keywords: signed distance field, rasterization, octrees, GPU, freps, CUDA, implicit surface

18:18

15/06/2020

Effective function merging in the SSA form

Rodrigo C. O. Rocha, Pavlos Petoumenos, Zheng Wang, Murray Cole, Hugh Leather

Keywords: Code Size Reduction, LTO, Function Merging

11:42

06/12/2020

Transferable Graph Optimizers for ML Compilers

Yanqi Zhou, Sudip Roy, Amirali Abdolrashidi, Daniel Wong, Peter Ma, Qiumin Xu, Hanxiao Liu, Phitchaya Phothilimtha, Shen Wang, Anna Goldie, Azalia Mirhoseini, James Laudon

3:05

19/01/2020

The Weak Call-By-Value λ-Calculus is Reasonable for Both Time and Space

Yannick Forster, Fabian Kunze, Marc Roth

Keywords: lambda calculus, time and space complexity, abstract machines, invariance thesis, weak call-by-value reduction

22:05

13/04/2021

Differentiating the value function by using convex duality

Sheheryar Mehmood, Peter Ochs

2:55

19/01/2020

Undecidability of D<: and Its Decidable Fragments

Jason Z.S. Hu, Ondřej Lhoták

Keywords: Undecidability, Algorithmic Typing, D_{<:}, Dependent Object Types

21:21

06/12/2020

Sparse Spectrum Warped Input Measures for Nonstationary Kernel Learning

Anthony Tompkins, Rafael Oliveira, Fabio Ramos

3:20

23/06/2021

Proving Non-termination by Program Reversal

Krishnendu Chatterjee, Ehsan Kafshdar Goharshady, Petr Novotný, Đorđe Žikelić

Keywords: Static Analysis, Program Termination, Backward Analysis, Invariant Generation, Completeness Guarantees

24:14

15/06/2020

SCAF: A speculation-aware collaborative dependence analysis framework

Sotiris Apostolakis, Ziyang Xu, Zujun Tan, Greg Chan, Simone Campanoni, David I. August

Keywords: speculation, collaboration, dependence analysis

16:16