Cortex: A Compiler for Recursive Deep Learning Models

05/04/2021

Cortex: A Compiler for Recursive Deep Learning Models

Pratik Fegade, Tianqi Chen, Phillip Gibbons, Todd Mowry

Keywords:

Abstract Paper Similar Papers

Abstract: Optimizing deep learning models is generally performed in two steps: (i) high-level graph optimizations such as kernel fusion and (ii) low level kernel optimizations such as those found in vendor libraries. This approach often leaves significant performance on the table, especially for the case of recursive deep learning models. In this paper, we present Cortex, a compiler-based approach to generate highly-efficient code for recursive models for low latency inference. Our compiler approach and low reliance on vendor libraries enables us to perform end-to-end optimizations, leading to up to 14X lower inference latencies over past work, across different backends.

The video of this talk cannot be embedded. You can watch it here:

https://slideslive.com/38952671

(Link will open in new window)

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at MLSYS 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

26/04/2020

Reinforced Genetic Algorithm Learning for Optimizing Computation Graphs

Aditya Paliwal, Felix Gimeno, Vinod Nair and
Yujia Li, Miles Lubin, Pushmeet Kohli, Oriol Vinyals

Keywords Paper

reinforcement learning, learning to optimize, combinatorial optimization, computation graphs, model parallelism, learning for systems

0

0

0

0

4:21

14/07/2020

Communication-optimal tilings for projective nested loops with arbitrary bounds

Grace Dinh, James Demmel

Keywords Paper

optimal tilings, communication-avoiding algorithms, cache complexity

0

0

0

0

7:33

15/11/2020

Fast Linear Programming through Transprecision Computing on Small and Sparse Data

Tobias Grosser, Theodoros Theodoridis, Maximilian Falkenstein and
Arjun Pitchanathan, Michael Kruse, Manuel Rigger, Zhendong Su, Torsten Hoefler

Keywords Paper

Presburger Arithmetic, Transprecision, Linear Programming, Simplex

0

0

0

0

13:35

18/07/2021

Learn2Hop: Learned Optimization on Rough Landscapes

Amil Merchant, Luke Metz, Samuel Schoenholz, Ekin Cubuk

Keywords Paper

Applications, Others

0

0

0

0

5:19

05/04/2021

A Distributed Graph-Theoretic Framework for Automatic Parallelization in Multi-core Systems

Guixiang Ma, Yao Xiao, Theodore Willke and
Nesreen Ahmed, Shahin Nazarian, Paul Bogdan

Keywords Paper

0

0

0

0

5:29

05/04/2021

A Distributed Graph-Theoretic Framework for Automatic Parallelization in Multi-core Systems

Guixiang Ma, Yao Xiao, Theodore Willke and
Nesreen Ahmed, Shahin Nazarian, Paul Bogdan

Keywords Paper

0

0

0

0

19:43

15/11/2020

Neural Reverse Engineering of Stripped Binaries using Augmented Control Flow Graphs

Yaniv David, Uri Alon, Eran Yahav

Keywords Paper

Static Binary Analysis, Neural Reverse Engineering

0

0

0

0

14:27

26/04/2020

Learning execution through neural code fusion

Zhan Shi, Kevin Swersky, Daniel Tarlow and
Parthasarathy Ranganathan, Milad Hashemi

Keywords Paper

code understanding, graph neural networks, learning program execution, execution traces, program performance

0

0

0

0

5:03

18/07/2021

Unbiased Gradient Estimation in Unrolled Computation Graphs with Persistent Evolution Strategies

Paul Vicol, Luke Metz, Jascha Sohl-Dickstein

Keywords Paper

Deep Learning

0

0

0

0

22:54

18/07/2021

DeepWalking Backwards: From Embeddings Back to Graphs

Sudhanshu Chanpuriya, Cameron Musco, Konstantinos Sotiropoulos, Babis Tsourakakis

Keywords Paper

Algorithms, Uncertainty Estimation, Algorithms, Missing Data; Theory, Regularization, Algorithms, Networks and Relational Learning

0

0

0

0

5:03

06/12/2021

Efficient Training of Retrieval Models using Negative Cache

Erik Lindgren, Sashank Reddi, Ruiqi Guo, Sanjiv Kumar

Keywords Paper

deep learning, machine learning

0

0

0

0

10:41

06/12/2021

Robust and Fully-Dynamic Coreset for Continuous-and-Bounded Learning (With Outliers) Problems

Zixiu Wang, Yiwen Guo, Hu Ding

Keywords Paper

optimization, machine learning, adversarial robustness and security, clustering

0

0

0

0

8:38

06/12/2021

Data driven semi-supervised learning

Maria-Florina Balcan, Dravyansh Sharma

Keywords Paper

theory, generative model, graph learning, online learning, semi-supervised learning

0

0

0

0

12:57

14/09/2020

Online Binary Incomplete Multi-view Clustering

Longqi Yang, Liangliang Zhang, Yuhua Tang

Keywords Paper

0

0

0

0

3:04

26/04/2020

Global Relational Models of Source Code

Vincent J. Hellendoorn, Charles Sutton, Rishabh Singh and
Petros Maniatis, David Bieber

Keywords Paper

Models of Source Code, Graph Neural Networks, Structured Learning

0

0

0

1

5:17

06/12/2020

AdaTune: Adaptive Tensor Program Compilation Made Efficient

Menghao Li, Minjia Zhang, Chi Wang, Mingqin Li

Keywords Paper

0

0

0

0

3:16

02/02/2021

Towards Balanced Defect Prediction with Better Information Propagation

Xianda Zheng, Yuan-Fang Li, Huan Gao and
Yuncheng Hua, Guilin Qi

Keywords Paper

0

0

0

0

15:11

06/12/2021

Approximate Decomposable Submodular Function Minimization for Cardinality-Based Components

Nate Veldt, Austin Benson, Jon Kleinberg

Keywords Paper

machine learning, graph learning, clustering

0

0

0

0

15:08

02/02/2021

Searching for Machine Learning Pipelines Using a Context-Free Grammar

Radu Marinescu, Akihiro Kishimoto, Parikshit Ram and
Ambrish Rawat, Martin Wistuba, Paulito P. Palmes, Adi Botea

Keywords Paper

0

0

0

0

15:25

06/12/2020

Learning to Dispatch for Job Shop Scheduling via Deep Reinforcement Learning

Cong Zhang, Wen Song, Zhiguang Cao and
Jie Zhang, Puay Siew Tan, Xu Chi

Keywords Paper

0

0

0

0

3:17

15/06/2020

Daydream: Accurately Estimating the Efficacy of Optimizations for DNN Training

Hongyu Zhu, Amar Phanishayee, Gennady Pekhimenko

Keywords Paper

0

0

0

0

25:03

12/07/2020

p-Norm Flow Diffusion for Local Graph Clustering

Kimon Fountoulakis, Di Wang, Shenghao Yang

Keywords Paper

Unsupervised and Semi-Supervised Learning

0

0

0

0

14:43

26/08/2020

ASAP: Architecture Search, Anneal and Prune

Asaf Noy, Niv Nayman, Tal Ridnik and
Nadav Zamir, Sivan Doveh, Itamar Friedman, Raja Giryes, Lihi Zelnik

Keywords Paper

0

0

0

0

11:59

14/06/2020

Distribution-Induced Bidirectional Generative Adversarial Network for Graph Representation Learning

Shuai Zheng, Zhenfeng Zhu, Xingxing Zhang and
Zhizhe Liu, Jian Cheng, Yao Zhao

Keywords Paper

graph representation learning, generative adversarial network, graph neural network, unsupervised learning

0

0

0

0

1:01

12/07/2020

Einsum Networks: Fast and Scalable Learning of Tractable Probabilistic Circuits

Robert Peharz, Steven Lang, Antonio Vergari and
Karl Stelzner, Alejandro Molina, Martin Trapp, Guy Van den Broeck, Kristian Kersting, Zoubin Ghahramani

Keywords Paper

Probabilistic Inference - Models and Probabilistic Programming

0

0

0

0

15:29

15/06/2020

OSCA: An Online-Model Based Cache Allocation Scheme in Cloud Block Storage Systems

Yu Zhang, Ping Huang, Ke Zhou and
Hua Wang, Jianying Hu, Yongguang Ji, Bin Cheng

Keywords Paper

0

0

0

0

21:30

15/11/2020

Learning Graph-Based Heuristics for Pointer Analysis without Handcrafting Application-Specific Features

Minseok Jeon, Myungho Lee, Hakjoo Oh

Keywords Paper

Context sensitivity, Pointer analysis, Data-driven static analysis, Heap abstraction, Machine learning for program analysis

0

0

0

0

14:38

13/07/2020

Understanding and Finding Crash-Consistency Bugs in Parallel File Systems

Jinghan Sun, Chen Wang, Jian Huang, Marc Snir

Keywords Paper

0

0

0

0

12:17

06/12/2021

A Bi-Level Framework for Learning to Solve Combinatorial Optimization on Graphs

Runzhong Wang, Zhigang Hua, Gan Liu and
Jiayi Zhang, Junchi Yan, Feng Qi, Shuang Yang, Jun Zhou, Xiaokang Yang

Keywords Paper

deep learning, optimization, reinforcement learning and planning, machine learning, graph learning

0

0

0

0

11:19

12/07/2020

Computational and Statistical Tradeoffs in Inferring Combinatorial Structures of Ising Model

Ying Jin, Zhaoran Wang, Junwei Lu

Keywords Paper

Probabilistic Inference - Models and Probabilistic Programming

0

0

0

0

13:08

03/08/2020

Semi-bandit Optimization in the Dispersed Setting

Travis Dick, Wesley Pegden, Maria-Florina Balcan

Keywords Paper

0

0

0

0

8:04

15/06/2020

PMEvo: Portable inference of port mappings for out-of-order processors by evolutionary optimization

Fabian Ritter, Sebastian Hack

Keywords Paper

port mapping, processor reverse engineering, evolutionary algorithm

0

0

0

0

14:43

06/12/2020

Kernel Methods Through the Roof: Handling Billions of Points Efficiently

Giacomo Meanti, Luigi Carratino, Lorenzo Rosasco, Alessandro Rudi

Keywords Paper

0

0

0

0

3:28

23/08/2020

ALO-NMF: Accelerated locality-optimized non-negative matrix factorization

Gordon E. Moon, J. Austin Ellis, Aravind Sukumaran-Rajam and
Srinivasan Parthasarathy, P. Sadayappan

Keywords Paper

dimensionality reduction, data locality optimization, parallel non-negative matrix factorization

0

0

0

0

10:41

23/08/2020

SCE: Scalable network embedding from sparsest cut

Shengzhong Zhang, Zengfeng Huang, Haicang Zhou, Ziang Zhou

Keywords Paper

network embedding, graph partition, graph neural networks

0

0

0

0

15:05

18/07/2021

ProGraML: A Graph-based Program Representation for Data Flow Analysis and Compiler Optimizations

Chris Cummins, Zacharias Fisches, Tal Ben-Nun and
Torsten Hoefler, Michael O'Boyle, Hugh Leather

Keywords Paper

Applications, Hardware and Systems

0

0

0

0

5:01

04/11/2020

A Tensor Compiler for Unified Machine Learning Prediction Serving

Supun Nakandala, Karla Saur, Gyeong-In Yu and
Konstantinos Karanasos, Carlo Curino, Markus Weimer, Matteo Interlandi

Keywords Paper

0

0

0

0

19:56

14/09/2020

Graph-Revised Convolutional Network

Donghan Yu, Ruohong Zhang, Zhengbao Jiang and
Yuexin Wu, Yiming Yang

Keywords Paper

graph convolutional network, graph learning, semi-supervised learning

0

0

0

0

14:47

26/08/2020

Minimax Bounds for Structured Prediction Based on Factor Graphs

Kevin Bello, Asish Ghoshal, Jean Honorio

Keywords Paper

0

0

0

0

14:51