Abstract:
Widely used Deep Learning (DL) frameworks, such as TensorFlow, PyTorch, and MXNet, heavily rely on NVIDIA cuDNN for performance. However, using cuDNN does not always deliver the best performance. One reason is that it is hard to handle every combination of diverse DNN models and GPU architectures with a library of fixed kernel implementations. Another reason is that cuDNN lacks kernel fusion, which offers many opportunities to improve performance. In this paper, we propose a DL optimization framework for diverse GPU workloads, called DeepCuts. It analyzes the DL workload, fuses multiple DL operations into a single GPU kernel, and generates optimized GPU kernels by considering both kernel implementation parameters and GPU architecture parameters. Evaluation results on various DL workloads for both inference and training indicate that DeepCuts outperforms cuDNN/cuBLAS-based implementations and state-of-the-art DL optimization frameworks, such as TVM, TensorFlow XLA, and TensorRT.