Abstract:
In the last few years, Gradient Boosting Decision Trees (GBDTs) have been widely used in applications such as online advertising and spam filtering. However, GBDT training is often a key performance bottleneck in such data science pipelines, especially when training a large number of deep trees on large data sets. Thus, many parallel and distributed GBDT systems have been researched and developed to accelerate the training process. In this survey paper, we review recent GBDT systems with respect to acceleration on emerging hardware and in cluster computing environments, and compare the advantages and disadvantages of existing implementations. Finally, we present research opportunities and challenges in designing fast next-generation GBDT systems.