FlexServe: Deployment of PyTorch Models as Flexible REST Endpoints

Abstract: The integration of artificial intelligence capabilities into modern software systems is increasingly being simplified through the use of cloud-based machine learning services and representational state transfer architecture design. However, insufficient information regarding underlying model provenance and the lack of control over model evolution serve as an impediment to more widespread adoption of these services in operational environments which have strict security requirements. Furthermore, although tools such as TensorFlow Serving allow models to be deployed as RESTful endpoints, they require the error-prone process of converting the PyTorch models into static computational graphs needed by TensorFlow. To enable rapid deployments of PyTorch models without the need for intermediate transformations, we have developed FlexServe, a simple library to deploy multi-model ensembles with flexible batching.

04/11/2020

FlexServe: Deployment of PyTorch Models as Flexible REST Endpoints

Edward Verenich, Alvaro Velasquez, M. G. Sarwar Murshed, Faraz Hussain

Comments

Similar Papers

A Tensor Compiler for Unified Machine Learning Prediction Serving

Supun Nakandala, Karla Saur, Gyeong-In Yu and Konstantinos Karanasos, Carlo Curino, Markus Weimer, Matteo Interlandi

Keywords Abstract Paper

Guided Linking: Dynamic Linking without the Costs

Sean Bartell, Will Dietz, Vikram S. Adve

Keywords Abstract Paper

LTO, LLVM, Code deduplication, Dynamic Linking, Shared Libraries, Link-Time Optimization, Plugins, IR

Shiftry: RNN Inference in 2KB of RAM

Aayan Kumar, Vivek Seshadri, Rahul Sharma

Keywords Abstract Paper

Programming language, Fixed-point, Memory management, Machine learning, Embedded devices, Compiler, IoT device

Learning execution through neural code fusion

Zhan Shi, Kevin Swersky, Daniel Tarlow and Parthasarathy Ranganathan, Milad Hashemi

Keywords Abstract Paper

code understanding, graph neural networks, learning program execution, execution traces, program performance

PyGlove: Symbolic Programming for Automated Machine Learning

Daiyi Peng, Xuanyi Dong, Esteban Real and Mingxing Tan, Yifeng Lu, Gabriel Bender, Hanxiao Liu, Adam Kraft, Chen Liang, Quoc V Le

Keywords Abstract Paper

Sealing Pointer-Based Optimizations Behind Pure Functions

Daniel Selsam, Simon Hudon, Leonardo De Moura

Keywords Abstract Paper

functional programming, interactive theorem proving, Lean

Dynamic Routing Networks

Shaofeng Cai, Yao Shu, Wei Wang

Keywords Abstract Paper

MetaNorm: Learning to Normalize Few-Shot Batches Across Domains

Yingjun Du, Xiantong Zhen, Ling Shao, Cees G Snoek

Keywords Abstract Paper

batch normalization, Meta-learning, few-shot domain generalization

Variational Template Machine for Data-to-Text Generation

Rong Ye, Wenxian Shi, Hao Zhou and Zhongyu Wei, Lei Li

Keywords Abstract Paper

Graph Filter-based Multi-view Attributed Graph Clustering

Zhiping Lin, Zhao Kang

Keywords Abstract Paper

Machine Learning, Clustering, Multi-instance; Multi-label; Multi-view learning, Clustering, Unsupervised Learning

A Novel Sequential Coreset Method for Gradient Descent Algorithms

Jiawei Huang, Ruomin Huang, wenjie liu and Nikolaos Freris, Hu Ding

Keywords Abstract Paper

Optimization

MetaPerturb: Transferable Regularizer for Heterogeneous Tasks and Architectures

Jeong Un Ryu, JWoong Shin, Hae Beom Lee, Sung Ju Hwang

Keywords Abstract Paper

Graph Neural Network to Dilute Outliers for Refactoring Monolith Application

Utkarsh Desai, Sambaran Bandyopadhyay, Srikanth Tamilselvam

Keywords Abstract Paper

JAX MD: A Framework for Differentiable Physics

Sam Schoenholz, Dogus Cubuk

Keywords Abstract Paper

Learning fast and precise numerical analysis

Jingxuan He, Gagandeep Singh, Markus Püschel, Martin Vechev

Keywords Abstract Paper

Abstract interpretation, Performance optimization, Machine learning, Numerical domains

Minimizing FLOPs to Learn Efficient Sparse Representations

Biswajit Paria, Chih-Kuan Yeh, Ian E.H. Yen and Ning Xu, Pradeep Ravikumar, Barnabás Póczos

Keywords Abstract Paper

sparse embeddings, deep representations, metric learning, regularization

Deeplite NeutrinoTM: A BlackBox Framework for Constrained Deep Learning Model Optimization

Anush Sankaran, Olivier Mastropietro, Ehsan Saboori and Yasser Idris, Davis Sawyer, MohammadHossein AskariHemmat, Ghouthi Boukli Hacene

Keywords Abstract Paper

Learning Generalized Relational Heuristic Networks for Model-Agnostic Planning

Rushang Karia, Siddharth Srivastava

Keywords Abstract Paper

Contextual Transformation Networks for Online Continual Learning

Quang Pham, Chenghao Liu, Doyen Sahoo, Steven HOI

Keywords Abstract Paper

Continual Learning

Automatic Perturbation Analysis for Scalable Certified Robustness and Beyond

Kaidi Xu, Zhouxing Shi, Huan Zhang and Yihan Wang, Kai-Wei Chang, Minlie Huang, Bhavya Kailkhura, Xue Lin, Cho-Jui Hsieh

Keywords Abstract Paper

Scalable Deep Generative Modeling for Sparse Graphs

Hanjun Dai, Azade Nazi, Yujia Li and Bo Dai, Dale Schuurmans

Keywords Abstract Paper

Deep Learning - Generative Models and Autoencoders

ProGraML: A Graph-based Program Representation for Data Flow Analysis and Compiler Optimizations

Chris Cummins, Zacharias Fisches, Tal Ben-Nun and Torsten Hoefler, Michael O'Boyle, Hugh Leather

Supun Nakandala, Karla Saur, Gyeong-In Yu and
Konstantinos Karanasos, Carlo Curino, Markus Weimer, Matteo Interlandi

Keywords Paper

Keywords Paper

Keywords Paper

Zhan Shi, Kevin Swersky, Daniel Tarlow and
Parthasarathy Ranganathan, Milad Hashemi

Keywords Paper

Daiyi Peng, Xuanyi Dong, Esteban Real and
Mingxing Tan, Yifeng Lu, Gabriel Bender, Hanxiao Liu, Adam Kraft, Chen Liang, Quoc V Le

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Rong Ye, Wenxian Shi, Hao Zhou and
Zhongyu Wei, Lei Li

Keywords Paper

Keywords Paper

Jiawei Huang, Ruomin Huang, wenjie liu and
Nikolaos Freris, Hu Ding

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Biswajit Paria, Chih-Kuan Yeh, Ian E.H. Yen and
Ning Xu, Pradeep Ravikumar, Barnabás Póczos

Keywords Paper

Anush Sankaran, Olivier Mastropietro, Ehsan Saboori and
Yasser Idris, Davis Sawyer, MohammadHossein AskariHemmat, Ghouthi Boukli Hacene

Keywords Paper

Keywords Paper

Keywords Paper

Kaidi Xu, Zhouxing Shi, Huan Zhang and
Yihan Wang, Kai-Wei Chang, Minlie Huang, Bhavya Kailkhura, Xue Lin, Cho-Jui Hsieh

Keywords Paper

Hanjun Dai, Azade Nazi, Yujia Li and
Bo Dai, Dale Schuurmans

Keywords Paper

Chris Cummins, Zacharias Fisches, Tal Ben-Nun and
Torsten Hoefler, Michael O'Boyle, Hugh Leather

Keywords Paper

Jeff Zhang, Sameh Elnikety, Shuayb Zarar and
Atul Gupta, Siddharth Garg

Keywords Paper

Keywords Paper

Keywords Paper

Sid Jayakumar, Razvan Pascanu, Jack Rae and
Simon Osindero, Erich Elsen

Keywords Paper

Keywords Paper

Keywords Paper

Jakub Tarnawski, Amar Phanishayee, Nikhil Devanur and
Divya Mahajan, Fanny Nina Paravecino

Keywords Paper

Keywords Paper

Keywords Paper

Ferran Alet, Javier Lopez-Contreras, James Koppel and
Maxwell Nye, Armando Solar-Lezama, Tomas Lozano-Perez, Leslie Kaelbling, Josh Tenenbaum

Keywords Paper

Keywords Paper

Jia-Heng Tang, Weikai Chen, jie Yang and
Bo Wang, Songrun Liu, Bo Yang, Lin Gao

Keywords Paper

Vincent J. Hellendoorn, Charles Sutton, Rishabh Singh and
Petros Maniatis, David Bieber

Keywords Paper

Senhui Guo, Jing Xu, Dapeng Chen and
Chao Zhang, Xiaogang Wang, Rui Zhao

Keywords Paper