19/10/2020

MetaTPOT: Enhancing a tree-based pipeline optimization tool using meta-learning

Doron Laadan, Roman Vainshtein, Yarden Curiel, Gilad Katz, Lior Rokach

Keywords: TPOT, meta-learning, genetic programming (GP), AutoML

Abstract: Automatic machine learning (AutoML) aims to automate the different aspects of the data science process and, by extension, allow non-experts to use "off the shelf" machine learning solutions. One of the more popular AutoML methods is the Tree-based Pipeline Optimization Tool (TPOT), which uses genetic programming (GP) to efficiently explore the vast space of ML pipelines and produce a working ML solution. However, TPOT’s GP process comes with substantial time and computational costs. In this study, we explore TPOT’s GP process and propose MetaTPOT, an enhanced variant that uses a meta-learning-based approach to predict the performance of TPOT’s pipeline candidates. MetaTPOT leverages domain knowledge in the form of pipeline pre-ranking to improve TPOT’s speed and performance. Evaluation on 65 classification datasets shows that our approach often improves the outcome of the genetic process while also substantially reducing its running time and computational cost.
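
To make the idea of pipeline pre-ranking concrete, here is a minimal Python sketch of the general pattern the abstract describes: a cheap predictor scores every candidate pipeline, and only the highest-ranked ones get the expensive evaluation. This is not the authors' implementation and it does not use the TPOT API. The function name `prerank_and_evaluate`, the tuple encoding of a pipeline, the `budget` parameter, and all toy scores are assumptions made up for illustration.

```python
from typing import Callable, Dict, List, Sequence, Tuple

# Hypothetical encoding: a pipeline is a tuple of step names.
Pipeline = Tuple[str, ...]


def prerank_and_evaluate(
    candidates: Sequence[Pipeline],
    predict_score: Callable[[Pipeline], float],
    evaluate: Callable[[Pipeline], float],
    budget: int,
) -> List[Tuple[Pipeline, float]]:
    """Rank candidates by a cheap predicted score, then run the expensive
    evaluation only on the top `budget` candidates."""
    ranked = sorted(candidates, key=predict_score, reverse=True)
    shortlisted = ranked[:budget]
    scored = [(pipeline, evaluate(pipeline)) for pipeline in shortlisted]
    # Return the evaluated pipelines, best measured score first.
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Toy stand-ins (made-up numbers, not results from the paper).
# TOY_PREDICTED plays the role of a meta-model's predictions;
# TOY_ACTUAL plays the role of a costly cross-validation score.
TOY_PREDICTED: Dict[Pipeline, float] = {
    ("scaler", "logreg"): 0.8,
    ("pca", "tree"): 0.6,
    ("scaler", "pca", "forest"): 0.7,
    ("tree",): 0.3,
}
TOY_ACTUAL: Dict[Pipeline, float] = {
    ("scaler", "logreg"): 0.75,
    ("pca", "tree"): 0.65,
    ("scaler", "pca", "forest"): 0.82,
    ("tree",): 0.4,
}

evaluation_calls = 0  # counts how many expensive evaluations were run


def toy_evaluate(pipeline: Pipeline) -> float:
    """Pretend to train and score a pipeline; here it is just a lookup."""
    global evaluation_calls
    evaluation_calls += 1
    return TOY_ACTUAL[pipeline]


results = prerank_and_evaluate(
    candidates=list(TOY_PREDICTED),
    predict_score=TOY_PREDICTED.__getitem__,
    evaluate=toy_evaluate,
    budget=2,
)

for pipeline, score in results:
    print(" -> ".join(pipeline), score)
```

In this toy run only two of the four candidates are actually evaluated, which is where the time saving in such a scheme would come from. How well it works depends entirely on how closely the predicted ranking tracks the true one.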

The video of this talk cannot be embedded. You can watch it here:
https://dl.acm.org/doi/10.1145/3340531.3412147#sec-supp
The talk and the respective paper were published at the CIKM 2020 virtual conference.
