TacticZero: Learning to Prove Theorems from Scratch with Deep Reinforcement Learning

06/12/2021

Minchao Wu, Michael Norrish, Christian Walder, Amir Dezfouli

Keywords: reinforcement learning and planning

Abstract: We propose a novel approach to interactive theorem-proving (ITP) using deep reinforcement learning. The proposed framework is able to learn proof search strategies as well as tactic and argument prediction in an end-to-end manner. We formulate the process of ITP as a Markov decision process (MDP) in which each state represents a set of potential derivation paths. This structure allows us to introduce a novel backtracking mechanism which enables the agent to efficiently discard (predicted) dead-end derivations and restart the derivation from promising alternatives. We implement the framework in the HOL theorem prover. Experimental results show that the framework using learned search strategies outperforms existing automated theorem provers (i.e., hammers) available in HOL when evaluated on unseen problems. We further elaborate on the role of key components of the framework using ablation studies.
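The MDP formulation in the abstract, where a state is a set of potential derivation paths and backtracking means resuming from an earlier path, can be illustrated with a toy sketch. Everything below is an assumption made for illustration: the names `Fringe`, `step`, `split_conj`, and `close_true` are hypothetical, goals are plain strings, and nothing here is the authors' implementation or the HOL API.

```python
from typing import Callable, FrozenSet, List, Optional, Tuple

Goal = str
Fringe = FrozenSet[Goal]    # goals still open on one derivation path (hypothetical name)
State = Tuple[Fringe, ...]  # every derivation path explored so far
Tactic = Callable[[Goal], Optional[List[Goal]]]  # returns None when the tactic fails


def split_conj(goal: Goal) -> Optional[List[Goal]]:
    """Toy tactic: reduce 'A & B' to the subgoals 'A' and 'B'."""
    if " & " not in goal:
        return None
    left, right = goal.split(" & ", 1)
    return [left, right]


def close_true(goal: Goal) -> Optional[List[Goal]]:
    """Toy tactic: discharge the trivial goal 'T'."""
    return [] if goal == "T" else None


def step(state: State, fringe_idx: int, goal: Goal, tactic: Tactic) -> State:
    """Apply a tactic to one goal of one fringe and append the resulting fringe.

    Earlier fringes stay in the state, so a later action can pick a different
    fringe_idx to abandon a dead end and resume from an alternative path.
    """
    fringe = state[fringe_idx]
    if goal not in fringe:
        return state
    subgoals = tactic(goal)
    if subgoals is None:  # failed tactic: the state is left unchanged
        return state
    new_fringe = (fringe - {goal}) | frozenset(subgoals)
    return state + (new_fringe,)


def is_proved(state: State) -> bool:
    """The theorem counts as proved once some fringe has no goals left."""
    return any(len(fringe) == 0 for fringe in state)


# Start with a single fringe holding the theorem to prove.
state: State = (frozenset({"T & T"}),)
state = step(state, 0, "T & T", split_conj)  # appends a fringe with the subgoal "T"
state = step(state, 1, "T", close_true)      # appends an empty fringe
```

In this sketch a learned policy would choose `fringe_idx`, `goal`, and `tactic` at each step; selecting an index other than the most recent fringe plays the role of backtracking.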

The talk and the respective paper were published at the NeurIPS 2021 virtual conference.

Similar Papers

26/04/2020

Learning Heuristics for Quantified Boolean Formulas through Reinforcement Learning

Gil Lederman, Markus Rabe, Sanjit Seshia, Edward A. Lee

Keywords: Logic, QBF, Logical Reasoning, SAT, Graph, Reinforcement Learning, GNN

5:33

19/08/2021

Learning CNF Theories Using MDL and Predicate Invention

Arcchit Jain, Clément Gautrais, Angelika Kimmig, Luc De Raedt

Keywords: Machine Learning, Relational Learning, Constraints and Data Mining; Constraints and Machine Learning

15:00

18/11/2020

A state aggregation approach for solving knapsack problem with deep reinforcement learning

Reza Refaei Afshar, Yingqian Zhang, Murat Firat, Uzay Kaymak

12:23

06/12/2021

Heuristic-Guided Reinforcement Learning

Ching-An Cheng, Andrey Kolobov, Adith Swaminathan

Keywords: theory, reinforcement learning and planning

8:39

06/12/2021

On the Convergence of Prior-Guided Zeroth-Order Optimization Algorithms

Shuyu Cheng, Guoqiang Wu, Jun Zhu

Keywords: optimization, reinforcement learning and planning, adversarial robustness and security

13:49

06/12/2020

Modeling and Optimization Trade-off in Meta-learning

Katelyn Gao, Ozan Sener

3:21

02/02/2021

A Deep Reinforcement Learning Approach to First-Order Logic Theorem Proving

Maxwell Crouse, Ibrahim Abdelaziz, Bassem Makni, Spencer Whitehead, Cristina Cornelio, Pavan Kapanipathi, Kavitha Srinivas, Veronika Thost, Michael Witbrock, Achille Fokoue

20:40

03/05/2021

Solving Compositional Reinforcement Learning Problems via Task Reduction

Yunfei Li, Yilin Wu, Huazhe Xu, Xiaolong Wang, Yi Wu

Keywords: reinforcement learning, task reduction, compositional task, sparse reward, imitation learning

4:57

16/11/2020

Understanding the Mechanics of SPIGOT: Surrogate Gradients for Latent Structure Learning

Tsvetomila Mihaylova, Vlad Niculae, André F. T. Martins

Keywords: pipeline systems, ste, latent models, end-to-end training

11:50

06/12/2020

Discovering Reinforcement Learning Algorithms

Junhyuk Oh, Matteo Hessel, Wojciech Czarnecki, Zhongwen Xu, Hado van Hasselt, Satinder Singh, David Silver

3:21

06/12/2021

Accelerating Quadratic Optimization with Reinforcement Learning

Jeffrey Ichnowski, Paras Jain, Bartolomeo Stellato, Goran Banjac, Michael Luo, Francesco Borrelli, Joseph Gonzalez, Ion Stoica, Ken Goldberg

Keywords: optimization, reinforcement learning and planning, machine learning

12:36

12/07/2020

Learning Reasoning Strategies in End-to-End Differentiable Proving

Pasquale Minervini, Tim Rocktäschel, Sebastian Riedel, Edward Grefenstette, Pontus Stenetorp

Keywords: Sequential, Network, and Time-Series Modeling

16:38

26/08/2020

Deep Active Learning: Unified and Principled Method for Query and Training

Changjian Shui, Fan Zhou, Christian Gagné, Boyu Wang

12:12

13/04/2021

Towards a theoretical understanding of the robustness of variational autoencoders

Alexander Camuto, Matthew Willetts, Stephen Roberts, Chris Holmes, Tom Rainforth

3:00

12/07/2020

Sequential Transfer in Reinforcement Learning with a Generative Model

Andrea Tirinzoni, Riccardo Poiani, Marcello Restelli

Keywords: Reinforcement Learning - General

10:54

06/12/2020

Understanding Deep Architecture with Reasoning Layer

Xinshi Chen, Yufei Zhang, Christoph Reisinger, Le Song

3:28

02/02/2021

Open-Set Recognition with Gaussian Mixture Variational Autoencoders

Alexander Cao, Yuan Luo, Diego Klabjan

20:22

06/12/2020

Probabilistic Linear Solvers for Machine Learning

Jonathan Wenger, Philipp Hennig

3:37

18/07/2021

Batch Value-function Approximation with Only Realizability

Tengyang Xie, Nan Jiang

Keywords: Algorithms, Multitask and Transfer Learning, Algorithms, Unsupervised Learning; Applications, Image Segmentation, Theory, RL, Decisions and Control Theory

5:05

12/07/2020

Provable Representation Learning for Imitation Learning via Bi-level Optimization

Sanjeev Arora, Simon Du, Sham Kakade, Yuping Luo, Nikunj Umesh Saunshi

Keywords: Reinforcement Learning - Theory

15:04

26/04/2020

Meta-Learning Acquisition Functions for Transfer Learning in Bayesian Optimization

Michael Volpp, Lukas P. Fröhlich, Kirsten Fischer, Andreas Doerr, Stefan Falkner, Frank Hutter, Christian Daniel

Keywords: Transfer Learning, Meta Learning, Bayesian Optimization, Reinforcement Learning

5:05

06/12/2020

Inverse Reinforcement Learning from a Gradient-based Learner

Giorgia Ramponi, Gianluca Drappo, Marcello Restelli

2:42

03/05/2021

Few-Shot Bayesian Optimization with Deep Kernel Surrogates

Martin Wistuba, Josif Grabocka

Keywords: automl, bayesian optimization, metalearning, few-shot learning

5:18

18/07/2021

Variational Empowerment as Representation Learning for Goal-Conditioned Reinforcement Learning

Jongwook Choi, Archit Sharma, Honglak Lee, Sergey Levine, Shixiang Gu

Keywords: Neuroscience and Cognitive Science, Neuroscience, Reinforcement Learning and Planning, Algorithms, Representation Learning; Algorithms, Sparse Coding and Dimensionality Expansion; Applications, Matrix and Ten

5:16

06/12/2020

Structured Prediction for Conditional Meta-Learning

Ruohan Wang, Yiannis Demiris, Carlo Ciliberto

3:12

06/12/2020

Joint Contrastive Learning with Infinite Possibilities

Qi Cai, Yu Wang, Yingwei Pan, Ting Yao, Tao Mei

3:06

06/12/2021

USCO-Solver: Solving Undetermined Stochastic Combinatorial Optimization Problems

Guangmo Tong

Keywords: optimization

15:00

03/05/2021

INT: An Inequality Benchmark for Evaluating Generalization in Theorem Proving

Yuhuai Wu, Albert Jiang, Jimmy Ba, Roger Grosse

Keywords: Graph neural networks, Generalization, Monte Carlo Tree Search, Transformers, Synthetic benchmark dataset, Theorem proving

17:13

26/04/2020

On the Variance of the Adaptive Learning Rate and Beyond

Liyuan Liu, Haoming Jiang, Pengcheng He, Weizhu Chen, Xiaodong Liu, Jianfeng Gao, Jiawei Han

Keywords: warmup, adam, adaptive learning rate, variance

4:38

03/05/2021

Deep symbolic regression: Recovering mathematical expressions from data via risk-seeking policy gradients

Brenden Petersen, Mikel Landajuela Larma, Terrell N Mundhenk, Claudio Santiago, Soo Kim, Joanne Kim

Keywords: reinforcement learning, automated machine learning, symbolic regression

15:02

13/04/2021

Adversarially robust estimate and risk analysis in linear regression

Yue Xing, Ruizhi Zhang, Guang Cheng

3:03

16/11/2020

PRover: Proof Generation for Interpretable Reasoning over Rules

Swarnadeep Saha, Sayan Ghosh, Shashank Srivastava, Mohit Bansal

Keywords: inference, qa generation, generalization, qa task

11:30

12/07/2020

Sub-Goal Trees -- a Framework for Goal-Based Reinforcement Learning

Tom Jurgenson, Or Avner, Edward Groshev, Aviv Tamar

Keywords: Reinforcement Learning - General

15:04

12/07/2020

Graph-based, Self-Supervised Program Repair from Diagnostic Feedback

Michihiro Yasunaga, Percy Liang

Keywords: Applications - Language, Speech and Dialog

14:39

06/12/2020

Classification with Valid and Adaptive Coverage

Yaniv Romano, Matteo Sesia, Emmanuel Candes

3:14

18/07/2021

Provably Efficient Algorithms for Multi-Objective Competitive RL

Tiancheng Yu, Yi Tian, Jingzhao Zhang, Suvrit Sra

Keywords: Theory, RL, Decisions and Control Theory

17:04

18/07/2021

MURAL: Meta-Learning Uncertainty-Aware Rewards for Outcome-Driven Reinforcement Learning

Kevin Li, Abhishek Gupta, Ashwin D Reddy, Vitchyr Pong, Aurick Zhou, Justin Yu, Sergey Levine

Keywords: Reinforcement Learning and Planning, Deep RL

5:16

02/02/2021

Learning to Attack Real-World Models for Person Re-identification via Virtual-Guided Meta-Learning

Fengxiang Yang, Zhun Zhong, Hong Liu, Zheng Wang, Zhiming Luo, Shaozi Li, Nicu Sebe, Shin'ichi Satoh

14:19

12/07/2020

Why Are Learned Indexes So Effective?

Paolo Ferragina, Fabrizio Lillo, Giorgio Vinciguerra

Keywords: Applications - Other

13:22

14/09/2020

Active deep Q-learning with demonstration

Si-An Chen, Hsuan-Tien Lin, Voot Tangkaratt, Masashi Sugiyama

13:42