Model-based reinforcement learning for biological sequence design

Abstract: The ability to design biological structures such as DNA or proteins would have considerable medical and industrial impact. Doing so presents a challenging black-box optimization problem characterized by the large-batch, low round setting due to the need for labor-intensive wet lab evaluations. In response, we propose using reinforcement learning (RL) based on proximal-policy optimization (PPO) for biological sequence design. RL provides a flexible framework for optimization generative sequence models to achieve specific criteria, such as diversity among the high-quality sequences discovered. We propose a model-based variant of PPO, DyNA-PPO, to improve sample efficiency, where the policy for a new round is trained offline using a simulator fit on functional measurements from prior rounds. To accommodate the growing number of observations across rounds, the simulator model is automatically selected at each round from a pool of diverse models of varying capacity. On the tasks of designing DNA transcription factor binding sites, designing antimicrobial proteins, and optimizing the energy of Ising models based on protein structure, we find that DyNA-PPO performs significantly better than existing methods in settings in which modeling is feasible, while still not performing worse in situations in which a reliable model cannot be learned.

12/07/2020

Population-Based Black-Box Optimization for Biological Sequence Design

Christof Angermueller, David Belanger, Andreea Gane and
Zelda Mariet, David Dohan, Kevin Murphy, Lucy Colwell , D. Sculley

Fourier-transform-based attribution priors improve the interpretability and stability of deep learning models for genomics

Probabilistic Methods, Gaussian Processes, Algorithms, Multitask and Transfer Learning; Probabilistic Methods, Variational Inference, Applications, Computational Biology and Bioinformatics

5:09

12/07/2020

pipeline search, greedy algorithms, experiment design, AutoML, tensor decomposition, submodular optimization, meta-learning

13:40

26/04/2020

Models for code, Differentiable program generator, Combinatorial optimization, Program obfuscation, Adversarial computer programs, Machine Learning (ML) for Programming Languages (PL)/Software Engineering (SE)

6:27

06/12/2021

Adversarial Reweighting for Partial Domain Adaptation

Xiang Gu, Xi Yu, yan yang and
Jian Sun, Zongben Xu

Heuristic Search and Game Playing, Evaluation and Analysis, Heuristic Search and Machine Learning, Meta-Reasoning and Meta-Heuristics

13:51

18/07/2021

Gaining Insight into SARS-CoV-2 Infection and COVID-19 Severity Using Self-supervised Edge Features and Graph Neural Networks

Reinforcement Learning and Planning -> Model-Based RL; Reinforcement Learning and Planning -> Reinforcement Learning, Reinforcement Learning and Planning -> Multi-Agent RL

3:21

26/08/2020

Tianzhe Wang, Kuan Wang, Han Cai and
Ji Lin, Zhijian Liu, Hanrui Wang, Yujun Lin, Song Han

Jie Zhao, Bojie Li, Wang Nie and
Zhen Geng, Renwei Zhang, Xiong Gao, Bin Cheng, Chen Wu, Yun Cheng, Zheng Li, Peng Di, Kun Zhang, Xuefeng Jin

Keywords Paper

neural networks, neural processing units, polyhedral model, code generation, auto-tuning

21:49