Neural Program Generation Modulo Static Analysis

06/12/2021

Neural Program Generation Modulo Static Analysis

Rohan Mukherjee, Yeming Wen, Dipak Chaudhari, Thomas Reps, Swarat Chaudhuri, Christopher Jermaine

Keywords: deep learning, transformers, generative model

Abstract Paper Similar Papers

Abstract: State-of-the-art neural models of source code tend to be evaluated on the generation of individual expressions and lines of code, and commonly fail on long-horizon tasks such as the generation of entire method bodies. We propose to address this deficiency using weak supervision from a static program analyzer. Our neurosymbolic method allows a deep generative model to symbolically compute, using calls to a static analysis tool, long-distance semantic relationships in the code that it has already generated. During training, the model observes these relationships and learns to generate programs conditioned on them. We apply our approach to the problem of generating entire Java methods given the remainder of the class that contains the method. Our experiments show that the approach substantially outperforms a state-of-the-art transformer and a model that explicitly tries to learn program semantics on this task, both in terms of producing programs free of basic semantic errors and in terms of syntactically matching the ground truth.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

29/06/2020

Embedding java classes with Code2vec: Improvements from variable obfuscation

Rhys Compton, Eibe Frank, Panos Patros, Abigail Koay

Keywords Paper

code2vec, machine learning, code obfuscation, source code, neural networks

0

0

0

0

14:20

12/07/2020

Structural Language Models of Code

Uri Alon, Roy Sadaka, Omer Levy, Eran Yahav

Keywords Paper

Applications - Language, Speech and Dialog

0

0

0

0

11:57

15/06/2020

Blended, precise semantic program embeddings

Ke Wang, Zhendong Su

Keywords Paper

Static and Dynamic Program Features, Attention Network, Semantic Program Embedding

0

0

0

0

15:39

04/07/2020

A Transformer-based Approach for Source Code Summarization

Wasi Ahmad, Saikat Chakraborty, Baishakhi Ray, Kai-Wei Chang

Keywords Paper

Source Summarization, summarization, ablation studies, Transformer-based Approach

0

0

0

0

6:14

04/07/2020

TAG : Type Auxiliary Guiding for Code Comment Generation

Ruichu Cai, Zhihao Liang, Boyan Xu and
zijian li, Yuexing Hao, Yao Chen

Keywords Paper

Code Generation, code task, adaptive code, TAG

0

0

0

0

11:22

23/06/2021

CompCertO: Compiling Certified Open C Components

Jérémie Koenig, Zhong Shao

Keywords Paper

Compositional Compiler Correctness, Game Semantics, Simulation Convention, Language Interface

0

0

0

0

24:57

18/07/2021

ProGraML: A Graph-based Program Representation for Data Flow Analysis and Compiler Optimizations

Chris Cummins, Zacharias Fisches, Tal Ben-Nun and
Torsten Hoefler, Michael O'Boyle, Hugh Leather

Keywords Paper

Applications, Hardware and Systems

0

0

0

0

5:01

03/05/2021

Interactive Weak Supervision: Learning Useful Heuristics for Data Labeling

Benedikt Boecking, Willie Neiswanger, Eric P Xing, Artur Dubrawski

Keywords Paper

active learning, data programming, data labeling, weak supervision

0

0

0

0

5:10

29/06/2020

An empirical study on the impact of deimplicitization on comprehension in programs using application frameworks

Jürgen Cito, Jiasi Shen, Martin Rinard

Keywords Paper

0

0

0

0

4:27

15/06/2020

Semantic code search via equational reasoning

Varot Premtoon, James Koppel, Armando Solar-Lezama

Keywords Paper

equational reasoning, code search

0

0

0

0

16:29

06/12/2021

Continual Learning via Local Module Composition

Oleksiy Ostapenko, Pau Rodriguez, Massimo Caccia, Laurent Charlin

Keywords Paper

continual learning, transfer learning

1

0

0

1

14:32

05/12/2020

Systematic generalization on gSCAN with language conditioned embedding

Tong Gao, Qi Huang, Raymond Mooney

Keywords Paper

0

0

0

0

14:19

15/11/2020

Neural Reverse Engineering of Stripped Binaries using Augmented Control Flow Graphs

Yaniv David, Uri Alon, Eran Yahav

Keywords Paper

Static Binary Analysis, Neural Reverse Engineering

0

0

0

0

14:27

04/07/2020

Deep Contextualized Self-training for Low Resource Dependency Parsing

Guy Rotman, Roi Reichart

Keywords Paper

Low Parsing, sequence tasks, Deep Self-training, Neural parsing

0

0

0

0

11:41

03/05/2021

Language-Agnostic Representation Learning of Source Code from Structure and Context

Daniel Zügner, Tobias Kirschstein, Michele Catasta and
Jure Leskovec, Stephan Günnemann

Keywords Paper

code summarization, machine learning for code

0

0

0

0

4:34

06/12/2020

Learning Sparse Prototypes for Text Generation

Junxian He, Taylor Berg-Kirkpatrick, Graham Neubig

Keywords Paper

0

0

0

0

3:22

16/11/2020

An Imitation Game for Learning Semantic Parsers from User Interaction

Ziyu Yao, Yiqi Tang, Wen-tau Yih and
Huan Sun, Yu Su

Keywords Paper

bootstrapping, fine-tuning parsers, theoretical analysis, text-to-sql problem

0

0

0

0

11:49

06/12/2021

Environment Generation for Zero-Shot Compositional Reinforcement Learning

Izzeddin Gur, Natasha Jaques, Yingjie Miao and
Jongwook Choi, Manoj Tiwari, Honglak Lee, Aleksandra Faust

Keywords Paper

reinforcement learning and planning, robustness, graph learning

0

0

0

0

8:40

03/05/2021

BUSTLE: Bottom-Up Program Synthesis Through Learning-Guided Exploration

Augustus Odena, Kensen Shi, David Bieber and
Rishabh Singh, Charles Sutton, Hanjun Dai

Keywords Paper

Program Synthesis

0

0

0

0

10:26

29/06/2020

Improved automatic summarization of subroutines via attention to file context

Sakib Haque, Alexander LeClair, Lingfei Wu, Collin McMillan

Keywords Paper

neural networks, natural language processing, documentation generation, source code summarization, artificial intelligence

0

0

0

0

16:04

26/04/2020

CLN2INV: Learning Loop Invariants with Continuous Logic Networks

Gabriel Ryan, Justin Wong, Jianan Yao and
Ronghui Gu, Suman Jana

Keywords Paper

loop invariants, deep learning, logic learning

0

0

0

0

5:12

15/06/2020

Debug information validation for optimized code

Yuanbo Li, Shuo Ding, Qirun Zhang, Davide Italiano

Keywords Paper

Optimizing Compilers, Debug Information

0

0

0

0

14:59

23/06/2021

RbSyn: Type- and Effect-Guided Program Synthesis

Sankha Narayan Guria, Jeffrey S. Foster, David Van Horn

Keywords Paper

program synthesis, type and effect systems, Ruby

0

0

0

0

12:40

15/06/2020

SCAF: A speculation-aware collaborative dependence analysis framework

Sotiris Apostolakis, Ziyang Xu, Zujun Tan and
Greg Chan, Simone Campanoni, David I. August

Keywords Paper

speculation, collaboration, dependence analysis

0

0

0

0

16:16

06/12/2020

Compositional Generalization via Neural-Symbolic Stack Machines

Xinyun Chen, Chen Liang, Adams Wei Yu and
Dawn Song, Denny Zhou

Keywords Paper

Applications -> Computer Vision; Applications -> Visual Scene Analysis and Interpretation; Deep Learning -> Adversarial Network, Deep Learning -> Generative Models

0

0

0

0

3:26

15/11/2020

Designing Types for R, Empirically

Alexi Turcotte, Aviral Goel, Filip Křikava, Jan Vitek

Keywords Paper

R, dynamic languages, type declarations

0

0

0

0

16:04

18/07/2021

LIME: Learning Inductive Bias for Primitives of Mathematical Reasoning

Yuhuai Wu, Markus Rabe, Wenda Li and
Jimmy Ba, Roger Grosse, Christian Szegedy

Keywords Paper

Deep Learning

0

0

0

0

6:18

12/07/2020

Adversarial Robustness for Code

Pavol Bielik, Martin Vechev

Keywords Paper

Adversarial Examples

0

0

0

0

15:43

14/09/2020

Self-Supervised Log Parsing

Sasho Nedelkoski, Jasmin Bogatinovski, Alexander Acker and
Jorge Cardoso, Odej Kao

Keywords Paper

representation learning, log parsing, transformers, anomaly detection, it systems

0

0

0

0

15:39

22/11/2021

Prototype-based Incremental Few-Shot Segmentation

Fabio Cermelli, Massimiliano Mancini, Yongqin Xian and
Zeynep Akata, Barbara Caputo

Keywords Paper

segmentation, incremental learning, continual learning, few shot learning, any shot learning, prototype, knowledge distillation

0

0

0

0

2:56

19/01/2020

Binders by Day, Labels by Night: Effect Instances via Lexically Scoped Handlers

Dariusz Biernacki, Maciej Piróg, Piotr Polesiuk, Filip Sieczkowski

Keywords Paper

effect handlers, logical relations, algebraic effects

0

0

0

0

21:13

03/05/2021

Learning to Recombine and Resample Data For Compositional Generalization

Ekin Akyürek, Afra Feyza Akyürek, Jacob Andreas

Keywords Paper

sequence models, language processing, compositional generalization, data augmentation, generative modeling

0

0

0

0

6:14

23/06/2021

Logical Bytecode Reduction

Christian Gram Kalhauge, Jens Palsberg

Keywords Paper

input reduction, type-safe code transformation

0

0

0

0

19:40

15/11/2020

Precise Inference of Expressive Units of Measurement Types

Tongtong Xiang, Jeff Y. Luo, Werner Dietl

Keywords Paper

Scientific computing, Pluggable type system, Dimensional analysis, Units of measurements, Type inference

0

0

0

0

13:39

23/06/2021

Incremental Whole-Program Analysis in Datalog with Lattices

Tamás Szabó, Sebastian Erdweg, Gábor Bergmann

Keywords Paper

Static Analysis, Incremental Computing, Datalog

0

0

0

0

22:53

19/08/2021

Lifting Symmetry Breaking Constraints with Inductive Logic Programming

Alice Tarzariol, Martin Gebser, Konstantin Schekotihin

Keywords Paper

Knowledge Representation and Reasoning, Leveraging Knowledge and Learning, Explainable/Interpretable Machine Learning, Constraints

0

0

0

0

14:34

19/10/2020

Feature extraction for large-scale text collections

Luke Gallagher, Antonio Mallia, J. Shane Culpepper and
Torsten Suel, B. Barla Cambazoglu

Keywords Paper

clueweb, feature index, feature extraction, feature repository, lambdamart, ltr, learning to rank, feature importance

0

0

0

0

9:41

02/02/2021

Towards Balanced Defect Prediction with Better Information Propagation

Xianda Zheng, Yuan-Fang Li, Huan Gao and
Yuncheng Hua, Guilin Qi

Keywords Paper

0

0

0

0

15:11

19/01/2020

Partial Type Constructors: Or, Making Ad Hoc Datatypes Less Ad Hoc

Mark Jones, J. Garrett Morris, Richard A. Eisenberg

Keywords Paper

Type constructors, Parametric polymorphism

0

0

0

0

21:37

18/07/2021

Training Data Subset Selection for Regression with Controlled Generalization Error

Durga S, Rishabh Iyer, Ganesh Ramakrishnan, Abir De

Keywords Paper

, Algorithms, Online Learning, Algorithms, Supervised Learning

0

0

0

0

4:15