Generating Pragmatic Examples to Train Neural Program Synthesizers

Vaduguru, Saujas; Fried, Daniel; Pu, Yewen

Computer Science > Machine Learning

arXiv:2311.05740 (cs)

[Submitted on 9 Nov 2023 (v1), last revised 16 Apr 2025 (this version, v2)]

Title:Generating Pragmatic Examples to Train Neural Program Synthesizers

Authors:Saujas Vaduguru, Daniel Fried, Yewen Pu

View PDF HTML (experimental)

Abstract:Programming-by-example is the task of synthesizing a program that is consistent with a set of user-provided input-output examples. As examples are often an under-specification of one's intent, a good synthesizer must choose the intended program from the many that are consistent with the given set of examples. Prior work frames program synthesis as a cooperative game between a listener (that synthesizes programs) and a speaker (a user choosing examples), and shows that models of computational pragmatic inference are effective in choosing the user intended programs. However, these models require counterfactual reasoning over a large set of programs and examples, which is infeasible in realistic program spaces. In this paper, we propose PraX, a novel way to amortize this search with neural networks. We sample pairs of programs and examples via self-play between listener and speaker models, and use pragmatic inference to choose informative training examples from this sample. We then use the informative dataset to train models to improve the synthesizer's ability to disambiguate user-provided examples without human supervision. We validate PraX on the challenging task of synthesizing regular expressions from example strings, and find that our method (1) outperforms models trained without choosing pragmatic examples by 23% (a 51% relative increase) (2) matches the performance of supervised learning on a dataset of pragmatic examples provided by humans, despite using no human data in training.

Comments:	ICLR 2024
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Programming Languages (cs.PL)
Cite as:	arXiv:2311.05740 [cs.LG]
	(or arXiv:2311.05740v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2311.05740

Submission history

From: Saujas Vaduguru [view email]
[v1] Thu, 9 Nov 2023 20:53:00 UTC (2,477 KB)
[v2] Wed, 16 Apr 2025 18:08:02 UTC (1,906 KB)

Computer Science > Machine Learning

Title:Generating Pragmatic Examples to Train Neural Program Synthesizers

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Generating Pragmatic Examples to Train Neural Program Synthesizers

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators