Replicating ReLM Results: Validating Large Language Models with ReLM

Adamson, Reece; Song, Erin

Computer Science > Computation and Language

arXiv:2504.12357 (cs)

[Submitted on 16 Apr 2025]

Title:Replicating ReLM Results: Validating Large Language Models with ReLM

Authors:Reece Adamson, Erin Song

View PDF HTML (experimental)

Abstract:Validating Large Language Models with ReLM explores the application of formal languages to evaluate and control Large Language Models (LLMs) for memorization, bias, and zero-shot performance. Current approaches for evaluating these types behavior are often slow, imprecise, costly, or introduce biases of their own, but are necessary due to the importance of this behavior when productionizing LLMs. This project reproduces key results from the original ReLM paper and expounds on the approach and applications with an emphasis on the relevance to the field of systems for machine learning.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2504.12357 [cs.CL]
	(or arXiv:2504.12357v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2504.12357

Submission history

From: Reece Adamson [view email]
[v1] Wed, 16 Apr 2025 02:58:48 UTC (959 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2025-04

Change to browse by:

References & Citations

export BibTeX citation

Computer Science > Computation and Language

Title:Replicating ReLM Results: Validating Large Language Models with ReLM

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Replicating ReLM Results: Validating Large Language Models with ReLM

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators