A Comprehensive Survey of Reward Models: Taxonomy, Applications, Challenges, and Future

Zhong, Jialun; Shen, Wei; Li, Yanzeng; Gao, Songyang; Lu, Hua; Chen, Yicheng; Zhang, Yang; Zhou, Wei; Gu, Jinjie; Zou, Lei

Computer Science > Computation and Language

arXiv:2504.12328 (cs)

[Submitted on 12 Apr 2025]

Title:A Comprehensive Survey of Reward Models: Taxonomy, Applications, Challenges, and Future

Authors:Jialun Zhong, Wei Shen, Yanzeng Li, Songyang Gao, Hua Lu, Yicheng Chen, Yang Zhang, Wei Zhou, Jinjie Gu, Lei Zou

View PDF HTML (experimental)

Abstract:Reward Model (RM) has demonstrated impressive potential for enhancing Large Language Models (LLM), as RM can serve as a proxy for human preferences, providing signals to guide LLMs' behavior in various tasks. In this paper, we provide a comprehensive overview of relevant research, exploring RMs from the perspectives of preference collection, reward modeling, and usage. Next, we introduce the applications of RMs and discuss the benchmarks for evaluation. Furthermore, we conduct an in-depth analysis of the challenges existing in the field and dive into the potential research directions. This paper is dedicated to providing beginners with a comprehensive introduction to RMs and facilitating future studies. The resources are publicly available at github\footnote{this https URL}.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2504.12328 [cs.CL]
	(or arXiv:2504.12328v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2504.12328

Submission history

From: Jialun Zhong [view email]
[v1] Sat, 12 Apr 2025 16:07:36 UTC (509 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2025-04

Change to browse by:

cs
cs.AI

References & Citations

export BibTeX citation

Computer Science > Computation and Language

Title:A Comprehensive Survey of Reward Models: Taxonomy, Applications, Challenges, and Future

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:A Comprehensive Survey of Reward Models: Taxonomy, Applications, Challenges, and Future

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators