Why Lift so Heavy? Slimming Large Language Models by Cutting Off the Layers

Yuan, Shuzhou; Nie, Ercong; Ma, Bolei; Färber, Michael

Computer Science > Computation and Language

arXiv:2402.11700 (cs)

[Submitted on 18 Feb 2024 (v1), last revised 16 Apr 2025 (this version, v2)]

Title:Why Lift so Heavy? Slimming Large Language Models by Cutting Off the Layers

Authors:Shuzhou Yuan, Ercong Nie, Bolei Ma, Michael Färber

View PDF HTML (experimental)

Abstract:Large Language Models (LLMs) possess outstanding capabilities in addressing various natural language processing (NLP) tasks. However, the sheer size of these models poses challenges in terms of storage, training and inference due to the inclusion of billions of parameters through layer stacking. While traditional approaches such as model pruning or distillation offer ways for reducing model size, they often come at the expense of performance retention. In our investigation, we systematically explore the approach of reducing the number of layers in LLMs. Surprisingly, we observe that even with fewer layers, LLMs maintain similar or better performance levels, particularly in prompt-based fine-tuning for text classification tasks. Remarkably, in certain cases, models with a single layer outperform their fully layered counterparts. These findings offer valuable insights for future work aimed at mitigating the size constraints of LLMs while preserving their performance, thereby opening avenues for significantly more efficient use of LLMs.

Comments:	IJCNN 2025
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2402.11700 [cs.CL]
	(or arXiv:2402.11700v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2402.11700

Submission history

From: Shuzhou Yuan [view email]
[v1] Sun, 18 Feb 2024 20:47:10 UTC (7,950 KB)
[v2] Wed, 16 Apr 2025 18:41:28 UTC (8,674 KB)

Computer Science > Computation and Language

Title:Why Lift so Heavy? Slimming Large Language Models by Cutting Off the Layers

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Why Lift so Heavy? Slimming Large Language Models by Cutting Off the Layers

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators