Simplifying Graph Transformers

Ma, Liheng; Pal, Soumyasundar; Zhang, Yingxue; Torr, Philip H. S.; Coates, Mark

arXiv:2504.12588 (cs)

[Submitted on 17 Apr 2025]

Abstract: Transformers have attained outstanding performance across various modalities, employing scaled-dot-product (SDP) attention mechanisms. Researchers have attempted to migrate Transformers to graph learning, but most advanced Graph Transformers are designed with major architectural differences, either integrating message-passing or incorporating sophisticated attention mechanisms. These complexities prevent the easy adoption of Transformer training advances. We propose three simple modifications to the plain Transformer to render it applicable to graphs without introducing major architectural distortions. Specifically, we advocate for the use of (1) simplified $L_2$ attention to measure the magnitude closeness of tokens; (2) adaptive root-mean-square normalization to preserve token magnitude information; and (3) a relative positional encoding bias with a shared encoder. Significant performance gains across a variety of graph datasets justify the effectiveness of our proposed modifications. Furthermore, empirical evaluation on the expressiveness benchmark reveals noteworthy realized expressiveness in the graph isomorphism.
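
To make the idea of distance-based attention with an additive positional bias concrete, here is a minimal sketch in plain Python. It is not the authors' formulation: the abstract does not give the exact equations, so the negative squared $L_2$ distance logit, the 1/sqrt(d) scaling, the function name `l2_attention`, and the `bias` argument are all assumptions chosen for illustration only.

```python
import math


def l2_attention(queries, keys, values, bias=None):
    """Illustrative attention with negative squared L2 distance logits.

    queries, keys, values: lists of float vectors, one per token.
    bias: optional n x n list of additive logits, standing in for a
          relative positional encoding bias (hypothetical interface).
    """
    d = len(queries[0])
    scale = 1.0 / math.sqrt(d)  # assumed scaling, borrowed from SDP attention
    outputs = []
    for i, q in enumerate(queries):
        logits = []
        for j, k in enumerate(keys):
            # Tokens that are close in L2 distance get a larger logit.
            sq_dist = sum((a - b) ** 2 for a, b in zip(q, k))
            logit = -sq_dist * scale
            if bias is not None:
                logit += bias[i][j]
            logits.append(logit)
        # Numerically stable softmax over the keys.
        m = max(logits)
        exps = [math.exp(x - m) for x in logits]
        total = sum(exps)
        weights = [e / total for e in exps]
        # Weighted sum of the value vectors.
        dim_v = len(values[0])
        outputs.append(
            [sum(w * v[c] for w, v in zip(weights, values)) for c in range(dim_v)]
        )
    return outputs


# Two nearby tokens and one distant token: the distant one is almost ignored.
tokens = [[0.0, 0.0], [0.0, 0.1], [5.0, 5.0]]
mixed = l2_attention(tokens, tokens, tokens)
```

Unlike a dot-product score, this logit depends on the difference between vectors, so two tokens with the same direction but very different magnitudes are treated as far apart. That is one plausible reading of "magnitude closeness" in the abstract, offered here as an interpretation rather than a statement of the paper's method.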

Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL)
Cite as:	arXiv:2504.12588 [cs.LG]
	(or arXiv:2504.12588v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2504.12588

Submission history

From: Liheng Ma
[v1] Thu, 17 Apr 2025 02:06:50 UTC (3,057 KB)
