The Hitchhiker's Guide to Program Analysis, Part II: Deep Thoughts by LLMs

Li, Haonan; Zhang, Hang; Pei, Kexin; Qian, Zhiyun

Computer Science > Software Engineering

arXiv:2504.11711 (cs)

[Submitted on 16 Apr 2025 (v1), last revised 17 Apr 2025 (this version, v2)]

Abstract: Static analysis is a cornerstone for software vulnerability detection, yet it often struggles with the classic precision-scalability trade-off. In practice, such tools often produce high false positive rates, particularly in large codebases like the Linux kernel. This imprecision can arise from simplified vulnerability modeling and over-approximation of path and data constraints. While large language models (LLMs) show promise in code understanding, their naive application to program analysis yields unreliable results due to inherent reasoning limitations. We introduce BugLens, a post-refinement framework that significantly improves static analysis precision. BugLens guides an LLM to follow traditional analysis steps by assessing buggy code patterns for security impact and validating the constraints associated with static warnings. Evaluated on real-world Linux kernel bugs, BugLens raises precision from 0.10 (raw) and 0.50 (semi-automated refinement) to 0.72, substantially reducing false positives and revealing four previously unreported vulnerabilities. Our results suggest that a structured LLM-based workflow can meaningfully enhance the effectiveness of static analysis tools.
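
To make the reported precision figures concrete, the short sketch below converts each precision value from the abstract into the number of false positives a reviewer would see per true positive. It assumes the standard definition precision = TP / (TP + FP), which the abstract does not spell out. The function name and the stage labels are illustrative, not taken from the paper, and the underlying warning counts are not given in the abstract, so only the ratio is computed.

```python
def false_positives_per_true_positive(precision: float) -> float:
    """Return FP / TP for a given precision.

    Assumes precision = TP / (TP + FP), which rearranges to
    FP / TP = (1 - precision) / precision.
    """
    if not 0.0 < precision <= 1.0:
        raise ValueError("precision must be in (0, 1]")
    return (1.0 - precision) / precision


# Precision values as reported in the abstract; the labels are illustrative.
reported_precision = {
    "raw static analysis": 0.10,
    "semi-automated refinement": 0.50,
    "BugLens": 0.72,
}

for stage, precision in reported_precision.items():
    ratio = false_positives_per_true_positive(precision)
    print(f"{stage}: {ratio:.2f} false positives per true positive")
```

Under that definition, a precision of 0.10 corresponds to about nine false positives for every true positive, 0.50 to one, and 0.72 to roughly 0.39.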

Subjects:	Software Engineering (cs.SE); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2504.11711 [cs.SE]
	(or arXiv:2504.11711v2 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2504.11711

Submission history

From: Haonan Li
[v1] Wed, 16 Apr 2025 02:17:06 UTC (5,134 KB)
[v2] Thu, 17 Apr 2025 02:28:35 UTC (5,333 KB)
