CVE-2025-52566

HIGH

llama.cpp < b5721 - Heap Overflow via Tokenizer Integer Overflow

Description

llama.cpp provides inference of several LLM models in C/C++. Prior to version b5721, a signed vs. unsigned integer overflow in llama.cpp's tokenizer implementation (llama_vocab::tokenize, src/llama-vocab.cpp:3036) causes the size comparison that guards token copying to behave incorrectly. As a result, carefully crafted text input can overflow a heap buffer in the llama.cpp inference engine during tokenization. This issue has been patched in version b5721.
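The class of bug described above (CWE-195, signed to unsigned conversion error) can be illustrated with a minimal sketch. This is not the llama.cpp source: the function names fits_narrowing and fits_checked and the parameters n_produced and n_capacity are hypothetical, chosen only to show how a size guard can be bypassed when an unsigned count is narrowed to a signed 32-bit value before the comparison.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical guard, NOT the actual llama.cpp code: the unsigned element
// count is narrowed to a signed 32-bit value before the comparison.
bool fits_narrowing(std::size_t n_produced, std::int32_t n_capacity) {
    // Counts above INT32_MAX wrap to a negative number on two's-complement
    // targets (implementation-defined before C++20, modular since C++20),
    // so the guard reports "fits" for a huge count.
    return static_cast<std::int32_t>(n_produced) <= n_capacity;
}

// Hypothetical corrected guard: reject a negative capacity first, then
// compare in the unsigned domain so no narrowing takes place.
bool fits_checked(std::size_t n_produced, std::int32_t n_capacity) {
    if (n_capacity < 0) return false;
    return n_produced <= static_cast<std::size_t>(n_capacity);
}

// Small self-check showing the difference between the two guards.
void self_check() {
    const std::size_t huge = std::size_t{1} << 31; // 2^31, one past INT32_MAX
    assert(fits_narrowing(huge, 16));  // broken guard is bypassed
    assert(!fits_checked(huge, 16));   // corrected guard rejects the count
    assert(fits_checked(8, 16));       // ordinary case still passes
}
```

If a copy loop trusted the first guard, it would write far more elements than the destination buffer holds. The commit listed under Core References is the authoritative fix and may differ in form from this sketch.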

References (2)

Core References

Exploit, Vendor Advisory x_refsource_confirm

https://github.com/ggml-org/llama.cpp/security/advisories/GHSA-7rxv-5jhh-j6xx

Patch x_refsource_misc

https://github.com/ggml-org/llama.cpp/commit/dd6e6d0b6a4bbe3ebfc931d1eb14db2f2b1d70af

Scores

CVSS v3 8.6

EPSS 0.0032

EPSS Percentile 23.7%

Attack Vector LOCAL

CVSS:3.1/AV:L/AC:L/PR:N/UI:R/S:C/C:H/I:H/A:H
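As a cross-check, the 8.6 base score can be reproduced from the vector with the CVSS v3.1 base-score equations. The sketch below assumes the standard v3.1 metric weights (AV:L = 0.55, AC:L = 0.77, PR:N = 0.85, UI:R = 0.62, C/I/A:H = 0.56); the function names cvss_roundup, cvss31_base, and score_for_this_vector are illustrative and do not belong to any library.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstdint>

// Roundup as described in the CVSS v3.1 specification: the smallest number
// with one decimal place that is >= x, computed via integers to avoid
// floating-point artifacts.
double cvss_roundup(double x) {
    const std::int64_t scaled = std::llround(x * 100000.0);
    if (scaled % 10000 == 0) return static_cast<double>(scaled) / 100000.0;
    return (std::floor(static_cast<double>(scaled) / 10000.0) + 1.0) / 10.0;
}

// Base score from numeric metric weights (illustrative helper).
double cvss31_base(double av, double ac, double pr, double ui,
                   bool scope_changed, double c, double i, double a) {
    const double iss = 1.0 - (1.0 - c) * (1.0 - i) * (1.0 - a);
    const double impact = scope_changed
        ? 7.52 * (iss - 0.029) - 3.25 * std::pow(iss - 0.02, 15.0)
        : 6.42 * iss;
    if (impact <= 0.0) return 0.0;
    const double exploitability = 8.22 * av * ac * pr * ui;
    const double raw = scope_changed ? 1.08 * (impact + exploitability)
                                     : impact + exploitability;
    return cvss_roundup(std::min(raw, 10.0));
}

// AV:L/AC:L/PR:N/UI:R/S:C/C:H/I:H/A:H with the assumed weights above.
double score_for_this_vector() {
    return cvss31_base(0.55, 0.77, 0.85, 0.62, true, 0.56, 0.56, 0.56);
}

// Self-check: the computed score should match the listed CVSS v3 value.
void check_score() {
    assert(std::fabs(score_for_this_vector() - 8.6) < 1e-9);
}
```

With these weights the impact sub-score is about 6.05 and the exploitability sub-score about 1.83; because scope is changed, their sum is multiplied by 1.08, giving roughly 8.51, which rounds up to 8.6.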

CISA SSVC

Vulnrichment

Exploitation poc

Automatable no

Technical Impact total

Details

CWE

CWE-119 CWE-195

Status published

Products (1)

ggml/llama.cpp < b5721

Published Jun 24, 2025

Tracked Since Feb 18, 2026