9.8
/ 10
CRITICAL
CVSS:3.1/AV:N/AC:L/PR:N/UI:N/S:U/C:H/I:H/A:H
Description
llama.cpp is an inference of several LLM models in C/C++. Prior to version b8492, the RPC backend's deserialize_tensor() skips all bounds validation when a tensor's buffer field is 0. An unauthenticated attacker can read and write arbitrary process memory via crafted GRAPH_COMPUTE messages. Combined with pointer leaks from ALLOC_BUFFER/BUFFER_GET_BASE, this gives full ASLR bypass and remote code execution. No authentication required, just TCP access to the RPC server port. This issue has been patched in version b8492.
Basic Information
ID
CVE-2026-34159
Source
GitHub_M
Published
Apr 1, 2026 at 16:59
Modified
Apr 2, 2026 at 03:56
Affected Product
Vendor
ggml-org
Product
llama.cpp
Version
< b8492
Affected Versions
ggml-org llama.cpp < b8492