Home

Description

llama.cpp is an inference of several LLM models in C/C++. Prior to version b8492, the RPC backend's deserialize_tensor() skips all bounds validation when a tensor's buffer field is 0. An unauthenticated attacker can read and write arbitrary process memory via crafted GRAPH_COMPUTE messages. Combined with pointer leaks from ALLOC_BUFFER/BUFFER_GET_BASE, this gives full ASLR bypass and remote code execution. No authentication required, just TCP access to the RPC server port. This issue has been patched in version b8492.

PUBLISHED Reserved 2026-03-25 | Published 2026-04-01 | Updated 2026-04-02 | Assigner GitHub_M




CRITICAL: 9.8CVSS:3.1/AV:N/AC:L/PR:N/UI:N/S:U/C:H/I:H/A:H

Problem types

CWE-119: Improper Restriction of Operations within the Bounds of a Memory Buffer

Product status

< b8492
affected

References

github.com/...ma.cpp/security/advisories/GHSA-j8rj-fmpv-wcxw

github.com/ggml-org/llama.cpp/pull/20908

github.com/...ommit/39bf0d3c6a95803e0f41aaba069ffbee26721042

cve.org (CVE-2026-34159)

nvd.nist.gov (CVE-2026-34159)

Download JSON