Description
vLLM is an inference and serving engine for large language models (LLMs). In versions prior to 0.22.0, vLLM's revision pinning controls are not applied consistently to every artifact loaded for a model. A deployment that supplies --revision or --code-revision can still load dynamic code, GGUF files, image processors, retrieval side weights, or same-repository subfolder weights and configuration from an unpinned default revision. This is a supply-chain integrity issue for pinned vLLM deployments: operators may believe they are serving a reviewed model revision while vLLM resolves behavior-affecting nested or sibling artifacts from outside that revision. The issue is fixed in 0.22.0.
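The class of problem described above can be illustrated with a minimal sketch. This is not vLLM's code or its fix. ArtifactRequest, find_unpinned, the repository name, the file names, and the commit hash are all hypothetical, chosen only to show the invariant a pinned deployment expects: every artifact request carries the reviewed revision, and a request with no revision or a different one is flagged.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class ArtifactRequest:
    """One planned artifact load. All names here are hypothetical."""
    repo_id: str
    filename: str
    revision: Optional[str]  # None means "use the repository's default revision"


def find_unpinned(requests: List[ArtifactRequest], pinned: str) -> List[ArtifactRequest]:
    """Return the requests that would resolve outside the reviewed revision."""
    return [r for r in requests if r.revision != pinned]


# Made-up 40-character commit hash standing in for the reviewed revision.
PINNED = "0123456789abcdef0123456789abcdef01234567"

# Made-up request list: the main config is pinned, but two side artifacts
# were requested without the pin (one with no revision, one on a branch).
planned = [
    ArtifactRequest("example-org/example-model", "config.json", PINNED),
    ArtifactRequest("example-org/example-model", "preprocessor_config.json", None),
    ArtifactRequest("example-org/example-model", "weights.gguf", "main"),
]

unpinned = find_unpinned(planned, PINNED)
for req in unpinned:
    print(f"not pinned: {req.filename} (revision={req.revision!r})")
```

An audit of this shape only helps if it sees every load path, including nested and sibling artifacts, which is the gap the description identifies.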
Problem types
CWE-345: Insufficient Verification of Data Authenticity
Product status
References
github.com/...t/vllm/security/advisories/GHSA-3ww4-5jv9-j5gm
github.com/vllm-project/vllm/pull/42616
github.com/...ommit/d26a28ab033697f55a1414b5b0435de7cd6045b6
huntr.com/bounties/3f1e24c0-87d2-4f6c-a705-820f380879ac