CVE-2025-24357

HIGH

vllm < 0.7.0 - Remote Code Execution via Pickle Deserialization in Model Weight Loading

Title source: llm
STIX 2.1

Description

vLLM is a library for LLM inference and serving. vllm/model_executor/weight_utils.py implements hf_model_weights_iterator to load the model checkpoint, which is downloaded from huggingface. It uses the torch.load function and the weights_only parameter defaults to False. When torch.load loads malicious pickle data, it will execute arbitrary code during unpickling. This vulnerability is fixed in v0.7.0.

Scores

CVSS v3 7.5
EPSS 0.0065
EPSS Percentile 46.1%
Attack Vector NETWORK
CVSS:3.1/AV:N/AC:H/PR:N/UI:R/S:U/C:H/I:H/A:H

CISA SSVC

Vulnrichment
Exploitation none
Automatable no
Technical Impact total

Details

CWE
CWE-502
Status published
Products (2)
pypi/vllm 0 - 0.7.0PyPI
vllm/vllm < 0.7.0
Published Jan 27, 2025
Tracked Since Feb 18, 2026