CVE Tools

CVE-2026-54235

vLLM: temperature=NaN and temperature=Infinity bypass validation and propagate to GPU kernels

Published: Jun 22, 2026Updated: Jun 24, 2026 Sources: CVE List NVDCWE-1287

Description

vLLM is an inference and serving engine for large language models (LLMs). Prior to 0.23.1rc0, ll temperature validation gates use comparison operators (<, >), which silently evaluate to False for NaN and for positive Infinity in Python's IEEE 754 float semantics. Both values pass every guard and propagate to GPU sampling kernels, where they produce undefined behavior or CUDA errors that can crash the inference worker. This vulnerability is fixed in 0.23.1rc0.

No summary for this CVE yet.

CVSS Vector Breakdown

AV:NAC:LPR:NUI:NS:UC:NI:LA:L
Exploitability
AV:NAttack Vector
Network
AC:LAttack Complexity
Low
PR:NPrivileges Required
None
UI:NUser Interaction
None
Scope
S:UScope
Unchanged
Impact
C:NConfidentiality
None
I:LIntegrity
Low
A:LAvailability
Low

Weaknesses

Affected Products

vllmoss-projectAI / MLaka aibrix
vllm-projectoss-projectAI / MLaka vllm, vllm-project/vllm

Exploitability

Official Patch Available

References

Timeline

Published
Jun 22, 2026
Last Updated
Jun 24, 2026

Unlock Complete Vulnerability Intelligence

Get the full picture for CVE-2026-54235 and every CVE in our database. Create a free account — no credit card required.

Create Free Account
Plain-language analysis
Impact assessment and exploitation scenario in plain English
Attack graph visualization
Interactive attack path and kill chain mapping
Exploit details & PoC links
ExploitDB, Metasploit, GitHub PoCs with direct links
Nuclei scanner templates
Ready-to-use vulnerability scanner templates
Full remediation guide
Patch instructions, workarounds, and compliance impact
Interactive AI chat
Ask questions about this vulnerability in natural language
Related vulnerabilities
Semantically similar CVEs and attack patterns
REST API & MCP access
Integrate vulnerability data into your workflows