CVE-2026-44223

vLLM: extract_hidden_states speculative decoding crashes server on any request with penalty parameters

Published: May 12, 2026 | Updated: Jun 22, 2026 | Sources: CVE List, NVD | CWE-131

No Known Exploits

Patch Available

Description

vLLM is an inference and serving engine for large language models (LLMs). From 0.18.0 to before 0.20.0, the extract_hidden_states speculative decoding proposer in vLLM returns a tensor with an incorrect shape after the first decode step, causing a RuntimeError that crashes the EngineCore process. The crash is triggered when any request in the batch uses sampling penalty parameters (repetition_penalty, frequency_penalty, or presence_penalty). A single request with a penalty parameter (e.g., "repetition_penalty": 1.1) is sufficient to crash the server. This vulnerability is fixed in 0.20.0.
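The two conditions in the description (an affected version and a request that uses a penalty parameter) can be checked with a short script. This is a minimal sketch, not part of vLLM: the helper names `parse_version`, `is_affected_version`, and `penalty_fields_in_use` are hypothetical, and it assumes a penalty counts as "used" when it differs from its neutral value (1.0 for `repetition_penalty`, 0.0 for the other two). It does not detect whether the extract_hidden_states proposer is actually configured, which the crash also requires.

```python
import re

# Affected range per the description: 0.18.0 <= version < 0.20.0.
AFFECTED_MIN = (0, 18, 0)
FIXED_IN = (0, 20, 0)

# Assumption: these neutral values leave each penalty disabled.
NEUTRAL_PENALTIES = {
    "repetition_penalty": 1.0,
    "frequency_penalty": 0.0,
    "presence_penalty": 0.0,
}


def parse_version(text):
    """Return the leading numeric components, e.g. '0.19.1' -> (0, 19, 1).

    Pre-release and local suffixes (such as 'rc1' or '+cu121') are ignored.
    """
    match = re.match(r"\s*v?(\d+)\.(\d+)(?:\.(\d+))?", text)
    if match is None:
        raise ValueError(f"unrecognized version string: {text!r}")
    return tuple(int(part or 0) for part in match.groups())


def is_affected_version(text):
    """True if the version falls in the affected range from the description."""
    return AFFECTED_MIN <= parse_version(text) < FIXED_IN


def penalty_fields_in_use(payload):
    """List the penalty parameters in a request payload set to a non-neutral value."""
    used = []
    for name, neutral in NEUTRAL_PENALTIES.items():
        value = payload.get(name)
        if value is not None and value != neutral:
            used.append(name)
    return used
```

For example, `is_affected_version("0.19.1")` is true and `penalty_fields_in_use({"repetition_penalty": 1.1})` returns `["repetition_penalty"]`, which matches the single-request trigger given in the description.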

CVSS Vector Breakdown

Exploitability

Attack Vector (AV:N): Network

Attack Complexity (AC:L): Low

Privileges Required (PR:L): Low

User Interaction (UI:N): None

Scope

Scope (S:U): Unchanged

Impact

Confidentiality (C:N): None

Integrity (I:N): None

Availability (A:H): High
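The metrics above combine into the vector AV:N/AC:L/PR:L/UI:N/S:U/C:N/I:N/A:H. This page does not state a CVSS version or a numeric score, so the following is only a sketch of how a base score could be derived, assuming the CVSS v3.1 base equations and metric weights. The function names are illustrative, and only the Scope: Unchanged case is covered.

```python
import math

# Metric weights from the CVSS v3.1 specification (Scope: Unchanged values for PR).
WEIGHTS = {
    "AV": {"N": 0.85, "A": 0.62, "L": 0.55, "P": 0.2},
    "AC": {"L": 0.77, "H": 0.44},
    "PR": {"N": 0.85, "L": 0.62, "H": 0.27},
    "UI": {"N": 0.85, "R": 0.62},
    "CIA": {"H": 0.56, "L": 0.22, "N": 0.0},
}


def roundup(value):
    """Round up to one decimal place, as defined in the CVSS v3.1 specification."""
    scaled = round(value * 100000)
    if scaled % 10000 == 0:
        return scaled / 100000.0
    return (math.floor(scaled / 10000) + 1) / 10.0


def base_score(vector):
    """Compute a CVSS v3.1 base score for a Scope: Unchanged vector."""
    metrics = dict(item.split(":") for item in vector.split("/"))
    if metrics["S"] != "U":
        raise ValueError("this sketch only covers Scope: Unchanged")

    # Impact sub-score from the confidentiality, integrity, availability weights.
    c, i, a = (WEIGHTS["CIA"][metrics[key]] for key in ("C", "I", "A"))
    iss = 1 - (1 - c) * (1 - i) * (1 - a)
    impact = 6.42 * iss

    # Exploitability sub-score from the four exploitability metrics.
    exploitability = 8.22
    for key in ("AV", "AC", "PR", "UI"):
        exploitability *= WEIGHTS[key][metrics[key]]

    if impact <= 0:
        return 0.0
    return roundup(min(impact + exploitability, 10))


print(base_score("AV:N/AC:L/PR:L/UI:N/S:U/C:N/I:N/A:H"))  # 6.5
```

Under these assumptions the impact sub-score is 6.42 × 0.56 ≈ 3.60 and the exploitability sub-score is about 2.84, which rounds up to 6.5. The whole impact term comes from availability, consistent with a denial-of-service crash.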

Weaknesses

CWE-131 (Incorrect Calculation of Buffer Size), CWE-704 (Incorrect Type Conversion or Cast)

Affected Products

vllm, vllm-project

Mixed

AI / ML / llm-serving-inference

vllm-project | oss-project | AI / ML | aka vllm, vllm-project/vllm

vllm | oss-project | AI / ML | aka aibrix

Exploitability

Official Patch Available

Workaround Available

MITRE ATT&CK

1 technique

Initial Access

References

https://github.com/vllm-project/vllm/security/advisories/GHSA-83vm-p52w-f9pw

https://github.com/vllm-project/vllm/pull/38610

Timeline

Published

May 12, 2026

Last Updated

Jun 22, 2026
