CVE Tools

CVE-2026-53923

vLLM GGUF Kernels: int64_t to int truncation of tensor dimensions causes GPU buffer overflow

Published: Jun 22, 2026Updated: Jun 24, 2026 Sources: CVE List NVDCWE-200

No Known Exploits

Patch Available

Description

vLLM is an inference and serving engine for large language models (LLMs). From 0.5.5 until 0.23.1rc0, integer truncation of tensor dimensions in vLLM's GGUF dequantize kernels (csrc/quantization/gguf/gguf_kernel.cu) causes partial tensor processing. The output tensor is allocated at full size via torch::empty (uninitialized memory), but the dequantize CUDA kernel processes only a truncated number of elements. The unfilled portion of the output tensor retains whatever was previously in GPU memory. In multi-tenant inference deployments, this residual GPU memory may contain tensor data from other users' inference requests, constituting information disclosure. This vulnerability is fixed in 0.23.1rc0.

No summary for this CVE yet.

CVSS Vector Breakdown

Exploitability

AV:NAttack Vector

Network

AC:LAttack Complexity

Low

PR:NPrivileges Required

None

UI:NUser Interaction

None

Scope

S:UScope

Unchanged

Impact

C:HConfidentiality

High

I:NIntegrity

None

A:NAvailability

None

Open in CVSS calculator →

Weaknesses

CWE-200 CWE-681

Affected Products

Mixed

AI / ML / llm-serving-inference

vllm vllm-project

Mixed

AI / ML / llm-serving-inference

vllmoss-projectAI / MLaka aibrix

vllm-projectoss-projectAI / MLaka vllm, vllm-project/vllm

Exploitability

Official Patch Available

No detection template yetWe can run the check for you — a free external exposure review, no install. Check my exposure

Attack Graph

Products CVE Techniques Tactics

Click technique nodes to view MITRE ATT&CK details. Scroll to zoom, drag to pan.

MITRE ATT&CK

1 technique

Collection

View detailed technique mapping

References

https://github.com/vllm-project/vllm/commit/f219788f91952827132fa4fdf916427cd20d225e

https://github.com/vllm-project/vllm/pull/44971

https://github.com/vllm-project/vllm/security/advisories/GHSA-5jv2-g5wq-cmr4

Timeline

Published

Jun 22, 2026

Last Updated

Jun 24, 2026

Unlock Complete Vulnerability Intelligence

Get the full picture for CVE-2026-53923 and every CVE in our database. Create a free account — no credit card required.

Create Free Account

Plain-language analysis

Impact assessment and exploitation scenario in plain English

Attack graph visualization

Interactive attack path and kill chain mapping

Exploit details & PoC links

ExploitDB, Metasploit, GitHub PoCs with direct links

Nuclei scanner templates

Ready-to-use vulnerability scanner templates

Full remediation guide

Patch instructions, workarounds, and compliance impact

Interactive AI chat

Ask questions about this vulnerability in natural language

Related vulnerabilities

Semantically similar CVEs and attack patterns

REST API & MCP access

Integrate vulnerability data into your workflows