Incorrect Conversion between Numeric Types in vllm | CVE-2026-53923

Q: How to fix?

A fix was pushed into the master branch but not yet published.

Threat Intelligence

0.28% (20^th percentile)

Do your applications use this vulnerable package?

In a few clicks we can analyze your entire application and see what components are vulnerable in your application, and suggest you quick fixes.

Test your applications

Snyk IDSNYK-PYTHON-VLLM-17372775
published18 Jun 2026
disclosed17 Jun 2026
creditAviral Srivastava

Report a new vulnerability Found a mistake?

Introduced: 17 Jun 2026

NewCVE-2026-53923 (opens in a new tab) CWE-681 (opens in a new tab)

How to fix?

A fix was pushed into the master branch but not yet published.

Overview

vllm is an A high-throughput and memory-efficient inference and serving engine for LLMs

Affected versions of this package are vulnerable to Incorrect Conversion between Numeric Types in the ggml_dequantize, ggml_mul_mat_vec_a8, ggml_mul_mat_a8, and ggml_moe_a8 functions when tensor dimensions are truncated due to an integer overflow. An attacker can access residual GPU memory contents from previous inference requests by supplying a specially crafted model file with tensor dimensions whose product exceeds the maximum value of a 32-bit integer.

Note: This is only exploitable if the deployment is multi-tenant and loads attacker-controlled GGUF model files.

References

CVSS Base Scores

version 4.0

version 3.1

Attack Vector (AV)
Network
Attack Complexity (AC)
Low
Attack Requirements (AT)
Present
Privileges Required (PR)
None
User Interaction (UI)
Passive

Confidentiality (VC)
Low
Integrity (VI)
Low
Availability (VA)
Low

Confidentiality (SC)
None
Integrity (SI)
None
Availability (SA)
None

Incorrect Conversion between Numeric Types Affecting vllm package, versions [0.5.5,]

Severity