Incorrect Conversion between Numeric Types Affecting vllm package, versions [0.5.5,]


Severity

Recommended
0.0
low
0
10

CVSS assessment by Snyk's Security Team. Learn more

Threat Intelligence

EPSS
0.28% (20th percentile)

Do your applications use this vulnerable package?

In a few clicks we can analyze your entire application and see what components are vulnerable in your application, and suggest you quick fixes.

Test your applications
  • Snyk IDSNYK-PYTHON-VLLM-17372775
  • published18 Jun 2026
  • disclosed17 Jun 2026
  • creditAviral Srivastava

Introduced: 17 Jun 2026

NewCVE-2026-53923  (opens in a new tab)
CWE-681  (opens in a new tab)

How to fix?

A fix was pushed into the master branch but not yet published.

Overview

vllm is an A high-throughput and memory-efficient inference and serving engine for LLMs

Affected versions of this package are vulnerable to Incorrect Conversion between Numeric Types in the ggml_dequantize, ggml_mul_mat_vec_a8, ggml_mul_mat_a8, and ggml_moe_a8 functions when tensor dimensions are truncated due to an integer overflow. An attacker can access residual GPU memory contents from previous inference requests by supplying a specially crafted model file with tensor dimensions whose product exceeds the maximum value of a 32-bit integer.

Note: This is only exploitable if the deployment is multi-tenant and loads attacker-controlled GGUF model files.

CVSS Base Scores

version 4.0
version 3.1