Allocation of Resources Without Limits or Throttling in py3.10-vllm-cuda-12.6 | CVE-2025-32381

Q: How to fix?

Upgrade Chainguard py3.10-vllm-cuda-12.6 to version 0.8.4-r0 or higher.

Threat Intelligence

0.04% (13^th percentile)

Do your applications use this vulnerable package?

In a few clicks we can analyze your entire application and see what components are vulnerable in your application, and suggest you quick fixes.

Test your applications

Snyk Learn

Learn about Allocation of Resources Without Limits or Throttling vulnerabilities in an interactive lesson.

Start learning

Snyk IDSNYK-CHAINGUARDLATEST-PY310VLLMCUDA126-9692975
published15 Apr 2025
disclosed9 Apr 2025

Report a new vulnerability Found a mistake?

Introduced: 9 Apr 2025

NewCVE-2025-32381 (opens in a new tab) CWE-770 (opens in a new tab)

How to fix?

Upgrade Chainguard py3.10-vllm-cuda-12.6 to version 0.8.4-r0 or higher.

NVD Description

Note: Versions mentioned in the description apply only to the upstream py3.10-vllm-cuda-12.6 package and not the py3.10-vllm-cuda-12.6 package as distributed by Chainguard. See How to fix? for Chainguard relevant fixed versions and status.

XGrammar is an open-source library for efficient, flexible, and portable structured generation. Prior to 0.1.18, Xgrammar includes a cache for compiled grammars to increase performance with repeated use of the same grammar. This cache is held in memory. Since the cache is unbounded, a system making use of xgrammar can be abused to fill up a host's memory and case a denial of service. For example, sending many small requests to an LLM inference server with unique JSON schemas would eventually cause this denial of service to occur. This vulnerability is fixed in 0.1.18.

Allocation of Resources Without Limits or Throttling Affecting py3.10-vllm-cuda-12.6 package, versions <0.8.4-r0

Severity