The probability is the direct output of the EPSS model, and conveys an overall sense of the threat of exploitation in the wild. The percentile measures the EPSS probability relative to all known EPSS scores. Note: This data is updated daily, relying on the latest available EPSS model version. Check out the EPSS documentation for more details.
In a few clicks we can analyze your entire application and see what components are vulnerable in your application, and suggest you quick fixes.
Test your applicationsA fix was pushed into the master branch but not yet published.
vllm is an A high-throughput and memory-efficient inference and serving engine for LLMs
Affected versions of this package are vulnerable to Incorrect Conversion between Numeric Types in the ggml_dequantize, ggml_mul_mat_vec_a8, ggml_mul_mat_a8, and ggml_moe_a8 functions when tensor dimensions are truncated due to an integer overflow. An attacker can access residual GPU memory contents from previous inference requests by supplying a specially crafted model file with tensor dimensions whose product exceeds the maximum value of a 32-bit integer.
Note: This is only exploitable if the deployment is multi-tenant and loads attacker-controlled GGUF model files.