CVE-2025-46560 vLLM phi4mm: Quadratic Time Complexity in Input Token Processing...

CVE-2025-46560 vLLM phi4mm: Quadratic Time Complexity in Input Token Processing leads to denial of service

April 29, 2025 invoker category_cve

Vulnerability Details

Basic Information

Title	CVE-2025-46560 vLLM phi4mm: Quadratic Time Complexity in Input Token Processing leads to denial of service
Type	cvelist
Published	2025-04-30T00:24:53
Last Seen	2025-04-30T00:58:57
CVSS Score	6.5 (MEDIUM)

CVSS v3 Details

Attack Vector	NETWORK
Attack Complexity	LOW
Privileges Required	LOW
User Interaction	NONE
Scope	UNCHANGED
Confidentiality Impact	NONE
Integrity Impact	NONE
Availability Impact	HIGH

CVE Information

CVE IDs	CVE-2025-46560
CWE	CWE-1333
Bulletin Family	cve

Description

vLLM is a high-throughput and memory-efficient inference and serving engine for LLMs. Versions starting from 0.8.0 and prior to 0.8.5 are affected by a critical performance vulnerability in the input preprocessing logic of the multimodal tokenizer. The code dynamically replaces placeholder tokens (e.g., <|audio_|>, <|image_|>) with repeated tokens based on precomputed lengths. Due to inefficient list concatenation operations, the algorithm exhibits quadratic time complexity (O(n²)), allowing malicious actors to trigger resource exhaustion via specially crafted inputs. This issue has been patched in version 0.8.5.

Impact Assessment

Base Score	6.5
Severity	MEDIUM

View full CVE details