NVD Vulnerability Detail

CVE-2026-44223

Summary	vLLM is an inference and serving engine for large language models (LLMs). From 0.18.0 to before 0.20.0, the extract_hidden_states speculative decoding proposer in vLLM returns a tensor with an incorrect shape after the first decode step, causing a RuntimeError that crashes the EngineCore process. The crash is triggered when any request in the batch uses sampling penalty parameters (repetition_penalty, frequency_penalty, or presence_penalty). A single request with a penalty parameter (e.g., "repetition_penalty": 1.1) is sufficient to crash the server. This vulnerability is fixed in 0.20.0.
Publication Date	May 13, 2026, 5:16 a.m.
Registration Date	May 15, 2026, 4:18 a.m.
Last Update	May 16, 2026, 12:16 a.m.
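The summary above names three request fields that trigger the crash. As a rough illustration only, the sketch below shows how a gateway in front of an unpatched server might detect or drop those fields until the upgrade to 0.20.0 is applied. The helper names (`active_penalties`, `strip_penalties`) and the sample payload are hypothetical and are not part of vLLM or the advisory. The neutral values used (1.0 for repetition_penalty, 0.0 for the other two) are the usual no-op settings; whether a neutral value still reaches the affected code path is not stated in the advisory, so the stripping helper removes the keys outright.

```python
# Hypothetical gateway-side check; not part of vLLM or the advisory.
# Field names come from the CVE summary; everything else is an assumption.
NEUTRAL_PENALTIES = {
    "repetition_penalty": 1.0,
    "frequency_penalty": 0.0,
    "presence_penalty": 0.0,
}


def active_penalties(payload: dict) -> dict:
    """Return the penalty fields in payload that differ from their neutral value."""
    found = {}
    for name, neutral in NEUTRAL_PENALTIES.items():
        value = payload.get(name)
        if value is not None and float(value) != neutral:
            found[name] = value
    return found


def strip_penalties(payload: dict) -> dict:
    """Return a copy of payload with all penalty fields removed."""
    return {k: v for k, v in payload.items() if k not in NEUTRAL_PENALTIES}


# Sample payload mirroring the example in the summary.
request = {"model": "example-model", "prompt": "Hello", "repetition_penalty": 1.1}
print(active_penalties(request))
print(strip_penalties(request))
```

Dropping the fields changes sampling behavior for the caller, so a filter like this is at most a stopgap; the fix listed in this record is the upgrade to 0.20.0.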

CVSS3.1 : MEDIUM
Score	6.5
Vector	CVSS:3.1/AV:N/AC:L/PR:L/UI:N/S:U/C:N/I:N/A:H
Attack Vector (AV)	Network
Attack Complexity (AC)	Low
Privileges Required (PR)	Low
User Interaction (UI)	None
Scope (S)	Unchanged
Confidentiality Impact (C)	None
Integrity Impact (I)	None
Availability Impact (A)	High
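The score of 6.5 can be reproduced from the vector string. The sketch below applies the CVSS v3.1 base score equations and metric weights from the FIRST specification; the function names are chosen here for illustration, and temporal and environmental metrics are not handled.

```python
import math

# Metric weights from the CVSS v3.1 specification (base metrics only).
WEIGHTS = {
    "AV": {"N": 0.85, "A": 0.62, "L": 0.55, "P": 0.2},
    "AC": {"L": 0.77, "H": 0.44},
    "UI": {"N": 0.85, "R": 0.62},
    "C": {"H": 0.56, "L": 0.22, "N": 0.0},
    "I": {"H": 0.56, "L": 0.22, "N": 0.0},
    "A": {"H": 0.56, "L": 0.22, "N": 0.0},
}
# Privileges Required depends on whether Scope is changed.
PR_UNCHANGED = {"N": 0.85, "L": 0.62, "H": 0.27}
PR_CHANGED = {"N": 0.85, "L": 0.68, "H": 0.5}


def roundup(value: float) -> float:
    """Round up to one decimal place as defined in the v3.1 specification."""
    scaled = round(value * 100000)
    if scaled % 10000 == 0:
        return scaled / 100000.0
    return (math.floor(scaled / 10000) + 1) / 10.0


def base_score(vector: str) -> float:
    """Compute the CVSS v3.1 base score for a vector string."""
    prefix, *parts = vector.split("/")
    if prefix != "CVSS:3.1":
        raise ValueError("expected a CVSS:3.1 vector")
    m = dict(part.split(":") for part in parts)
    changed = m["S"] == "C"
    pr = (PR_CHANGED if changed else PR_UNCHANGED)[m["PR"]]
    iss = 1 - (
        (1 - WEIGHTS["C"][m["C"]])
        * (1 - WEIGHTS["I"][m["I"]])
        * (1 - WEIGHTS["A"][m["A"]])
    )
    if changed:
        impact = 7.52 * (iss - 0.029) - 3.25 * (iss - 0.02) ** 15
    else:
        impact = 6.42 * iss
    exploitability = 8.22 * WEIGHTS["AV"][m["AV"]] * WEIGHTS["AC"][m["AC"]] * pr * WEIGHTS["UI"][m["UI"]]
    if impact <= 0:
        return 0.0
    total = impact + exploitability
    if changed:
        total *= 1.08
    return roundup(min(total, 10))


print(base_score("CVSS:3.1/AV:N/AC:L/PR:L/UI:N/S:U/C:N/I:N/A:H"))  # 6.5
```

For this vector only availability contributes to impact (6.42 × 0.56 ≈ 3.60), and exploitability is about 2.84, which sums to roughly 6.43 and rounds up to 6.5.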

Affected software configurations

Configuration 1		or higher	or less	more than	less than
cpe:2.3:a:vllm:vllm:*:*:*:*:*:*:*:*		0.18.0			0.20.0
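The configuration row above describes a half-open range: 0.18.0 or higher, and less than 0.20.0. A minimal sketch of that check follows, assuming plain dotted release numbers; the helper names are illustrative, and pre-release or local suffixes (for example an `rc1` or `+cu121` tag) are not handled.

```python
def parse_version(text: str) -> tuple:
    """Parse a plain dotted release such as '0.19.1' into a tuple of ints."""
    parts = [int(part) for part in text.strip().split(".")]
    parts += [0] * (3 - len(parts))  # pad '0.18' to (0, 18, 0)
    return tuple(parts)


AFFECTED_FROM = parse_version("0.18.0")  # inclusive ("or higher")
FIXED_IN = parse_version("0.20.0")       # exclusive ("less than")


def is_affected(version: str) -> bool:
    """Return True if version falls in the affected range of this record."""
    return AFFECTED_FROM <= parse_version(version) < FIXED_IN


for candidate in ("0.17.1", "0.18.0", "0.19.1", "0.20.0"):
    print(candidate, is_affected(candidate))
```

On a live host the installed version string could be read with `importlib.metadata.version("vllm")` from the standard library, though any suffix it carries would need to be stripped before calling this helper.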

Related information, measures and tools

No	URL	refsource
1	https://github.com/vllm-project/vllm/pull/38610	security-advisories@github.com
2	https://github.com/vllm-project/vllm/security/advisories/GHSA-83vm-p52w-f9pw	security-advisories@github.com
3	https://github.com/vllm-project/vllm/pull/38610	134c704f-9b21-4f2e-91b3-4a467353bcc0
4	https://github.com/vllm-project/vllm/security/advisories/GHSA-83vm-p52w-f9pw	134c704f-9b21-4f2e-91b3-4a467353bcc0

Common Vulnerabilities List

No	CWE	Name	URL
1	CWE-131	Incorrect Calculation of Buffer Size	https://cwe.mitre.org/data/definitions/131.html
2	CWE-704	Incorrect Type Conversion or Cast	https://cwe.mitre.org/data/definitions/704.html

JVN Vulnerability Information

Multiple vulnerabilities in vLLM

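Title	Multiple vulnerabilities in vLLM
Summary	vLLM is an inference and serving engine for large language models (LLMs). In versions before 0.20.0, the extract_hidden_states speculative decoding proposer in vLLM returned a tensor with an incorrect shape after the first decode step, causing a RuntimeError that crashed the EngineCore process. The crash occurred when any request in the batch used sampling penalty parameters (repetition_penalty, frequency_penalty, or presence_penalty). A single request with a penalty parameter (e.g., "repetition_penalty": 1.1) was sufficient to crash the server. This vulnerability is fixed in 0.20.0.
Possible impacts	Information handled by the software is not disclosed to outside parties, and it is not altered. The software may stop completely. The impact of an attack exploiting this vulnerability does not extend to other software.
Solution	An official fix has been released. Refer to the vendor information and apply the appropriate measures.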
Publication Date	May 12, 2026, midnight
Registration Date	May 18, 2026, 12:12 p.m.
Last Update	May 18, 2026, 12:12 p.m.

Affected System

vLLM

vLLM 0.18.0 or later and earlier than 0.20.0

CVE (Common Vulnerabilities and Exposures)

No	Name	URL
Common Vulnerabilities and Exposures (CVE)
1	CVE-2026-44223	https://www.cve.org/CVERecord?id=CVE-2026-44223
National Vulnerability Database (NVD)
2	CVE-2026-44223	https://nvd.nist.gov/vuln/detail/CVE-2026-44223

CWE (Common Weakness Enumeration)

No	Name	URL
JVNDB
1	CWE-131	https://cwe.mitre.org/data/definitions/131.html
2	CWE-704	https://cwe.mitre.org/data/definitions/704.html

Vendor Information

No	Name	URL
GitHub
1	extract_hidden_states speculative decoding crashes server on any request with penalty parameters Advisory vllm-project/vllm GitHub	https://github.com/vllm-project/vllm/security/advisories/GHSA-83vm-p52w-f9pw

Other

No	Name	URL
Related Documents
1	[Spec Decode] fix returning size mismatch on extract hidden states proposer by zzaebok Pull Request #38610 vllm-project/vllm GitHub	https://github.com/vllm-project/vllm/pull/38610

Change Log

No	Changed Details	Date of change
1	[May 18, 2026] Published	May 18, 2026, 12:12 p.m.