CVE-2025-23320

HIGH LAB

NVIDIA Triton Inference Server < 25.07 - Information Disclosure via Python Backend Shared Memory Exhaustion

Title source: llm
STIX 2.1

Exploitation Summary

EIP tracks 1 public exploit for CVE-2025-23320. PoCs published by There-was-a-bird.

AI-analyzed exploit summary This repository demonstrates CVE-2025-23320, an information leakage vulnerability in NVIDIA Triton Inference Server (CWE-209). The exploit leverages error messages containing sensitive shared memory keys to trigger a race condition, leading to an out-of-bounds write via Triton's unregister/register API.

Description

NVIDIA Triton Inference Server for Windows and Linux contains a vulnerability in the Python backend, where an attacker could cause the shared memory limit to be exceeded by sending a very large request. A successful exploit of this vulnerability might lead to information disclosure.

Exploits (1)

github WORKING POC
by There-was-a-bird · pythonpoc
https://github.com/There-was-a-bird/triton-cve-2025-23320

This repository demonstrates CVE-2025-23320, an information leakage vulnerability in NVIDIA Triton Inference Server (CWE-209). The exploit leverages error messages containing sensitive shared memory keys to trigger a race condition, leading to an out-of-bounds write via Triton's unregister/register API.

Classification
Working Poc 95%
Attack Type
Info Leak
Complexity
Moderate
Reliability
Racy
Target: NVIDIA Triton Inference Server 25.06 and earlier
No auth needed
Prerequisites: Docker environment · Network access to Triton server · Shared memory key extraction from error messages
devstral-2 · analyzed Feb 19, 2026 Full analysis →

References (3)

Core 3

Scores

CVSS v3 7.5
EPSS 0.0088
EPSS Percentile 54.1%
Attack Vector NETWORK
CVSS:3.1/AV:N/AC:L/PR:N/UI:N/S:U/C:H/I:N/A:N

CISA SSVC

Vulnrichment
Exploitation none
Automatable yes
Technical Impact partial

Lab Environment

COMMUNITY
Community Lab
docker pull nvcr.io/nvidia/tritonserver:25.06-py3

Details

CWE
CWE-209
Status published
Products (1)
nvidia/triton_inference_server < 25.07
Published Aug 06, 2025
Tracked Since Feb 18, 2026