Unauthenticated and Critical: How a LiteLLM Flaw Became a CVSS 10 Path Into AI Infrastructure
Cybersecurity

Unauthenticated and Critical: How a LiteLLM Flaw Became a CVSS 10 Path Into AI Infrastructure

A command injection bug in the LiteLLM proxy, chained with a Starlette header flaw, turns into unauthenticated remote code execution scored at a perfect CVSS 10, and CISA says it is already being exploited.

PublishedJune 9, 2026
Read time5 min read
Share

A Proxy Sitting in the Critical Path

LiteLLM has quietly become one of the most load-bearing pieces of enterprise AI plumbing. The open source proxy gives teams a single OpenAI-compatible endpoint that fans out to dozens of model providers, handling routing, rate limiting, key management and spend tracking along the way. That convenience is exactly why a flaw in it is so dangerous: the proxy holds provider credentials, sees every prompt and response, and often sits inside the trusted core of an AI platform. When the gateway is compromised, the blast radius is not one application but the whole model layer behind it.

The vulnerability now drawing CISA's attention, CVE-2026-42271, lives in the endpoints LiteLLM uses to preview a Model Context Protocol server before saving it. Those routes, the test connection and test tools list handlers, accept a full server configuration in the request body, including the command, arguments and environment fields used by the stdio transport. As Horizon3.ai put it, the endpoints accept a complete server configuration, and when invoked they spawn the supplied command as a subprocess on the proxy host with the privileges of the proxy process. That is command injection by design, not by accident.

How a Two Bug Chain Removes the Login

On its own, CVE-2026-42271 was disclosed in April as a flaw that required an authenticated user, and LiteLLM shipped a fix in version 1.83.7 that restricts the affected endpoints to admins. The real escalation came when researchers combined it with a second weakness. Horizon3.ai confirmed that when chained with CVE-2026-48710, a Starlette host header validation bypass, the authentication requirement can be bypassed entirely. Starlette versions at or below 1.0.0 fail to validate the host header correctly, which lets an attacker slip past LiteLLM's authentication layer without any credentials at all.

The combined result is the worst case for any internet-facing service. The researchers describe it plainly: the result is unauthenticated remote code execution against vulnerable LiteLLM deployments, allowing attackers to execute commands as the LiteLLM proxy process. The chained vulnerability has been assessed as CVSS 10.0 Critical, a score reserved for flaws that need no login, no API key and no prior foothold. Horizon3.ai validated the path on June 1, and within days the technique was circulating widely enough for CISA to act on evidence of exploitation in the wild.

What an Attacker Walks Away With

Code execution on the proxy is not the end of the story, it is the beginning. Because LiteLLM brokers access to every model a team uses, an attacker who lands on the host can siphon the provider credentials and API keys the proxy stores, read the prompts and completions flowing through it, and pivot into whatever internal systems the gateway is wired to. In many deployments that includes vector databases, retrieval pipelines and orchestration services that were considered safe because they sat behind the proxy. The single most trusted node in the AI stack becomes the single best place to start an intrusion.

This is the uncomfortable shape of AI security in 2026. The components that make agentic systems convenient, shared gateways, MCP servers, tool-calling endpoints, are precisely the components that expand the attack surface. We have written before about agents that can be hijacked through their tools, and this is the infrastructure equivalent. A feature meant to let developers preview an MCP server became a remote shell, and the abstraction that was supposed to simplify model access became the fastest route to the credentials behind it.

CISA Moves and the Clock Starts

CISA added CVE-2026-42271 to its Known Exploited Vulnerabilities catalog on June 8, alongside a Check Point Security Gateway authentication flaw, and directed federal civilian agencies to remediate by June 22, 2026. The KEV listing is the government's strongest signal that a vulnerability is not theoretical: it is reserved for bugs with confirmed active exploitation, and under Binding Operational Directive 22-01 it triggers a hard patching deadline for the federal enterprise. When a flaw with a CVSS 10 chain lands on that list, the implicit message to the private sector is that the window for quiet patching has already closed.

The remediation itself is not complicated, which makes inaction harder to defend. Fixes exist in LiteLLM 1.83.7 and later, which add authorization controls so that only users with the proxy admin role can reach the affected endpoints, and in Starlette 1.0.1, which closes the host header bypass. The catch is dependency hygiene: many LiteLLM deployments pull Starlette in transitively, so teams that patched only the proxy may still carry the vulnerable web framework underneath it. Patching one without the other leaves the chain intact.

What We Would Do This Week

Our advice to security and platform teams is to treat the June 22 federal deadline as a personal one. Inventory every LiteLLM instance, including the shadow deployments that data science teams stood up without telling anyone, and confirm both the proxy and the Starlette dependency are on fixed versions. Until that is done, the affected MCP test endpoints should not be reachable from untrusted networks, and the proxy should never be exposed directly to the internet. Rotating any provider credentials the proxy has touched is prudent, because a stolen key remains valid long after the host is patched.

Beyond the immediate fix, this incident is a prompt to govern AI infrastructure the way we govern any other privileged system. Gateways that hold credentials and execute tool calls deserve network segmentation, least-privilege service accounts, egress monitoring and a real software bill of materials that surfaces transitive dependencies like Starlette. The agentic stack is moving faster than most security programs, and CVE-2026-42271 is a reminder that the convenience layer and the attack surface are now the same layer.

Tagged#news#security#ai-security#cisa#zero-day#litellm#rce