7
/ 10
HIGH
CVSS:3.1/AV:L/AC:H/PR:N/UI:R/S:U/C:H/I:H/A:H
Description
ndaybench A benchmark for measuring whether AI agents can build working exploits from n-day patches. This repository hosts the data-acquisition pipeline that feeds the benchmark — patchwatch — along with the planning docs for the benchmark harness...
Basic Information
ID
23233E3A-13E8-50E4-9A0B-80BE686DD799
Published
May 26, 2026 at 03:04
Modified
May 26, 2026 at 03:05