sakutto
Generative AI· GPT-5.5-Cyber

What Is OpenAI Daybreak? GPT-5.5-Cyber and AI Cyber Defense

OpenAICybersecurityGPT-5.5-Cyber

What Is OpenAI Daybreak (a cyber defense initiative)

The four pillars of Daybreak

Codex SecurityFinds vulnerabilities next to the developer, builds patches, and verifies them
GPT-5.5-CyberA dedicated model specialized for cyber defense, more permissive and more capable
Partner ProgramA framework letting security companies deliver defensive capability in their own products
Patch the PlanetSupports open source from finding vulnerabilities all the way to fixing them

OpenAI Daybreak is a cybersecurity initiative run by OpenAI, and its expansion was announced on June 22, 2026. What stands out is that it goes beyond finding vulnerabilities to accelerate the whole flow from validation to remediation with AI. First, let's lay out Daybreak's aim and overall picture, and why now.

The definition of Daybreak and the shift from "finding" to "fixing"

Daybreak is OpenAI's comprehensive cybersecurity initiative that tries to use AI to accelerate vulnerabilities from "finding" through to "fixing." Historically, finding serious vulnerabilities (software flaws) was itself hard, requiring rare expertise and time. But as frontier models (the most advanced large-scale AI) accelerated discovery, defenders are now overwhelmed by the sheer number of vulnerabilities found, and OpenAI explains that the bottleneck has moved from "finding" to "fixing."

View official source →
"We're expanding Daybreak to help democratize patching vulnerable software at machine speed." — From the Daybreak announcement

OpenAI goes so far as to say that vulnerability reports on their own protect no one. The value emerges only when you validate the issue, understand its impact, develop and test a patch, coordinate disclosure, and help deploy the fix. Daybreak invests in these latter steps.

View official source →
"Vulnerability reports, on their own, do not protect anyone. The value comes from validating the issue, understanding its impact, developing and testing a patch, coordinating disclosure, and helping teams deploy the fix." — From the Daybreak announcement

The four elements that make up Daybreak

Daybreak is not a single product but a collection of efforts. As the figure above shows, the four pillars are the developer-facing Codex Security, the dedicated GPT-5.5-Cyber model, the Daybreak Cyber Partner Program with security companies, and the open-source Patch the Planet. All four point toward the same goal: carrying a discovered vulnerability safely to a fix under human control.

OpenAI argues that such defensive capability should not be concentrated in a few hands. Since software touches every aspect of life — from critical infrastructure to business apps and government networks — defenders in particular should have broad access to these capabilities. Daybreak is a framework built around that "democratize" idea.

Why use AI for cyber defense now

OpenAI's view is that AI has changed the physics of cybersecurity. As models became able to read large codebases, reason through attack paths, and validate hypotheses, problems that used to stay hidden now surface one after another. How quickly the flood of discovered vulnerabilities can be fixed is the new front line.

This move of generative AI into such specialized territory is continuous with the evolution of conversational services. To grasp the big picture of AI and the differences between major models, the ChatGPT (GPT-5) guide and the guide to Claude (Anthropic's generative AI) are also useful references.

What Is GPT-5.5-Cyber (performance and benchmarks)

GPT-5.5-Cyber vs GPT-5.5 benchmark comparison (official figures, %)

GPT-5.5-Cyber GPT-5.5 (general)
CyberGym (reproducing known vulnerabilities)
GPT-5.5-Cyber85.6
GPT-5.581.8
ExploitGym (turning vulnerabilities into working exploits)
GPT-5.5-Cyber39.5
GPT-5.525.95
SEC-bench Pro (long-horizon discovery and PoC generation)
GPT-5.5-Cyber69.8
GPT-5.563.1

At the core of Daybreak is GPT-5.5-Cyber, a model specialized for cyber defense. Let's look at its positioning, its official benchmarks, and the conditions for using it.

GPT-5.5-Cyber's positioning (capability paired with permissiveness)

GPT-5.5-Cyber is a model that has become both more permissive and more capable for authorized, advanced cybersecurity work. The initial preview mainly aimed to reduce unnecessary refusals in specialized work; this update goes further and strengthens its ability to help find and fix vulnerabilities. It keeps GPT-5.5's general-purpose intelligence and its ability to handle long, complex tasks, while being optimized for security use.

It is designed to support the whole "flow to a fix": identifying security-relevant parts of a large codebase, tracing whether vulnerable code is reachable, validating likely issues in controlled environments, developing and testing patches, and preparing evidence for human review.

CyberGym, ExploitGym, and SEC-bench Pro scores

The figures OpenAI published are clear. On CyberGym, which measures reproducing known vulnerabilities, GPT-5.5-Cyber scored 85.6%, beating the general-purpose GPT-5.5 (81.8%). OpenAI calls it the highest single-model score it has measured.

View official source →
"the updated GPT-5.5-Cyber reached 85.6% in single-model evaluations, compared with 81.8% for GPT-5.5. This is the highest CyberGym score we have measured from a single model." — From the Daybreak announcement

It also beat GPT-5.5 on two more practical benchmarks. On ExploitGym, which measures whether known vulnerabilities can be turned into working exploits, the result was 39.5% versus 25.95%; on SEC-bench Pro, which evaluates long-horizon vulnerability discovery and PoC (proof-of-concept) generation against complex software, it reached 69.8% versus 63.1%.

View official source →
"GPT-5.5-Cyber also outperformed GPT-5.5 on two demanding real-world security benchmarks: 39.5% versus 25.95% on ExploitGym ... On SEC-bench Pro ... GPT-5.5-Cyber reached 69.8%, compared with 63.1% for GPT-5.5." — From the Daybreak announcement

That said, OpenAI itself adds that benchmarks are only one part of the story. What matters in practice is whether a model can find real vulnerabilities, separate them from noise, and land fixes safely. It is realistic to read the numbers as one indicator of capability.

Conditions for use (limited to verified defenders)

For all its capability, GPT-5.5-Cyber is not something anyone can freely use. For most defenders, GPT-5.5 with Trusted Access for Cyber and Codex Security is the starting point, and GPT-5.5-Cyber is offered only to verified defenders.

View official source →
"For most defenders, GPT-5.5 with Trusted Access for Cyber and Codex Security remains the right starting point. GPT-5.5-Cyber is intended for verified defenders whose authorized work requires our most advanced cyber capabilities and more permissive behavior, paired with stronger verification, monitoring, scoped controls, and review." — From the Daybreak announcement

It is delivered with stronger identity verification, monitoring, scoped controls, and review, so it can be used for defensive purposes only while preventing abuse for attacks. In short, the model's basic stance pairs high capability with cautious distribution — "offensive performance with defensive governance."

Free ToolURL to Markdown ConverterConvert any public web page URL to Markdown. Preserves headings, tables, lists, and links — perfect for LLM and RAG preprocessing, research notes, and archiving web articles.Try it now →

Inside Patch the Planet and Codex Security

The Patch the Planet flow (from finding to landing the fix)

1. Researchers find vulnerabilities with AI (GPT-5.5-Cyber and Codex Security)
2. Trail of Bits experts manually validate, deduplicate, and reassess severity for every finding
3. Confirmed vulnerabilities and patches are submitted to maintainers
4. Maintainers decide on applying fixes and on disclosure (they stay in control)

Daybreak's "all the way to a fix" is supported by the developer-facing Codex Security and the open-source Patch the Planet. Let's look at what each is and the early results already emerging.

What Codex Security can do (over 30 million commits scanned)

Codex Security was built around the idea of placing a security engineer next to every developer. Rather than just raising alerts, it understands a team's code and threat model, identifies vulnerability candidates, judges reachability, gathers validation evidence, develops a targeted patch, and verifies the result. Humans stay in control of which findings to investigate and which changes to apply.

Its scale is already large. Since the research preview began in March 2026, Codex Security has scanned more than 30 million commits across over 30,000 codebases. Human reviewers have marked more than 70,000 findings as fixed, and more than 500,000 have been automatically determined to be fixed.

View official source →
"Since launching Codex Security cloud in research preview in March, it has scanned over 30 million commits across more than 30,000 codebases; human reviewers have manually marked more than 70,000 findings as fixed, and over 500,000 findings have automatically been determined to be fixed." — From the Daybreak announcement

How Patch the Planet works (human review at its core)

Patch the Planet was founded with the security firm Trail of Bits, in collaboration with HackerOne and Calif. It equips expert researchers with Codex Security and high-capability models to work directly with open-source maintainers. The core is that human experts always validate AI-found vulnerabilities before they reach maintainers, so false positives don't exhaust maintainers further.

View official source →
"Trail of Bits engineers manually reviewed every security issue before it was submitted to a maintainer, and the added value of this step cannot be understated." — From the Patch the Planet announcement

Behind this design is a stark reality. Citing research from the Linux Foundation and Harvard, OpenAI notes that in 94% of the widely used projects studied, fewer than ten developers were responsible for more than 90% of the code added in a year. The more AI increases reports, the heavier the load on maintainers, so a system that delivers only validated, reliable findings is needed.

Participating projects and early results (a five-day sprint)

Participation is spreading. With cURL, Go, Python, Sigstore, and pyca/cryptography among the initial participants, more than 30 open-source projects have committed to take part. They span networking, cryptography, supply chain, and language infrastructure — foundations that affect many downstream products.

View official source →
"More than 30 open-source projects have committed to participate, with initial participants including cURL, Go, Python, Sigstore, and pyca/cryptography." — From the Daybreak announcement

Results are starting to appear. Trail of Bits worked full-time across 19 open-source projects, already identifying hundreds of security issues and merging dozens of patches. The first five-day sprint also produced reusable infrastructure, such as fuzzing setups (which probe for flaws with random input) and pipelines that start from past CVEs (public vulnerability identifiers) to hunt for similar flaws. Participating projects receive ChatGPT Pro, conditional access to Codex Security, and API credits.

Vulnerabilities Daybreak Found and Partner Collaboration

Major vulnerabilities Daybreak reported (representative examples by layer)

OS (foundation)Auto-generated PoCs for info leaks and privilege escalation in the Linux kernel; a 23-year-old flaw found in OpenBSD
NetworkIdentified flaws matching several dnsmasq CVEs; found a large-scale HTTP/2 denial-of-service technique
BrowsersReported 5 in Chrome's V8, 10+ in Safari's WebKit, and 1 in Firefox

Daybreak's value shows not only in benchmark performance but in a real track record of finding vulnerabilities in widely used software. Let's go through concrete examples, the partners and national collaborations behind them, and what this means for general readers.

Real examples found in OS, network, and browsers

The published examples span every layer of software. GPT-5.5-Cyber analyzed more than 30 million lines of code in the Linux kernel and auto-generated 8 information-leak PoCs and 24 privilege-escalation exploits.

View official source →
"GPT-5.5-Cyber identified security-relevant components across more than 30 million lines of code, flagged potential security issues, and then validated them dynamically, generating 8 kernel pointer information leak proof-of-concepts (PoCs) and 24 local privilege escalation exploits." — From the Patch the Planet announcement

Lining up the main findings by layer shows how broad the reach is.

LayerTargetWhat was found
OSLinux kernelAuto-generated 8 info-leak PoCs and 24 privilege-escalation exploits
OSOpenBSDFound a 23-year-old use-after-free flaw
OSFreeBSDConfirmed 34 vulnerabilities and produced 7 privilege-escalation PoCs
NetworkdnsmasqCodex Security independently identified patterns matching 4 of the 6 later-fixed CVEs
NetworkHTTP/2Found a large-scale denial-of-service (DoS) technique affecting major implementations
BrowsersChrome (V8)Reported 5 exploitable bugs; 3 fixed within days of being introduced
BrowsersSafari (WebKit)Found and reported 10+ exploitable bugs in about a week
BrowsersFirefoxIdentified a WebAssembly vulnerability, patched two days before a competition

The network-layer HTTP/2 finding is especially far-reaching: analysis suggested that more than 880,000 Internet-facing websites were running affected server software with HTTP/2 enabled.

View official source →
"Calif's analysis suggested that more than 880,000 Internet-facing websites were running affected server software with HTTP/2 enabled." — From the Patch the Planet announcement

Partner companies and collaboration with governments and critical infrastructure

Daybreak is also advancing partnerships with security companies. Through the Daybreak Cyber Partner Program, participating partners can use defensive models inside their own products and services, so their customers benefit while direct model access stays in the partners' hands. The official partner list includes product partners such as CrowdStrike, Palo Alto Networks, Cisco, IBM, Okta, Cloudflare, Fortinet, and Zscaler, alongside service firms (GSI partners) such as Accenture, EY, KPMG, and PwC.

View official source →
"Daybreak brings OpenAI together with cybersecurity companies and service providers to develop practical, governed solutions for defenders." — From the Become a Daybreak partner page

Collaboration with governments and critical infrastructure is expanding too. OpenAI says that over the past month it has established cyber partnerships with Australia, Canada, France, Germany, Japan, and South Korea, as well as EU institutions such as ENISA. Japan is part of this framework, so it is not unrelated to domestic security trends.

View official source →
"In the past month we have already established Trusted Access for Cyber partnerships with Australia, Canada, France, Germany, Japan, Republic of Korea, and EU institutions like ENISA." — From the Daybreak announcement

What it means for general AI users, and cautions

Worth keeping in mind is that Daybreak is not a product general users adopt directly. GPT-5.5-Cyber is narrowed to verified defenders, a different user base from everyday ChatGPT use or ordinary AI work. Even so, it is no small thing that the security of the browsers and server software you use every day is being raised by efforts like this.

For anyone using AI in practice, since the same technology can serve both attack and defense, care with the information you input still matters. If you use AI to read through documents at hand, the practical guide to local LLMs is also a useful reference.

Summary of OpenAI Daybreak

OpenAI Daybreak pushes AI's role from "finding" vulnerabilities to "fixing" them. Centered on the capable, dedicated GPT-5.5-Cyber model, it combines the developer-facing Codex Security, the open-source Patch the Planet, and collaboration with security companies and governments to spread defensive capability widely. It is not a product general users will touch right away, but it is worth noting as a movement that underpins, from below, the safety of the software we use every day.

To keep following the latest model news, see the Claude Opus 4.8 release explainer as well. Specifications and availability will keep changing, so it is worth checking the official pages for the latest before you rely on this.

Official announcements and security advisories are often long English documents, and there are times you want AI to organize the key points. Converting a web page into Markdown first helps keep heading and table structure intact, which tends to improve accuracy.

Free ToolURL to Markdown ConverterConvert any public web page URL to Markdown. Preserves headings, tables, lists, and links — perfect for LLM and RAG preprocessing, research notes, and archiving web articles.Try it now →

FAQ

Q. What is OpenAI Daybreak?
It is OpenAI's cybersecurity initiative that bundles AI models, Codex Security, partners, and open-source support to accelerate not only finding vulnerabilities but also fixing them. An expansion was announced on June 22, 2026.
We're expanding Daybreak to help democratize patching vulnerable software at machine speed. OpenAI official (Daybreak)
Q. How capable is GPT-5.5-Cyber?
It scored 85.6% on CyberGym, which measures reproducing known vulnerabilities, beating the general-purpose GPT-5.5 (81.8%). OpenAI calls it the highest single-model score it has measured.
This model sets new state-of-the-art performance on CyberGym, reaching 85.6% compared with 81.8% for GPT-5.5. OpenAI official (Daybreak)
Q. Can anyone use GPT-5.5-Cyber?
No. It is not publicly available and is offered only to verified defenders whose authorized work requires it. Most users are pointed to GPT-5.5 and Codex Security as the starting point.
GPT-5.5-Cyber is intended for verified defenders whose authorized work requires our most advanced cyber capabilities and more permissive behavior, paired with stronger verification, monitoring, scoped controls, and review. OpenAI official (Daybreak)
Q. What is Patch the Planet?
An initiative to support open-source maintainers, validating vulnerabilities found with AI and carrying them through to fixes. More than 30 projects have committed to take part.
More than 30 open-source projects have committed to participate, with initial participants including cURL, Go, Python, Sigstore, and pyca/cryptography. OpenAI official (Daybreak)
Q. Has Daybreak actually found vulnerabilities?
Yes. It has a published track record of identifying and validating vulnerabilities in widely used software such as the Linux kernel, OpenBSD, Chrome, Safari, and Firefox.
GPT-5.5-Cyber identified security-relevant components across more than 30 million lines of code, flagged potential security issues, and then validated them dynamically, generating 8 kernel pointer information leak proof-of-concepts (PoCs) and 24 local privilege escalation exploits. OpenAI official (Patch the Planet)
Q. Is Japan involved with Daybreak?
Yes. OpenAI says it has established 'Trusted Access for Cyber' partnerships over the past month with several countries and regions, including Japan.
we have already established Trusted Access for Cyber partnerships with Australia, Canada, France, Germany, Japan, Republic of Korea, and EU institutions like ENISA. OpenAI official (Daybreak)

Related Tools

Related Tool Categories

Articles